一种面向大规模副本存储系统的可靠性模型

穆  飞  薛  巍  舒继武  郑纬民

一种面向大规模副本存储系统的可靠性模型

穆飞薛巍舒继武郑纬民

An Analytical Model for Large-Scale Storage System with Replicated Data

Mu Fei, Xue Wei, Shu Jiwu, and Zheng Weimin

摘要

摘要: 可靠性对大规模存储系统至关重要，在大规模存储系统中设备失效日趋频繁，副本技术成为提高系统可靠性的主流技术之一.基于Markov模型，针对多副本存储系统建立了度量系统可靠性的理论模型.该模型能够反应失效检测延迟对系统可靠性的影响.通过该模型还可以度量存储系统关键参数如系统规模、副本阶数、单节点容量、单节点平均失效时间、数据对象平均大小、平均修复带宽等对系统可靠性的影响，从而为存储系统的设计提供理论基础.

Abstract: Nowadays storage systems become larger and larger, so the number of storage devices is increasing rapidly, which makes storage device failure occur quite frequently in large scale storage systems. Data replica technology begins to be adopted prevalently to enhance storage system reliability. When designing a large scale storage system, there are many factors that could affect the reliability of the storage system, such as failure detection latency, storage node capacity selection, data object size design, replica rank selection and so on. On the other hand, system reliability can not be exactly experimented, so a theoretical model is needed to evaluate it. In this paper, an analytical framework is represented to evaluate the reliability for large scale storage systems which adopt replica technology to protect data. Based on the Markov model, this analytical model could provide quantitative answers to measure the impact of a series of storage system design factors on the reliability of storage systems, such as the rank of the replicated data, the capacity of the storage system, the capacity of storage nodes, the size of data object, the repair bandwidth, mean time failure detection latency and so on. Hence, many storage system design tradeoffs could be reasoned by this framework.

HTML全文

参考文献(0)

施引文献

资源附件(0)