• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
高级检索

纠删码存储系统数据更新方法研究综述

张耀, 储佳佳, 翁楚良

张耀, 储佳佳, 翁楚良. 纠删码存储系统数据更新方法研究综述[J]. 计算机研究与发展, 2020, 57(11): 2419-2431. DOI: 10.7544/issn1000-1239.2020.20190675
引用本文: 张耀, 储佳佳, 翁楚良. 纠删码存储系统数据更新方法研究综述[J]. 计算机研究与发展, 2020, 57(11): 2419-2431. DOI: 10.7544/issn1000-1239.2020.20190675
Zhang Yao, Chu Jiajia, Weng Chuliang. Survey on Data Updating in Erasure-Coded Storage Systems[J]. Journal of Computer Research and Development, 2020, 57(11): 2419-2431. DOI: 10.7544/issn1000-1239.2020.20190675
Citation: Zhang Yao, Chu Jiajia, Weng Chuliang. Survey on Data Updating in Erasure-Coded Storage Systems[J]. Journal of Computer Research and Development, 2020, 57(11): 2419-2431. DOI: 10.7544/issn1000-1239.2020.20190675
张耀, 储佳佳, 翁楚良. 纠删码存储系统数据更新方法研究综述[J]. 计算机研究与发展, 2020, 57(11): 2419-2431. CSTR: 32373.14.issn1000-1239.2020.20190675
引用本文: 张耀, 储佳佳, 翁楚良. 纠删码存储系统数据更新方法研究综述[J]. 计算机研究与发展, 2020, 57(11): 2419-2431. CSTR: 32373.14.issn1000-1239.2020.20190675
Zhang Yao, Chu Jiajia, Weng Chuliang. Survey on Data Updating in Erasure-Coded Storage Systems[J]. Journal of Computer Research and Development, 2020, 57(11): 2419-2431. CSTR: 32373.14.issn1000-1239.2020.20190675
Citation: Zhang Yao, Chu Jiajia, Weng Chuliang. Survey on Data Updating in Erasure-Coded Storage Systems[J]. Journal of Computer Research and Development, 2020, 57(11): 2419-2431. CSTR: 32373.14.issn1000-1239.2020.20190675

纠删码存储系统数据更新方法研究综述

基金项目: 国家自然科学基金项目(61772204,61732014)
详细信息
  • 中图分类号: TP309.3

Survey on Data Updating in Erasure-Coded Storage Systems

Funds: This work was supported by the National Natural Science Foundation of China (61772204, 61732014).
  • 摘要: 在分布式存储系统中,节点故障已成为一种常态,为了保证数据的高可用性,系统通常采用数据冗余的方式.目前主要有2种冗余机制:一种是多副本,另一种是纠删码.伴随着数据量的与日俱增,多副本机制带来的效益越来越低,人们逐渐将目光转向存储效率更高的纠删码.但是纠删码本身的复杂规则导致使用纠删码的分布式存储系统的读、写、更新操作的开销相比于多副本较大.所以纠删码通常被用于冷数据或者温数据的存储,热数据这种需要频繁访问更新的场景仍然用多副本机制存储.专注于纠删码存储系统内的数据更新,从硬盘I/O、网络传输、系统优化3方面综述了目前纠删码更新相关的优化工作,对目前具有代表性的编码方案的更新性能做了对比分析,最后展望了未来研究趋势.通过分析发现目前的纠删码更新方案仍然无法获得和多副本相近的更新性能.如何在纠删码更新规则和系统架构角度优化纠删码存储系统,使其能够替换掉热数据场景下的多副本机制,降低热数据存储开销仍是未来值得深入研究的问题.
    Abstract: In a distributed storage system, node failure has become a normal state. In order to ensure high availability of data, the system usually adopts data redundancy. At present, there are mainly two kinds of redundancy mechanisms. One is multiple replications, and the other is erasure coding. With the increasing amount of data, the benefits of the multi-copy mechanism are getting lower and lower, and people are turning their attention to erasure codes with higher storage efficiency. However, the complicated rules of the erasure coding itself cause the overhead of the read, write, and update operations of the distributed storage systems using the erasure coding to be larger than that of the multiple copies. Therefore, erasure coding is usually used for cold data or warm data storage. Hot data, which requires frequent access and update, is still stored in multiple copies. This paper focuses on the data update in erasure-coded storage systems, summarizes the current optimization work related to erasure coding update from the aspects of hard disk I/O, network transmission and system optimization, makes a comparative analysis on the update performance of representative coding schemes at present, and finally looks forward to the future research trends. Through analysis, it is concluded that the current erasure coding update schemes still cannot obtain the update performance similar to that of multiple copies. How to optimize the erasure-coded storage system in the context of erasure coding update rules and system architecture, so that it can replace the multi-copy mechanism under the hot data scenario, and reducing the hot data storage overhead is still a problem worthy of further study in the future.
  • 期刊类型引用(7)

    1. 杨秀璋,武帅,宋籍文,廖文婧,任天舒,刘建义. 基于LDA和关系图谱的数据治理文献主题演化研究. 信息技术与信息化. 2022(08): 6-12 . 百度学术
    2. 黄飞杰,张卫东,侯石鹏,宋红文. 基于GSP算法的卷烟消费者研究. 信息与电脑(理论版). 2022(16): 58-60 . 百度学术
    3. 张瑾,朱桂祥,王宇琛,郑烁佳,陈镜潞. 基于异质图表达学习的跨境电商推荐模型. 电子与信息学报. 2022(11): 4008-4017 . 百度学术
    4. 冯晨娇,宋鹏,王智强,梁吉业. 一种基于3因素概率图模型的长尾推荐方法. 计算机研究与发展. 2021(09): 1975-1986 . 本站查看
    5. 牛俊洁,崔忠伟,赵晨洁,王永金,吴恋. 个性化旅游推荐技术研究及发展综述. 物联网技术. 2020(03): 86-88+91 . 百度学术
    6. 史亚奇. 基于人性化特征的旅游地智能推荐系统. 现代电子技术. 2020(11): 183-186 . 百度学术
    7. 张如花,屈正庚. 基于AHP的旅游网站评价体系研究. 甘肃科学学报. 2019(05): 32-36 . 百度学术

    其他类型引用(11)

计量
  • 文章访问数:  971
  • HTML全文浏览量:  4
  • PDF下载量:  487
  • 被引次数: 18
出版历程
  • 发布日期:  2020-10-31

目录

    /

    返回文章
    返回