• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Zhang Zilin, Liu Duo, Tan Yujuan, Wu Yu, Luo Longpan, Wang Weilüe, Qiao Lei. An Erasure-Coded Data Update Method for Distributed Storage Clusters[J]. Journal of Computer Research and Development, 2022, 59(11): 2451-2466. DOI: 10.7544/issn1000-1239.20210211
Citation: Zhang Zilin, Liu Duo, Tan Yujuan, Wu Yu, Luo Longpan, Wang Weilüe, Qiao Lei. An Erasure-Coded Data Update Method for Distributed Storage Clusters[J]. Journal of Computer Research and Development, 2022, 59(11): 2451-2466. DOI: 10.7544/issn1000-1239.20210211

An Erasure-Coded Data Update Method for Distributed Storage Clusters

Funds: This work was supported by the National Natural Science Foundation of China (62072059) and the Funds for Chongqing Distinguished Young Scholars (cstc2020jcyj-jqX0012).
More Information
  • Published Date: October 31, 2022
  • Erasure coding is widely deployed in distributed storage clusters to provide data reliability, but the disk I/O overhead becomes a performance bottleneck when data updates are intensive. On the one hand, traditional data update strategies need to read the original data chunk, and then write new data when updating the data chunk. In the case of intensive updates, frequent write-after-read seriously affects the write performance of the storage clusters. On the other hand, the operations of updating the parity chunk include reading the increments randomly distributed in the log file and merging them with the data file, which also introduces additional disk seek overhead. In this paper, a data updating method, named PARD (parity logging with reserved space and data delta), is proposed to solve these problems. The main idea of PARD is to use the linear calculations of erasure coding to reduce write-after-read, and take advantage of the disk characteristics to reduce the disk seek overhead. PARD comprises three key design features: 1) Adopting in-place data updates and log-based parity updates. 2) Taking advantage of the linear calculations of erasure coding to construct the log based on data increments. For a series of write requests to the same data chunk, only the first update needs to read the original data chunk, and the subsequent update executes the pure write, which remarkably reduces the write-after-read. 3) According to the characteristics of disk, reserving space for the log at the end of data file to reduce the disk seek overhead of reading and writing log. Experiments show that when the chunk size is 4 MB, PARD gains at least, 30.4%, 47.0% and 82.0% improvements in update throughput compared with PLR, PARIX, and FO, respectively.
  • Related Articles

    [1]Wang Zirui, Jiang Dejun. Key Techniques of Swapping Mechanism Based on Ultra-Low Latency SSD[J]. Journal of Computer Research and Development, 2024, 61(3): 557-570. DOI: 10.7544/issn1000-1239.202330538
    [2]Wu Xian, Tang Hongbo, Zhao Yu, Xu Mingyan. A Cross-Cluster Real-Time Migration Method for Stateful Container[J]. Journal of Computer Research and Development, 2024, 61(2): 494-502. DOI: 10.7544/issn1000-1239.202220602
    [3]Wang Zihan, Zhang Jiao, Zhang Yuan, Pan Tian, Huang Tao. A Transport Control Protocol for Low Earth Orbit Satellite Networks Based on Link Information Estimation[J]. Journal of Computer Research and Development, 2023, 60(8): 1846-1857. DOI: 10.7544/issn1000-1239.202220299
    [4]Fu Wenwen, Liu Rulin, Quan Wei, Jiang Xuyan, Sun Zhigang. nPSA:A Low-Latency, Deterministic Switching Architecture for TSN Chips[J]. Journal of Computer Research and Development, 2023, 60(6): 1322-1336. DOI: 10.7544/issn1000-1239.202111205
    [5]Fan Zhihua, Wu Xinxin, Li Wenming, Cao Huawei, An Xuejun, Ye Xiaochun, Fan Dongrui. Dataflow Architecture Optimization for Low-Precision Neural Networks[J]. Journal of Computer Research and Development, 2023, 60(1): 43-58. DOI: 10.7544/issn1000-1239.202111275
    [6]Liao Xiaojian, Yang Zhe, Yang Hongzhang, Tu Yaofeng, Shu Jiwu. A Low-Latency Storage Engine with Low CPU Overhead[J]. Journal of Computer Research and Development, 2022, 59(3): 489-498. DOI: 10.7544/issn1000-1239.20210574
    [7]Fu Yongquan, Li Dongsheng. Application Driven Network Latency Measurement Analysis and Optimization Techniques Edge Computing Environment: A Survey[J]. Journal of Computer Research and Development, 2018, 55(3): 512-523. DOI: 10.7544/issn1000-1239.2018.20170793
    [8]Gong Haigang, Yu Changyuan, Liu Ming, Yi Fasheng, Wang Xiaomin, Chen Lijun. A Self-Adaptive, Energy-Efficient Low Latency MAC Protocol for Wireless Sensor Network[J]. Journal of Computer Research and Development, 2007, 44(11): 1866-1872.
    [9]Li Yong, Wang Zhiying, Zhao Xuemi, and Yue Hong. Design of Application Specific Instruction-Set Processors Directed by Configuration Stream Driven Computing Architecture[J]. Journal of Computer Research and Development, 2007, 44(4): 714-721.
    [10]Zhang Yujun, Li Zhongcheng, Zheng Hongxia, Tian Ye, and Sun Jingbo. Transport Mechanism Applied to Computer Network Protocol Conformance Testing[J]. Journal of Computer Research and Development, 2005, 42(1): 102-108.
  • Cited by

    Periodical cited type(1)

    1. 郑章财,徐锋. 嵌入式服务器软件接口通信容量调节算法仿真. 计算机仿真. 2024(04): 265-269 .

    Other cited types(0)

Catalog

    Article views (118) PDF downloads (59) Cited by(1)

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return