• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Sun Hui, Lou Bendong, Huang Jianzhong, Zhao Yuhong, Fu Song. Near-Data Processing-Based Parallel Compaction Optimization for Key-Value Stores[J]. Journal of Computer Research and Development, 2022, 59(3): 597-616. DOI: 10.7544/issn1000-1239.20210577
Citation: Sun Hui, Lou Bendong, Huang Jianzhong, Zhao Yuhong, Fu Song. Near-Data Processing-Based Parallel Compaction Optimization for Key-Value Stores[J]. Journal of Computer Research and Development, 2022, 59(3): 597-616. DOI: 10.7544/issn1000-1239.20210577

Near-Data Processing-Based Parallel Compaction Optimization for Key-Value Stores

Funds: This work was supported by the University Synergy Innovation Program of Anhui Province (GXXT-2019-007), the State Key Laboratory of Computer Architecture (ICT, CAS) (CARCH201915), and the National Natural Science Foundation of China (62072001, 61702004, 61572209).
More Information
  • Published Date: February 28, 2022
  • Large-scale unstructured data management brings unprecedented challenges to existing relational databases. The log-structured merge tree (LSM-tree) based key-value store has been widely used and plays an essential role in data-intensive applications. The LSM-tree can convert random-write operations into sequential ones, thereby improving write performance. However, the LSM-tree key-value storage system also has some problems. First, the key-value storage system uses compaction operations to update data to balance system performance, but it impacts system performance and causes serious write amplification. Second, the traditional computing-centric data transmission also limits the overall system performance in compaction. This paper applied the data-centric near-data processing (NDP) model in the storage system. We propose a collaborative parallel compaction optimization for LSM-tree key-value stores named CoPro. The two parallel (i.e., data and pipeline parallelism) are fully utilized to improve compaction performance. When the compaction is triggered, the host-side CoPro determines the partitioning ratio of the compaction tasks according to the offloading strategy and divides tasks according to the ratio. Then, compaction subtasks are offloaded to the host and device sides, respectively, through the semantic management module. We design a decision component in the host-side and device-side CoPro, which is remarked as CoPro+. CoPro+ can dynamically adjust the parallelism according to changes in the resource of system and the value of key-value pairs in workloads. Extensive experimental results validate the benefits of CoPro compared with two popular NDP-based key-value stores.
  • Cited by

    Periodical cited type(1)

    1. 李赫洋. 智能存算融合系统研究进展与发展趋势. 舰船电子工程. 2023(12): 24-32 .

    Other cited types(7)

Catalog

    Article views (326) PDF downloads (119) Cited by(8)
    Turn off MathJax
    Article Contents

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return