• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
高级检索

基于双向位图的CSR大规模图存储优化

甘新标, 谭雯, 刘杰

甘新标, 谭雯, 刘杰. 基于双向位图的CSR大规模图存储优化[J]. 计算机研究与发展, 2021, 58(3): 458-466. DOI: 10.7544/issn1000-1239.2021.20200090
引用本文: 甘新标, 谭雯, 刘杰. 基于双向位图的CSR大规模图存储优化[J]. 计算机研究与发展, 2021, 58(3): 458-466. DOI: 10.7544/issn1000-1239.2021.20200090
Gan Xinbiao, Tan Wen, Liu Jie. Bidirectional-Bitmap Based CSR for Reducing Large-Scale Graph Space[J]. Journal of Computer Research and Development, 2021, 58(3): 458-466. DOI: 10.7544/issn1000-1239.2021.20200090
Citation: Gan Xinbiao, Tan Wen, Liu Jie. Bidirectional-Bitmap Based CSR for Reducing Large-Scale Graph Space[J]. Journal of Computer Research and Development, 2021, 58(3): 458-466. DOI: 10.7544/issn1000-1239.2021.20200090
甘新标, 谭雯, 刘杰. 基于双向位图的CSR大规模图存储优化[J]. 计算机研究与发展, 2021, 58(3): 458-466. CSTR: 32373.14.issn1000-1239.2021.20200090
引用本文: 甘新标, 谭雯, 刘杰. 基于双向位图的CSR大规模图存储优化[J]. 计算机研究与发展, 2021, 58(3): 458-466. CSTR: 32373.14.issn1000-1239.2021.20200090
Gan Xinbiao, Tan Wen, Liu Jie. Bidirectional-Bitmap Based CSR for Reducing Large-Scale Graph Space[J]. Journal of Computer Research and Development, 2021, 58(3): 458-466. CSTR: 32373.14.issn1000-1239.2021.20200090
Citation: Gan Xinbiao, Tan Wen, Liu Jie. Bidirectional-Bitmap Based CSR for Reducing Large-Scale Graph Space[J]. Journal of Computer Research and Development, 2021, 58(3): 458-466. CSTR: 32373.14.issn1000-1239.2021.20200090

基于双向位图的CSR大规模图存储优化

基金项目: 国家数值风洞项目(NNW2019ZT6-B21, NNW2019ZT6-B20, NNW2019ZT5-A10);国家重点研发计划项目(2018YFB0204301);湖南省自然科学基金项目(2020JJ4669);并行分布式处理实验室基金项目(6142110190206, 6142110180203)
详细信息
  • 中图分类号: TP391

Bidirectional-Bitmap Based CSR for Reducing Large-Scale Graph Space

Funds: This work was supported by the National Numerical Wind Tunnel Project (NNW2019ZT6-B21, NNW2019ZT6-B20, NNW2019ZT5-A10), the National Key Research and Development Program of China (2018YFB0204301), the Hunan Provincial Natural Science Foundation of China (2020JJ4669), and the Foundation of Parallel and Distributed Processing Laboratory (6142110190206, 6142110180203).
  • 摘要: 大数据时代, Graph500是评测超级计算机处理数据密集型应用能力的重要工具, E级验证系统的图遍历处理能力主要受限于内存空间和访存带宽, 尤其是内存空间利用率直接决定了图的测试规模和测试性能.针对天河E级验证系统小内存特征, 提出了基于双向位图的大规模图数据压缩存储方法(bidirectional-bitmap based CSR, Bi-CSR), Bi-CSR在CSR矩阵压缩的基础上引入行方向位图和列方向位图协同完成稀疏矩阵压缩存储, 行方向位图主要负责行方向位图的压缩存储与索引, 列方向位图除了进一步压缩图存储空间, 还负责为顶点遍历向量并行优化提供加速空间.Bi-CSR大幅度减少了稀疏矩阵存储空间.面向天河E级验证系统, 当图输入规模为2\+\{37\}时, Graph500的图存储空间节约效率接近70%, 全系统稳定测试性能为2.131E+12TEPS, 性能最大加速比超过100倍.
    Abstract: Graph500 is an important and famous benchmark to evaluate data-intensive applications for supercomputers in the big data era. The graph traversal processing ability of pre-exascale system is mainly restricted to memory space and communication bandwidth, especially the utilization of memory space ultimately determines the testing graph scale, and the graph testing scale absolutely dominates the performance of Graph500. Hence, Bi-CSR (bidirectional-bitmap CSR) is proposed based on CSR (compressed sparse row) for testing Graph500 on Tianhe pre-exascale system, The Bi-CSR would reduce large-scale graph space by introducing row-bitmap and column-bitmap to compress sparse matrix storage for Graph500.The aim of row-bitmap based on CSR is mainly cutting down graph memory space, while column-bitmap based on CSR would not only further reduce memory space but also improve graph traversal by using array of VPE(vector processing element), because VPEs are optimized and equipped in Tianhe pre-exascale system, which would speedup graph traversal when making fully use of VPEs. Accordingly, Bi-CSR would greatly reduce large-scale graph space while introducing row-bitmap and column-bitmap to compress sparse matrix storage of Graph500 for Tianhe pre-exascale system. Experimental results demonstrate that Bi-CSR would reduce large-scale graph space by 70% when testing input of Graph500 is 2\+\{37\} on Tianhe pre-exascale system with 2.131E+12 TEPS (traversed edges per second).
  • 期刊类型引用(5)

    1. 谢朝武,黄锐. 目的地旅游安全事件集群:概念框架与测度体系研究. 旅游学刊. 2023(05): 42-57 . 百度学术
    2. 严定宇,张宇鹏,陆希玉,曹华平. 对网络空间安全建模的系统思考. 网络安全与数据治理. 2023(12): 34-40 . 百度学术
    3. 刘小虎,张恒巍,马军强,张玉臣,谭晶磊. 基于攻防博弈的网络防御决策方法研究综述. 网络与信息安全学报. 2022(01): 1-14 . 百度学术
    4. 杨轶杰,朱广劼,司群,杨文. 铁路网络空间可视化实现路径分析. 铁路计算机应用. 2021(11): 15-20 . 百度学术
    5. 刘小虎,张恒巍,张玉臣,胡浩,程建. 基于博弈论的网络攻防行为建模与态势演化分析. 电子与信息学报. 2021(12): 3629-3638 . 百度学术

    其他类型引用(3)

计量
  • 文章访问数:  984
  • HTML全文浏览量:  3
  • PDF下载量:  379
  • 被引次数: 8
出版历程
  • 发布日期:  2021-02-28

目录

    /

    返回文章
    返回