Citation: Xu Chuanfu, Qiu Haozhong, Che Yonggang. Optimizing Sequences of Sparse Matrix-Vector Multiplications via Cache Data Reuse[J]. Journal of Computer Research and Development. DOI: 10.7544/issn1000-1239.202550125
Many high performance computing (HPC) applications, such as solving sparse linear systems, involve computing sequences of sparse matrix-vector multiplications (SpMV) of the form Ax, A^2x, …, A^sx, known as the sparse matrix power kernel (MPK). Since MPK calls SpMV repeatedly with the same sparse matrix, reusing the matrix elements in cache, instead of reloading them from main memory, can alleviate the memory bandwidth bottleneck of SpMV and enhance the performance of MPK. However, reusing the matrix introduces data dependencies between successive SpMVs. Prior work either focuses on optimizing an individual SpMV invocation or incurs significant overheads for cache data reuse in MPK. We propose a cache-aware MPK (Ca-MPK) that optimizes MPK via cache data reuse. Based on the dependency graph of a sparse matrix, an architecture-aware recursive partitioning of the graph is designed to obtain subgraphs/submatrices that fit into cache. Separating subgraphs (i.e., separators) are constructed to decouple the data dependencies among subgraphs. Cache data reuse is then achieved by executing the sequence of SpMVs on subgraphs in a specified order. Performance evaluation demonstrates that Ca-MPK outperforms the MPK implementations based on the Intel oneMKL library and the state-of-the-art approach, with average speedups of up to about 1.57 times and 1.40 times, respectively.
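To make the kernel being optimized concrete, the following is a minimal sketch (not the paper's Ca-MPK implementation) of the baseline matrix power kernel: s back-to-back SpMVs on a matrix stored in CSR format. In this naive form, each SpMV streams the entire matrix from memory, which is exactly the traffic that cache-aware subgraph scheduling aims to avoid. The function names and the small 3×3 example matrix are illustrative only.

```python
import numpy as np

def spmv_csr(indptr, indices, data, x):
    # One sparse matrix-vector product y = A @ x, with A in CSR format:
    # row i owns the nonzeros data[indptr[i]:indptr[i+1]].
    n = len(indptr) - 1
    y = np.zeros(n)
    for i in range(n):
        for k in range(indptr[i], indptr[i + 1]):
            y[i] += data[k] * x[indices[k]]
    return y

def mpk_baseline(indptr, indices, data, x, s):
    # Matrix power kernel: return [A x, A^2 x, ..., A^s x].
    # Each iteration re-reads the whole matrix from memory; a cache-aware
    # scheme instead keeps cache-sized submatrices resident across powers.
    results = []
    v = x
    for _ in range(s):
        v = spmv_csr(indptr, indices, data, v)
        results.append(v)
    return results

# Illustrative 3x3 symmetric matrix A = [[2,0,1],[0,3,0],[1,0,2]] in CSR.
indptr = np.array([0, 2, 3, 5])
indices = np.array([0, 2, 1, 0, 2])
data = np.array([2.0, 1.0, 3.0, 1.0, 2.0])
x = np.ones(3)
powers = mpk_baseline(indptr, indices, data, x, 2)  # [A x, A^2 x]
```

Reusing the matrix across the s products is only legal once the inter-SpMV dependencies are handled, which is what the recursive partitioning and separators described above provide.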
[1] Zhang Yichen, Li Shengguo, Yuan Fan, et al. Memory-aware optimization for sequences of sparse matrix-vector multiplications[C] //Proc of 2023 IEEE Int Parallel and Distributed Processing Symp (IPDPS). Piscataway, NJ: IEEE, 2023: 379−389
[2] Alappat C, Hager G, Schenk O, et al. Level-based blocking for sparse matrices: Sparse matrix-power-vector multiplication[J]. IEEE Transactions on Parallel and Distributed Systems, 2022, 34(2): 581−597
[3] Gurhem J, Vandromme M, Tsuji M, et al. Sequences of sparse matrix-vector multiplication on Fugaku's A64FX processors[C] //Proc of 2021 IEEE Int Conf on Cluster Computing (CLUSTER). Piscataway, NJ: IEEE, 2021: 751−758
[4] Saad Y. Numerical Methods for Large Eigenvalue Problems: Revised Edition[M]. Philadelphia, PA: SIAM, 2011
[5] Filippone S, Cardellini V, Barbieri D, et al. Sparse matrix-vector multiplication on GPGPUs[J]. ACM Transactions on Mathematical Software, 2017, 43(4): 1−49
[6] Barrett R, Berry M, Chan T F, et al. Templates for the Solution of Linear Systems: Building Blocks for Iterative Methods[M]. Philadelphia, PA: SIAM, 1994
[7] Vuduc R W. Automatic Performance Tuning of Sparse Matrix Kernels[M]. Berkeley, CA: University of California, 2003
[8] Buluç A, Fineman J T, Frigo M, et al. Parallel sparse matrix-vector and matrix-transpose-vector multiplication using compressed sparse blocks[C] //Proc of the 21st Annual Symp on Parallelism in Algorithms and Architectures. New York: ACM, 2009: 233−244
[9] Demmel J, Hoemmen M, Mohiyuddin M, et al. Avoiding communication in computing Krylov subspaces[R]. Berkeley, CA: University of California, 2010
[10] Hoemmen M. Communication-Avoiding Krylov Subspace Methods[M]. Berkeley, CA: University of California, 2007
[11] Carson E C. Communication-Avoiding Krylov Subspace Methods in Theory and Practice[M]. Berkeley, CA: University of California, 2015
[12] Dongarra J, Tomov S, Luszczek P, et al. With extreme computing, the rules have changed[J]. Computing in Science & Engineering, 2017, 19(3): 52−62
[13] Mohiyuddin M, Hoemmen M, Demmel J, et al. Minimizing communication in sparse matrix solvers[C] //Proc of the Conf on High Performance Computing Networking, Storage and Analysis. New York: ACM, 2009: 1−12
[14] Morlan J, Kamil S, Fox A. Auto-tuning the matrix powers kernel with SEJITS[C] //Proc of the 10th Int Conf on High Performance Computing for Computational Science (VECPAR 2012). Berlin: Springer, 2013: 391−403
[15] Muranushi T, Makino J. Optimal temporal blocking for stencil computation[J]. Procedia Computer Science, 2015, 51(1): 1303−1312
[16] Huber D, Schreiber M, Schulz M. Graph-based multi-core higher-order time integration of linear autonomous partial differential equations[J]. Journal of Computational Science, 2021, 53(4): 101−139
[17] Alappat C, Basermann A, Bishop A R, et al. A recursive algebraic coloring technique for hardware-efficient symmetric sparse matrix-vector multiplication[J]. ACM Transactions on Parallel Computing, 2020, 7(3): 1−37
[18] Qiu Haozhong, Xu Chuanfu, Fang Jianbin, et al. Towards scalable unstructured mesh computations on shared memory many-cores[C] //Proc of the 29th ACM SIGPLAN Annual Symp on Principles and Practice of Parallel Programming. New York: ACM, 2024: 109−119
[19] Naumov M. S-step and communication-avoiding iterative methods[R]. Santa Clara, CA: NVIDIA, 2016
[20] Karypis G, Kumar V. A fast and high quality multilevel scheme for partitioning irregular graphs[J]. SIAM Journal on Scientific Computing, 1998, 20(1): 359−392. doi: 10.1137/S1064827595287997
[21] Intel Corporation. Intel oneAPI math kernel library (oneMKL)[EB/OL]. Santa Clara, CA: Intel Corporation, 2024[2024-03-25]. https://www.intel.com/content/www/us/en/developer/tools/oneapi/onemkl.html
[22] Davis T A, Hu Y. The University of Florida sparse matrix collection[J]. ACM Transactions on Mathematical Software, 2011, 38(1): 1−25
[23] Treibig J, Hager G, Wellein G. LIKWID: A lightweight performance-oriented tool suite for x86 multicore environments[C] //Proc of the 39th Int Conf on Parallel Processing Workshops. Piscataway, NJ: IEEE, 2010: 207−216
|