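• China Outstanding Science and Technology Journal
• CCF-recommended Class A Chinese-language journal
• T1-class high-quality science and technology journal in the computing field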

SW-IntraCC: A Collective Communication Mechanism for the Internals of Sunway AI Acceleration
Zhao Yulong, Gu Yanqing, Tian Songtao, Wu Chunzhi, Tang Lingtao, Zhang Lufei, Qin Xiaojun, Liu Xin, Chen Zuoning
2025, 62(6): 1333-1346. DOI: 10.7544/issn1000-1239.202550143
Large-Scale Exact Diagonalization Methods for the New Generation of Tianhe Supercomputing Systems
Li Biao, Liu Jie, Wang Qinglin
2025, 62(6): 1347-1362. DOI: 10.7544/issn1000-1239.202550150
Performance Modeling and Optimization for Large-Scale Heterogeneous Consistency Integrated Computing System
Li Rengang, Tang Yinan, Guo Zhenhua, Wang Li, Zong Zan, Yang Guangwen
2025, 62(6): 1363-1379. DOI: 10.7544/issn1000-1239.202550120
Resilio: An Elastic Fault-tolerant Training System for Large Language Models
Li Yan, Yang Sile, Liu Chengchun, Wang Linmei, Tian Yaolin, Zhang Xinhang, Zhu Yu, Li Chunpu, Sun Lei, Yan Shengen, Xiao Limin, Zhang Weifeng
2025, 62(6): 1380-1395. DOI: 10.7544/issn1000-1239.202550147
Samples Dispatching Mechanism for Accelerating Recommendation Model Training in Edge Intelligent Computing System
Li Guopeng, Tan Haisheng, Zhang Chi, Ni Hongqiu, Wang Zilong, Zhang Xinyue, Xu Yang, Tian Han, Chen Guoliang
2025, 62(6): 1396-1415. DOI: 10.7544/issn1000-1239.202550128
Synergistic Optimization Method for Adaptive Hierarchical Federated Learning in Heterogeneous Edge Environments
Feng Yiming, Qian Zhen, Li Guanghui, Dai Chenglong
2025, 62(6): 1416-1433. DOI: 10.7544/issn1000-1239.202550146
Optimizing Sequences of Sparse Matrix-Vector Multiplications via Cache Data Reuse
Xu Chuanfu, Qiu Haozhong, Che Yonggang
2025, 62(6): 1434-1442. DOI: 10.7544/issn1000-1239.202550125
SparseMode: A Sparse Compiler Framework for Efficient SpMV Vectorized Code Generation
Wang Haotian, Ding Yan, He Xianhao, Xiao Guoqing, Yang Wangdong
2025, 62(6): 1443-1454. DOI: 10.7544/issn1000-1239.202550139
Multi-Slave Core Assisted Parallel Composition Algorithm for Sequential Task Flows on the SW39000 Processor
Fu You, Jia Shuhui, Chen Li, Hua Rong, Du Yunlong, Gao Xiran
2025, 62(6): 1455-1468. DOI: 10.7544/issn1000-1239.202550166
Optimizing Cross-Architecture Programming Model Adaptation in SIMD-to-RVV Dynamic Binary Translation
Lai Yuanming, Li Yalong, Hu Hanzhi, Xie Mengyao, Wang Zhe, Wu Chenggang
2025, 62(6): 1469-1491. DOI: 10.7544/issn1000-1239.202550135
Yingtian-Lake: A Wafer-Scale General-Purpose Heterogeneous Multi-chiplet Petascale Computer
Dong Wenkuo, Yin Chunsuo, Zhang Zhimeng, Wang Pengchao, Sha Jiang, Wang Mengya, Zhu Minqi, Liu Hongwei, Liu Yuhang, Hao Qinfen
2025, 62(6): 1492-1512. DOI: 10.7544/issn1000-1239.202550163
Pipe-RLHF: A Computation Mode-Aware Parallel Framework for RLHF
Xu Ying, Wang Mengdi, Cheng Long, Liu Lian, Zhao Shixin, Zhang Lei, Wang Ying
2025, 62(6): 1513-1529. DOI: 10.7544/issn1000-1239.202550127
DAQ: Divide-and-Conquer Strategy Based Adaptive Low-Bit Quantization Method for Vision Transformer
Lü Qianru, Xu Jinwei, Jiang Jingfei, Li Dongsheng
2025, 62(6): 1530-1546. DOI: 10.7544/issn1000-1239.202550145
NTT Butterfly Arithmetic Acceleration Based on Dataflow Architecture
Shi Hongbo, Fan Zhihua, Li Wenming, Zhang Zhiyuan, Mu Yudong, Ye Xiaochun, An Xuejun
2025, 62(6): 1547-1561. DOI: 10.7544/issn1000-1239.202550160
BeeZip2: A Domain-Specific Accelerator for High Performance Lossless Data Compression
Gao Ruihao, Shi Shunchen, Li Xueqi, Tan Guangming
2025, 62(6): 1562-1580. DOI: 10.7544/issn1000-1239.202550017
A Reconfigurable Single-Precision Approximate Floating-Point Multiplier Design
Li Pengcheng, Huang Libo, Chen Gang, Lai Mingche, Deng Lin, Liu Wei, Yang Qianming, Wang Yongwen
2025, 62(6): 1581-1593. DOI: 10.7544/issn1000-1239.202550116