    SW-IntraCC: A Collective Communication Mechanism for the Internals of Sunway AI Acceleration
    Zhao Yulong, Gu Yanqing, Tian Songtao, Wu Chunzhi, Tang Lingtao, Zhang Lufei, Qin Xiaojun, Liu Xin, Chen Zuoning
    2025, 62(6): 1333-1346. DOI: 10.7544/issn1000-1239.202550143
    Large-Scale Exact Diagonalization Methods for the New Generation of Tianhe Supercomputing Systems
    Li Biao, Liu Jie, Wang Qinglin
    2025, 62(6): 1347-1362. DOI: 10.7544/issn1000-1239.202550150
    Performance Modeling and Optimization for Large-Scale Heterogeneous Consistency Integrated Computing System
    Li Rengang, Tang Yinan, Guo Zhenhua, Wang Li, Zong Zan, Yang Guangwen
    2025, 62(6): 1363-1379. DOI: 10.7544/issn1000-1239.202550120
    Resilio: An Elastic Fault-tolerant Training System for Large Language Models
    Li Yan, Yang Sile, Liu Chengchun, Wang Linmei, Tian Yaolin, Zhang Xinhang, Zhu Yu, Li Chunpu, Sun Lei, Yan Shengen, Xiao Limin, Zhang Weifeng
    2025, 62(6): 1380-1395. DOI: 10.7544/issn1000-1239.202550147
    Samples Dispatching Mechanism for Accelerating Recommendation Model Training in Edge Intelligent Computing System
    Li Guopeng, Tan Haisheng, Zhang Chi, Ni Hongqiu, Wang Zilong, Zhang Xinyue, Xu Yang, Tian Han, Chen Guoliang
    2025, 62(6): 1396-1415. DOI: 10.7544/issn1000-1239.202550128
    Synergistic Optimization Method for Adaptive Hierarchical Federated Learning in Heterogeneous Edge Environments
    Feng Yiming, Qian Zhen, Li Guanghui, Dai Chenglong
    2025, 62(6): 1416-1433. DOI: 10.7544/issn1000-1239.202550146
    Optimizing Sequences of Sparse Matrix-Vector Multiplications via Cache Data Reuse
    Xu Chuanfu, Qiu Haozhong, Che Yonggang
    2025, 62(6): 1434-1442. DOI: 10.7544/issn1000-1239.202550125
    SparseMode: A Sparse Compiler Framework for Efficient SpMV Vectorized Code Generation
    Wang Haotian, Ding Yan, He Xianhao, Xiao Guoqing, Yang Wangdong
    2025, 62(6): 1443-1454. DOI: 10.7544/issn1000-1239.202550139
    Multi-Slave Core Assisted Parallel Composition Algorithm for Sequential Task Flows on the SW39000 Processor
    Fu You, Jia Shuhui, Chen Li, Hua Rong, Du Yunlong, Gao Xiran
    2025, 62(6): 1455-1468. DOI: 10.7544/issn1000-1239.202550166
    Optimizing Cross-Architecture Programming Model Adaptation in SIMD-to-RVV Dynamic Binary Translation
    Lai Yuanming, Li Yalong, Hu Hanzhi, Xie Mengyao, Wang Zhe, Wu Chenggang
    2025, 62(6): 1469-1491. DOI: 10.7544/issn1000-1239.202550135
    Yingtian-Lake: A Wafer-Scale General-Purpose Heterogeneous Multi-chiplet Petascale Computer
    Dong Wenkuo, Yin Chunsuo, Zhang Zhimeng, Wang Pengchao, Sha Jiang, Wang Mengya, Zhu Minqi, Liu Hongwei, Liu Yuhang, Hao Qinfen
    2025, 62(6): 1492-1512. DOI: 10.7544/issn1000-1239.202550163
    Pipe-RLHF: A Computation Mode-Aware Parallel Framework for RLHF
    Xu Ying, Wang Mengdi, Cheng Long, Liu Lian, Zhao Shixin, Zhang Lei, Wang Ying
    2025, 62(6): 1513-1529. DOI: 10.7544/issn1000-1239.202550127
    DAQ: Divide-and-Conquer Strategy Based Adaptive Low-Bit Quantization Method for Vision Transformer
    Lü Qianru, Xu Jinwei, Jiang Jingfei, Li Dongsheng
    2025, 62(6): 1530-1546. DOI: 10.7544/issn1000-1239.202550145
    NTT Butterfly Arithmetic Acceleration Based on Dataflow Architecture
    Shi Hongbo, Fan Zhihua, Li Wenming, Zhang Zhiyuan, Mu Yudong, Ye Xiaochun, An Xuejun
    2025, 62(6): 1547-1561. DOI: 10.7544/issn1000-1239.202550160
    BeeZip2: A Domain-Specific Accelerator for High Performance Lossless Data Compression
    Gao Ruihao, Shi Shunchen, Li Xueqi, Tan Guangming
    2025, 62(6): 1562-1580. DOI: 10.7544/issn1000-1239.202550017
    A Reconfigurable Single-Precision Approximate Floating-Point Multiplier Design
    Li Pengcheng, Huang Libo, Chen Gang, Lai Mingche, Deng Lin, Liu Wei, Yang Qianming, Wang Yongwen
    2025, 62(6): 1581-1593. DOI: 10.7544/issn1000-1239.202550116