• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Zhang Kun, Guo Feng, Zheng Fang, Xie Xianghui. Design of a Pipeline-Coupled Instruction Loop Cache for Many-Core Processors[J]. Journal of Computer Research and Development, 2017, 54(4): 813-820. DOI: 10.7544/issn1000-1239.2017.20160116
Citation: Zhang Kun, Guo Feng, Zheng Fang, Xie Xianghui. Design of a Pipeline-Coupled Instruction Loop Cache for Many-Core Processors[J]. Journal of Computer Research and Development, 2017, 54(4): 813-820. DOI: 10.7544/issn1000-1239.2017.20160116

Design of a Pipeline-Coupled Instruction Loop Cache for Many-Core Processors

More Information
  • Published Date: March 31, 2017
  • Energy efficiency is a great challenge in the design of future high performance computers. Since the many-core processor becomes a key choice of future high performance computers, the optimization of its micro-architecture is very important for the improvement of energy efficiency. This paper proposes a pipeline-coupled instruction loop cache for the many-core processor. The instruction loop cache is small sized so that it will provide more energy-efficient instruction storage. As an attempt of implementation-aware micro-architecture research, the loop cache is designed under constraints of hardware costs from the beginning. In order to alleviate the impact to the pipeline performance, the loop cache adopts a prefetching technique. The instruction loop cache prefetches the exit path of the loop into the cache when a loop is detected. The prefetching mechanism guarantees that the design of the loop cache in the pipeline can lead to the improvement of the energy efficiency. The instruction loop cache is implemented in the gem5 simulator. Experiments on a set of SPEC2006 benchmarks show that a typical configuration can reduce on average 27% of instruction fetching power and 31.5% power of the pipeline front-end.
  • Related Articles

    [1]Liu Bingtao, Wang Da, Ye Xiaochun, Zhang Hao, Fan Dongrui, Zhang Zhimin. A Dataflow Cache Processor Frontend Design[J]. Journal of Computer Research and Development, 2016, 53(6): 1221-1237. DOI: 10.7544/issn1000-1239.2016.20150317
    [2]He Yanxiang, Shen Fanfan, Zhang Jun, Jiang Nan, Li Qing’an, Li Jianhua. Cache Optimization Approaches of Emerging Non-Volatile Memory Architecture: A Survey[J]. Journal of Computer Research and Development, 2015, 52(6): 1225-1241. DOI: 10.7544/issn1000-1239.2015.20150104
    [3]Su Wen, Zhang Longbing, Gao Xiang, Su Menghao. A Cache Locking and Direct Cache Access Based Network Processing Optimization Method[J]. Journal of Computer Research and Development, 2014, 51(3): 681-690.
    [4]Chen Fangyuan, Zhang Dongsong, Liu Cong, Wang Zhiying. Analysis of Inter-Thread Interference on Shared Cache Multi-Core Architectures Based on Instruction Fetch Timing Frame[J]. Journal of Computer Research and Development, 2013, 50(1): 206-217.
    [5]Tang Yixuan, Wu Junmin, Chen Guoliang, Sui Xiufeng, Huang Jing. A Utility Based Cache Optimization Mechanism for Multi-Thread Workloads[J]. Journal of Computer Research and Development, 2013, 50(1): 170-180.
    [6]Zhang Huizhen, Wang Chao, Li Xi, Zhou Xuehai. An Optimized Code-Generating Algorithm for Reconfigurable Instruction Set Processors[J]. Journal of Computer Research and Development, 2012, 49(9): 2018-2026.
    [7]Jia Yaocang, Wu Chenggang, Zhang Zhaoqing. Program’s Performance Profiling Optimization for Guiding Static Cache Partitioning[J]. Journal of Computer Research and Development, 2012, 49(1): 93-102.
    [8]Wang Wenxiang, Zhang Guangfei, Shen Haihua. A 2D-Cache Based Memory Bandwidth Optimization Method for H.264 Motion Compensation[J]. Journal of Computer Research and Development, 2012, 49(1): 55-63.
    [9]Hu Xiao and Chen Shuming. Code Layout for Phase Prefetch on Instruction Cache[J]. Journal of Computer Research and Development, 2009, 46(5): 747-755.
    [10]Zhou Xuehai, Yu Jie, Li Xi, and Wand Zhigang. Research on Reliability Evaluation of Cache Based on Instruction Behavior[J]. Journal of Computer Research and Development, 2007, 44(4): 553-559.
  • Cited by

    Periodical cited type(5)

    1. 谢汶兵,田雪,漆锋滨,武成岗,王俊,罗巧玲. 二进制翻译技术综述. 软件学报. 2024(06): 2687-2723 .
    2. 刘登峰,李东亚,柴志雷,周浩杰,丁海峰. 基于QEMU的SIMD指令替换浮点指令框架. 湖南大学学报(自然科学版). 2024(08): 70-77 .
    3. 余子濠 ,陈璐 ,孙凝晖 ,包云岗 . 以RISC-V为目标的动态二进制翻译代码质量优化方法. 计算机研究与发展. 2023(10): 2322-2334 . 本站查看
    4. 李明亮,庞建民,岳峰. 基于地址重用的二进制翻译本地代码替换. 信息工程大学学报. 2022(01): 38-44 .
    5. 李男,庞建民. 基于中间表示规则替换的二进制翻译中间代码优化方法. 国防科技大学学报. 2021(04): 156-162 .

    Other cited types(2)

Catalog

    Article views (1184) PDF downloads (590) Cited by(7)

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return