• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Wang Endong, Tang Shibin, Chen Jicheng, Wang Hongwei, Ni Fan, Zhao Yaqian. Directory Cache Design for Multi-Core Processor[J]. Journal of Computer Research and Development, 2015, 52(6): 1242-1253. DOI: 10.7544/issn1000-1239.2015.20150140
Citation: Wang Endong, Tang Shibin, Chen Jicheng, Wang Hongwei, Ni Fan, Zhao Yaqian. Directory Cache Design for Multi-Core Processor[J]. Journal of Computer Research and Development, 2015, 52(6): 1242-1253. DOI: 10.7544/issn1000-1239.2015.20150140

Directory Cache Design for Multi-Core Processor

More Information
  • Published Date: May 31, 2015
  • With the development of Internet of things, cloud computing and Internet public opinion analysis, big data applications are growing into the critical workloads in current data center. Directory cache is used to guarantee cache coherence in chip multi-processor, which is massively deployed in data centers. Previous researches proposed all kinds of innovation to improve the utilization of directory cache capacity and scalability, making it more suitable for high-performance computing. Big data workloads are timing sensitive, which is not satisfied by previous works. To meet the requirement of big data workloads, master-salve directory is a novel directory cache design, which can optimize the path of memory instruction. In the novel directory cache design, master directory picks up private data accesses and provides services for them to reduce miss-latency, and slave directory provides cache coherence for shared memory space to improve the utilization of cache capacity and the scalability of chip multi-processor. Our experiment benchmark is CloudSuite-v1.0, running on Simics+GEMS simulator. Compared with sparse directory with 2×capacity, the experimental results show that master-slave directory can reduce hardware overhead by 24.39%, and reduce miss-latency by 28.45%, and improve IPC by 3.5%. Compared with in-cache directory, the results show that master-slave directory sacrifices 5.14% miss-latency and 1.1% IPC, but reduces hardware overhead by 42.59%.
  • Related Articles

    [1]Duan Zhuohui, Liu Haikun, Zhao Jinwei, Liu Yihang, Liao Xiaofei, Jin Hai. A Reconfigurable Cache Consistency Mechanism for Distributed Memory Pool[J]. Journal of Computer Research and Development, 2023, 60(9): 1960-1972. DOI: 10.7544/issn1000-1239.202330448
    [2]Chen Zhiqiang, Zhou Hongwei, Feng Quanyou, Deng Rangyu. Design and Implementation of Configurable Cache Coherence Protocol for Multi-Core Processor[J]. Journal of Computer Research and Development, 2021, 58(6): 1166-1175. DOI: 10.7544/issn1000-1239.2021.20210174
    [3]Xie Zhen, Tan Guangming, Sun Ninghui. Research on Optimal Performance of Sparse Matrix-Vector Multiplication and Convoulution Using the Probability-Process-Ram Model[J]. Journal of Computer Research and Development, 2021, 58(3): 445-457. DOI: 10.7544/issn1000-1239.2021.20180601
    [4]He Ximing, Ma Sheng, Huang Libo, Chen Wei, Wang Zhiying. A Simple and Efficient Cache Coherence Protocol Based on Self-Updating[J]. Journal of Computer Research and Development, 2019, 56(4): 719-729. DOI: 10.7544/issn1000-1239.2019.20170898
    [5]Chen Jicheng, Li Yihan, Zhao Yaqian, Wang Endong, Shi Hongzhi, Tang Shibin. A Shared-Forwarding State Based Multiple-Tier Cache Coherency Protocol[J]. Journal of Computer Research and Development, 2017, 54(4): 764-774. DOI: 10.7544/issn1000-1239.2017.20160141
    [6]Zheng Jianwei, Yang Ping, Wang Wanliang, Bai Cong. Kernel Sparse Representation Classification with Group Weighted Constraints[J]. Journal of Computer Research and Development, 2016, 53(11): 2567-2582. DOI: 10.7544/issn1000-1239.2016.20150743
    [7]Luan Hua, Zhou Mingquan, Fu Yan. Frequent Graph Mining on Multi-Core Processor[J]. Journal of Computer Research and Development, 2015, 52(12): 2844-2856. DOI: 10.7544/issn1000-1239.2015.20140598
    [8]Zhang Lunkai, Song Fenglong, Wang Da, Fan Dongrui, Sun Ninghui. Improving the Performance of Sparse Directories[J]. Journal of Computer Research and Development, 2014, 51(9): 1955-1970. DOI: 10.7544/issn1000-1239.2014.20131173
    [9]Su Wen, Zhang Longbing, Gao Xiang, Su Menghao. A Cache Locking and Direct Cache Access Based Network Processing Optimization Method[J]. Journal of Computer Research and Development, 2014, 51(3): 681-690.
    [10]Zhang Xiaoqiang, Peng Lin, Peng Yuanxi, and Xie Lunguo. A Lightweight Directory Based Algorithm for STM[J]. Journal of Computer Research and Development, 2008, 45(9): 1517-1523.

Catalog

    Article views (1402) PDF downloads (850) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return