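• Outstanding Chinese Sci-Tech Journal
  • CCF-recommended Class A Chinese-language journal
  • T1-class high-quality sci-tech journal in the computing field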
Mao Haiyu, Shu Jiwu. 3D Memristor Array Based Neural Network Processing in Memory Architecture[J]. Journal of Computer Research and Development, 2019, 56(6): 1149-1160. DOI: 10.7544/issn1000-1239.2019.20190099

3D Memristor Array Based Neural Network Processing in Memory Architecture

Funds: This work was supported by the National Key Research and Development Program of China (2018YFB1003301) and the National Natural Science Foundation of China (61832011).
  • Published Date: May 31, 2019
  • Driven by the rapid development of artificial intelligence, memristor-based processing-in-memory (PIM) architectures for neural networks (NNs) have attracted considerable research interest because they perform much better than the traditional von Neumann architecture. Equipped with peripheral circuits that support function units, memristor arrays can carry out forward propagation with higher parallelism and far less data movement than CPUs and GPUs. However, memristor-based PIM hardware suffers from the large area overhead of the peripheral circuits outside the memristor arrays and from non-trivial under-utilization of the function units. This paper proposes FMC, a 3D memristor array based PIM architecture for NNs, which gathers the peripheral circuits of the function units into a function pool shared among the memristor arrays stacked on top of the pool. We also propose a data mapping scheme for this architecture that further increases the utilization of the function units and reduces data transmission among different cubes. This software-hardware co-design not only makes the most of the function units but also shortens the wire interconnections, giving faster and more energy-efficient data transmission. Experiments show that FMC achieves up to 43.33 times the function-unit utilization when training a single neural network and up to 58.51 times when training multiple neural networks. Compared with a 2D-PIM that has the same number of compute arrays and storage arrays, FMC occupies only 42.89% of the area, and it delivers a 1.5 times speedup and 1.7 times energy saving.
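
The forward-propagation step mentioned in the abstract rests on the matrix-vector products that a memristor crossbar computes in the analog domain. The following is a minimal sketch of that idea under idealized assumptions (no wire resistance, no device variation, no converters); it is not the FMC design, and the function name `crossbar_mvm` and the example values are hypothetical, chosen only for illustration.

```python
# Toy model of an ideal memristor crossbar (an illustrative assumption,
# not the FMC architecture): each cell stores a weight as a conductance
# G[i][j], input voltages V[i] drive the rows, and each column wire sums
# the resulting currents, so I[j] = sum_i V[i] * G[i][j].

def crossbar_mvm(voltages, conductances):
    """Return the column currents of an ideal crossbar."""
    n_rows = len(conductances)
    n_cols = len(conductances[0])
    if len(voltages) != n_rows:
        raise ValueError("need one input voltage per crossbar row")
    currents = [0.0] * n_cols
    for i in range(n_rows):
        for j in range(n_cols):
            # Ohm's law per cell; Kirchhoff's current law sums down the column.
            currents[j] += voltages[i] * conductances[i][j]
    return currents


# Hypothetical 3x2 weight matrix and input vector, for illustration only.
G = [[1.0, 0.5],
     [2.0, 0.0],
     [0.5, 1.5]]
V = [1.0, 2.0, 4.0]
column_currents = crossbar_mvm(V, G)
print(column_currents)  # [7.0, 6.5]
```

In a real device every column produces its current at the same time, which is where the parallelism comes from. The function units that digitize and post-process those currents are the peripheral circuits the abstract proposes to pool and share.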
