高级检索
    马超, 戴紫彬, 李伟, 南龙梅, 金羽. RPRU:一种面向处理器的比特抽取与移位统一架构[J]. 计算机研究与发展, 2018, 55(2): 426-437. DOI: 10.7544/issn1000-1239.2018.20160775
    引用本文: 马超, 戴紫彬, 李伟, 南龙梅, 金羽. RPRU:一种面向处理器的比特抽取与移位统一架构[J]. 计算机研究与发展, 2018, 55(2): 426-437. DOI: 10.7544/issn1000-1239.2018.20160775
    Ma Chao, Dai Zibin, Li Wei, Nan Longmei, Jin Yu. RPRU: A Unified Architecture for Rotation and Bit-Extraction Operations in General-Propose Processor[J]. Journal of Computer Research and Development, 2018, 55(2): 426-437. DOI: 10.7544/issn1000-1239.2018.20160775
    Citation: Ma Chao, Dai Zibin, Li Wei, Nan Longmei, Jin Yu. RPRU: A Unified Architecture for Rotation and Bit-Extraction Operations in General-Propose Processor[J]. Journal of Computer Research and Development, 2018, 55(2): 426-437. DOI: 10.7544/issn1000-1239.2018.20160775

    RPRU:一种面向处理器的比特抽取与移位统一架构

    RPRU: A Unified Architecture for Rotation and Bit-Extraction Operations in General-Propose Processor

    • 摘要: 比特抽取与循环移位操作都可以利用位级置换完成.目前,它们在硬件实现时,大都采用分离的、各自独立的设计方式,这造成了硬件逻辑资源的浪费.尽管有些研究成果将它们统一设计,但是实现路由算法的电路却是独立的,逻辑资源消耗较多.因此,通过研究循环移位和比特抽取这2种比特级操作在多级动态互连网络Inverse Butterfly中的映射原理,并结合该网络的自路由和递归特性,提出了一种针对这2种操作的统一路由算法.该算法不仅具有较高的并行性,而且硬件实现简洁,利于处理器架构集成.在此基础上,构造了一种可重构比特抽取-移位硬件单元(reconfigurable parallel bit extraction-rotation hardware unit, RPRU),并对其关键路径电路进行了优化设计.然后,在CMOS 90nm工艺下完成了逻辑综合.实验结果表明:利用该路由算法所构造的硬件单元与以往同类设计相比,面积减少了近30%.

       

      Abstract: Parallel bit extraction and rotation-shift operations can be completed by bit level permutation. At present, they are mainly implemented independently, which results in the waste of hardware logic resources. Although some of the researches unified the two operations into a single hardware unit, it was required to design two dedicated circuits to implement the routing algorithms for each operation. Consequently, the consumption of the logic resources is still high. To solve this problem, a unified routing algorithm is proposed by studying the mapping principle of rotation-shift and parallel bit extraction operations based on one kind of dynamic multistage interconnect network named Inverse Butterfly Network. The algorithm utilizes the self-routing and recursive characteristics of the network. It not only has high parallelism, but also is simple in hardware implementation, which is conductive to integration for the general-propose processor architecture. On this basis, we also develop a reconfigurable parallel bit extraction hardware unit with rotation-shift function named RPRU, and optimize the critical path of the unit. Then, we synthesize it into CMOS 90nm process. The experimental results show that the area of our RPRU using the unified algorithm is less by 30% than that of the previous design with identical functions.

       

    /

    返回文章
    返回