高级检索
    李功丽, 戴紫彬, 徐进辉, 王寿成, 朱玉飞, 冯晓. 基于流体系架构的分组密码处理器设计[J]. 计算机研究与发展, 2017, 54(12): 2833-2842. DOI: 10.7544/issn1000-1239.2017.20160670
    引用本文: 李功丽, 戴紫彬, 徐进辉, 王寿成, 朱玉飞, 冯晓. 基于流体系架构的分组密码处理器设计[J]. 计算机研究与发展, 2017, 54(12): 2833-2842. DOI: 10.7544/issn1000-1239.2017.20160670
    Li Gongli, Dai Zibin, Xu Jinhui, Wang Shoucheng, Zhu Yufei, Feng Xiao. Design of Block Cipher Processor Based on Stream Architecture[J]. Journal of Computer Research and Development, 2017, 54(12): 2833-2842. DOI: 10.7544/issn1000-1239.2017.20160670
    Citation: Li Gongli, Dai Zibin, Xu Jinhui, Wang Shoucheng, Zhu Yufei, Feng Xiao. Design of Block Cipher Processor Based on Stream Architecture[J]. Journal of Computer Research and Development, 2017, 54(12): 2833-2842. DOI: 10.7544/issn1000-1239.2017.20160670

    基于流体系架构的分组密码处理器设计

    Design of Block Cipher Processor Based on Stream Architecture

    • 摘要: 为提升密码处理器性能,构建了密码处理器性能模型.基于该模型,提出多级资源共享、绑定前/后异或操作、最大化算法并行度等处理器性能提升技术,并根据性能提升技术确定了功能单元的种类和数量.然而功能单元不仅数量较多,而且在操作位宽和操作延迟方面均有较大差异,如何有效组织这些功能单元成为了一个关键问题.利用流体系结构可以高效集成大量功能单元的特点,设计并实现了基于流体系结构的可重构分组密码处理器原型,并通过把功能单元划分为基本处理单元,bank间共享单元和簇间共享单元3个层次来解决功能单元处理位宽和操作延迟的差异.在65nm CMOS工艺下对处理器原型进行综合,并在该结构上映射了典型的分组密码算法.实验结果证明:该处理器以较小的面积获得了较高的性能,对典型分组密码算法的处理速度,不仅超越了国际上的密码专用指令处理器,而且高于国内可重构阵列结构密码处理器.

       

      Abstract: To improve the performance of cipher processor, the performance model of cipher processor is proposed. And based on this model, three ways for enhancing cipher processor's performance are presented, which are sharing multi-level resources, binding operations of pre-xor or post-xor and maximizing parallelism of block cipher algorithms. According to these improvement methods, the type and amount of cryptographic function units are determined. However, the function units are not only numerous but also different in operation data width and latency, so how to organize these function units efficiently becomes a key problem. The stream processor architecture can integrate a large number of function units to obtain high performance. Then, we design and implement a reconfigurable block cipher processor prototype which is based on stream processor architecture, and in order to organize the numerous function units effectively, the function units are divided into basic units, inter-bank-shared units and inter-cluster-shared units respectively according to their processing width. The prototype is synthesized in 65nm CMOS process and several typical block cipher algorithms are mapped on it. The evaluation results show that the processor prototype is area-efficient and its performance is not only beyond that of international application specific instruction cipher processors, but also higher than that of the domestic reconfigurable array processors.

       

    /

    返回文章
    返回