一种可重构的单精度近似浮点乘法器设计

李鹏程; 黄立波; 陈刚; 赖明澈; 邓林; 刘威; 杨乾明; 王永文

doi:10.7544/issn1000-1239.202550116

一种可重构的单精度近似浮点乘法器设计

A Reconfigurable Single-Precision Approximate Floating-Point Multiplier Design

摘要

摘要: 人工智能（AI）和物联网（IoT）技术的迅速发展，对计算能效提出了更高的要求，终端设备在硬件资源开销方面同样面临巨大挑战. 为了应对能效问题，新型低功耗近似计算单元的设计得到了广泛研究. 在数字信号处理和图像处理等应用场景中，存在大量的浮点运算. 这些应用消耗了大量的硬件资源，但它们具有一定的容错性，没有必要进行完全精确的计算. 据此，提出了一种基于移位近似算法MTA（multiplication to shift addition）和非对称截断的单精度可重构近似浮点乘法器设计方法. 首先，采用了一种低功耗的近似算法MTA，将部分操作数的乘法运算转换为移位加法. 其次，为了在精度和成本之间取得平衡，设计了针对操作数高有效位的非对称截断处理，并对截断后保留的部分进行精确计算. 通过采用不同位宽的MTA近似计算和改变截断后部分积阵列的行数，生成了广阔的设计空间，从而可以在精度和成本之间进行多种权衡调整. 与精确浮点乘法器相比，所提出设计MTA5T5的精度损失（MRED）仅约为0.32%，功耗降低了85.80%，面积减少了79.53%. 对于精度较低的MTA3T3，其精度损失约为1.92%，而功耗和面积分别降低了90.55%和85.80%. 最后，进行了FIR滤波和图像处理的应用测试，结果表明所提出的设计在精度和开销方面具有显著优势.

Abstract: The rapid development of artificial intelligence (AI) and Internet of things (IoT) technologies has increased demands for the computing energy efficiency, and terminal devices are also facing great challenges in terms of hardware resource overhead. In order to solve the problem of energy efficiency, new designs of approximate computing units with low power have been widely studied. In application scenarios such as digital signal processing and image processing, there are a large number of floating-point operations. These applications consume a lot of hardware resources. However, they have certain fault tolerance characteristics and do not have to perform accurate calculations. In this paper, we propose a 32-bit single-precision reconfigurable approximate floating-point multiplier design method based on multiplication to shift addition (MTA) and asymmetric truncation. Firstly, we utilize an efficient and low-power approximate algorithm that transforms multiplication into shift-addition for partial operands. Secondly, to ensure a balance between accuracy and cost, we propose asymmetric truncation processing for the most significant bits (MSBs) and perform precise calculations on the truncated retained parts. The design achieves high precision and low power. A wide design space is generated by combining the MTA algorithm with different bit widths and varying the number of rows in the partial product array after truncation. This allows for various trade-offs between accuracy and cost. Compared with the exact floating-point multiplier, MTA5T5 has an accuracy loss of mean relative error distance (MRED) of only about 0.32%, a power reduction of 85.80%, and an area reduction of about 79.53%. The accuracy loss of MTA3T3 with lower accuracy is about 1.92%, while the power and area decrease to 90.55% and 85.80%, respectively. Ultimately, we test the application of FIR filtering and image processing, and the results show that the proposed design has significant advantages in terms of accuracy.

HTML全文

参考文献(33)

施引文献

资源附件(1)