Xie Kunpeng, Lu Ye, Jin Zongming, Liu Yiqing, Gong Cheng, Chen Xinwei, Li Tao. FAQ-CNN: A Flexible Acceleration Framework for Quantized Convolutional Neural Networks on Embedded FPGAs[J]. Journal of Computer Research and Development, 2022, 59(7): 1409-1427. DOI: 10.7544/issn1000-1239.20210142
1(College of Computer Science, Nankai University, Tianjin 300350)
2(Tianjin Key Laboratory of Network and Data Security Technology (Nankai University), Tianjin 300350)
3(State Key Laboratory of Computer Architecture (Institute of Computing Technology, Chinese Academy of Sciences), Beijing 100190)
4(Fujian Provincial Key Laboratory of Information Processing and Intelligent Control (Minjiang University), Fuzhou 350108)
Funds: This work was supported by the National Key Research and Development Program of China (2018YFB2100304), the National Natural Science Foundation of China (62002175), the Open Project Fund of State Key Laboratory of Computer Architecture (Institute of Computing Technology, Chinese Academy of Sciences) (CARCHB202016), the Special Funding for Excellent Enterprise Technology Correspondent of Tianjin (21YDTPJC00380), the Open Project of Fujian Provincial Key Laboratory of Information Processing and Intelligent Control (Minjiang University) (MJUKF-IPIC202105), and the Innovation Fund of Chinese Universities Industry-University-Research (2020HYA01003).
Quantization can compress the model size of convolutional neural networks (CNNs) and improve computing efficiency. However, existing accelerator designs for quantized CNNs usually face challenges such as the diversity of quantization algorithms, poor reusability of code modules, inefficient data exchange, and insufficient resource utilization. To meet these challenges, we propose FAQ-CNN, a flexible acceleration framework for quantized CNNs that optimizes accelerator design in three aspects: computing, communication, and storage. FAQ-CNN supports rapid deployment of quantized CNN models in the form of software tools. First, a quantization-algorithm component is designed to separate the calculation part of a quantization algorithm from its value-projection process; optimization techniques such as operator fusion, double buffering, and pipelining are also applied to improve the parallel execution efficiency of CNN inference. Then, a hierarchical, bitwidth-independent encoding scheme and a parallel decoding method are proposed to efficiently support batch transmission of and parallel computing on low-bitwidth data. Finally, a resource allocation optimization model, which can be transformed into an integer nonlinear programming problem, is established for FAQ-CNN, and a heuristic pruning strategy is used to reduce the size of the design space. Extensive experimental results show that FAQ-CNN can support almost all kinds of quantized CNN accelerators efficiently and flexibly. When activations and weights are quantized to 16 bits, the computing performance of the FAQ-CNN accelerator is 1.4 times that of Caffeine; with an 8-bit configuration, FAQ-CNN achieves a superior performance of 1.23 TOPS.
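To make the separation of projection and calculation concrete, here is a minimal C++ sketch of the idea, assuming a plain uniform symmetric quantization scheme; the function names and scale factors are illustrative, not FAQ-CNN's actual interface. The projection maps floats onto low-bitwidth integers, while the compute kernel works purely on integers, so supporting a different quantization algorithm only requires replacing the projection rule.

```cpp
#include <algorithm>
#include <cmath>
#include <cstdint>
#include <cstdio>
#include <vector>

// Projection: map a real value onto the signed 8-bit grid (illustrative
// uniform symmetric scheme; other algorithms substitute their own rule).
int8_t project(float x, float scale) {
    long q = std::lround(x / scale);                  // round to nearest
    return static_cast<int8_t>(std::clamp(q, -128L, 127L));
}

// Calculation: an integer-only dot product. This kernel never sees floats,
// so it can be reused unchanged across quantization algorithms.
int32_t dot_q(const std::vector<int8_t>& a, const std::vector<int8_t>& w) {
    int32_t acc = 0;
    for (size_t i = 0; i < a.size(); ++i)
        acc += static_cast<int32_t>(a[i]) * static_cast<int32_t>(w[i]);
    return acc;
}

int main() {
    const float s_a = 0.05f, s_w = 0.02f;             // per-tensor scales
    std::vector<int8_t> a = {project(0.5f, s_a), project(-0.25f, s_a)};
    std::vector<int8_t> w = {project(0.1f, s_w), project(0.3f, s_w)};
    // Dequantize the integer accumulator back to the real domain.
    std::printf("%f\n", dot_q(a, w) * s_a * s_w);     // ~ 0.5*0.1 + (-0.25)*0.3 = -0.025
}
```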
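The encoding idea can be illustrated in miniature as well. The sketch below is a hypothetical example rather than the paper's exact scheme: it packs fixed-bitwidth unsigned values into 64-bit words so that low-bitwidth data can be transferred in full-width bursts, and the lane-wise decoder uses only shifts and masks, which unrolls into fully parallel extraction logic on an FPGA.

```cpp
#include <cstdint>
#include <cstdio>
#include <vector>

// Pack `bits`-bit unsigned values into 64-bit words (bits must divide 64 here).
std::vector<uint64_t> pack(const std::vector<uint8_t>& vals, unsigned bits) {
    const unsigned lanes = 64 / bits;                 // values per word
    const uint64_t mask = (1ULL << bits) - 1;
    std::vector<uint64_t> words((vals.size() + lanes - 1) / lanes, 0);
    for (size_t i = 0; i < vals.size(); ++i)
        words[i / lanes] |= (static_cast<uint64_t>(vals[i]) & mask)
                            << (bits * (i % lanes));
    return words;
}

// Decode every lane of one word; the lanes are independent, so in hardware
// this loop becomes parallel extractors rather than a sequential scan.
void unpack(uint64_t word, unsigned bits, uint8_t* out) {
    const unsigned lanes = 64 / bits;
    const uint64_t mask = (1ULL << bits) - 1;
    for (unsigned l = 0; l < lanes; ++l)
        out[l] = static_cast<uint8_t>((word >> (bits * l)) & mask);
}

int main() {
    std::vector<uint8_t> v = {1, 2, 3, 4, 5, 6, 7, 0};
    auto packed = pack(v, 4);                         // 16 4-bit lanes per word
    uint8_t dec[16];
    unpack(packed[0], 4, dec);
    for (int i = 0; i < 8; ++i) std::printf("%u ", dec[i]);   // 1 2 3 4 5 6 7 0
    std::printf("\n");
}
```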
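Finally, the flavor of the design-space search behind the resource allocation model can be shown with a toy example. All cost expressions below are placeholders rather than the paper's model: tiling factors (Tm, Tn) determine DSP and BRAM usage as well as throughput, and because the DSP cost grows monotonically with the tiling factors, a heuristic pruning step can discard the remainder of a search row as soon as the budget is exceeded.

```cpp
#include <cstdio>

struct Design { int tm, tn; double perf; };

int main() {
    const int DSP_BUDGET = 2048, BRAM_BUDGET = 192;   // illustrative budgets
    Design best{0, 0, 0.0};
    long evaluated = 0, pruned = 0;
    for (int tm = 1; tm <= 128; ++tm) {
        for (int tn = 1; tn <= 128; ++tn) {
            // Toy cost models: one DSP per MAC lane, double-buffered tiles.
            if (tm * tn > DSP_BUDGET) {               // monotone in tn, so the
                pruned += 129 - tn;                   // rest of the row is pruned
                break;
            }
            if (2 * (tm + tn) > BRAM_BUDGET) { ++pruned; continue; }
            ++evaluated;
            double perf = 2.0 * tm * tn;              // ops per cycle (toy)
            if (perf > best.perf) best = {tm, tn, perf};
        }
    }
    std::printf("best (Tm=%d, Tn=%d): %.0f ops/cycle; evaluated=%ld pruned=%ld\n",
                best.tm, best.tn, best.perf, evaluated, pruned);
}
```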