ISSN 1000-1239 CN 11-1777/TP

计算机研究与发展 ›› 2020, Vol. 57 ›› Issue (5): 1022-1036.doi: 10.7544/issn1000-1239.2020.20190445

• 人工智能 • 上一篇    下一篇

基于三尺度嵌套残差结构的交通标志快速检测算法

李旭东,张建明,谢志鹏,王进   

  1. (长沙理工大学计算机与通信工程学院 长沙 410114) (综合交通运输大数据智能处理湖南省重点实验室(长沙理工大学) 长沙 410114) (lxd@stu.csust.edu.cn)
  • 出版日期: 2020-05-01
  • 基金资助: 
    国家自然科学基金项目(61972056,61811530332);湖南省自然科学基金项目(2019JJ50666);长沙理工大学“双一流”科学研究国际合作拓展项目(2019IC34);湖南省研究生培养创新基地项目(2019-248-51);湖南省研究生科研创新项目(CX20190695)

A Fast Traffic Sign Detection Algorithm Based on Three-Scale Nested Residual Structures

Li Xudong, Zhang Jianming, Xie Zhipeng, Wang Jin   

  1. (School of Computer and Communication Engineering, Changsha University of Science and Technology, Changsha 410114) (Hunan Provincial Key Laboratory of Intelligent Processing of Big Data on Transportation (Changsha University of Science and Technology), Changsha 410114)
  • Online: 2020-05-01
  • Supported by: 
    This work was supported by the National Natural Science Foundation of China (61972056, 61811530332), the Natural Science Foundation of Hunan Province of China (2019JJ50666), the “Double First-class” International Cooperation and Development Scientific Research Project of Changsha University of Science and Technology (2019IC34), the Postgraduate Training Innovation Base Construction Project of Hunan Province (2019-248-51), and the Postgraduate Scientific Research Innovation Fund of Hunan Province (CX20190695).

摘要: 智能驾驶对交通标志自动检测的实时性及鲁棒性有着极高要求.目标检测中YOLOv3-tiny检测算法是轻量网络,实时性好、但准确率不高.将YOLOv3-tiny检测算法作为基础网络,提出了一种三尺度嵌套残差结构的交通标志快速检测算法.首先,在基础网络上采用逐像素相加的跨层连接,并未增加特征图的通道数,同时网络中形成1个小残差结构.其次,通过同样的跨层连接方式,增加了1层空间分辨率更高的预测输出,使得该尺度输出包含更丰富的空间信息,进而构成大残差结构.最终,将2个残差结构进行嵌套,形成了1个三尺度预测的嵌套残差网络模型,使得Tiny检测算法的部分主网络位于这2个残差结构中,起到3次调参的作用.实验结果表明:提出的算法能够快速鲁棒地检测真实场景中的交通标志.在德国交通标志检测数据集(German traffic sign detection benchmark, GTSDB)上交通标志总F\-1值为91.77%、检测时间为5ms;在长沙理工大学中国交通标志检测数据集(CSUST Chinese traffic sign detection benchmark, CCTSDB)上指示、禁令、警告三大类交通标志F\-1值分别为92.41%,93.91%,92.03%,检测时间为5ms.

关键词: 交通标志检测, YOLO检测算法, 嵌套残差网络, 多尺度预测, 长沙理工大学, 长沙理工大学中国交通标志检测数据集

Abstract: Automatic driving technology has high requirements for real-time and robustness of traffic sign detection in real world. The YOLOv3-tiny model is a lightweight network with good real-time performance in the object detection, but its accuracy is not high. In this paper, we use YOLOv3-tiny as the basic network and propose a fast traffic sign detection algorithm with three-scale nested residual structure. Firstly, shortcut based on pixel by pixel addition is employed in the YOLOv3-tiny network. It does not increase the number of feature map channels, and a small residual structure is formed in the network at the same time. Secondly, the predictive output with higher spatial resolution is also added through the shortcut, which contains more abundant spatial information, thus forming a large residual structure. Finally, the two residual structures are nested to form a three-scale predictive nested residual network, which makes the main network of Tiny located in these two residual structures and the parameters can be adjusted three times. The results show that the proposed algorithm can quickly and robustly detect traffic signs in real scenes. The F\-1 value of total traffic signs achieves 91.77% on German traffic sign detection benchmark and the detection time is 5ms. On CSUST Chinese traffic sign detection benchmark, F\-1 values of the Mandatory, the Prohibitory and the Warning are 92.41%, 93.91% and 92.03% respectively, and the detection time is 5ms.

Key words: traffic sign detection, you only look once (YOLO) detection algorithm, nested residual network, multi-scale prediction, Changsha University of Science and Technology (CSUST), CSUST Chinese traffic sign detection benchmark (CCTSDB)

中图分类号: