Citation: | Li Ao, Ge Yongxin, Liu Huijun, Yang Chunhua, Zhou Xiuzhuang. Content-Aware Explainable Pavement Distress Detection Model[J]. Journal of Computer Research and Development, 2024, 61(3): 701-715. DOI: 10.7544/issn1000-1239.202220795 |
To address the challenges of using high-resolution pavement images as input for existing convolutional neural network models and the inability of existing preprocessing algorithms to effectively perceive and retain information from low-ratio distress regions in original pavement images, a novel architectural unit called adaptive perception module (APM) paying greater attention to pavement distress region is proposed with the help of visual interpretation techniques, which achieves a rapid and accurate detection of pavement distress in high-resolution images and could be used to build a software system for automatic detection of pavement distress based on computer vision. Firstly, big kernel convolution and residual operations are used to reduce the origin image resolution and get the low-level but rich feature representation. Secondly, attention mechanism is developed to perceive and activate the region of pavement distress and filter the irrelevant background pixel noise. By means of joint learning, APM training could be completed without additional cost. After the visual interpretation method is used to aid the selection and design of the specific structure of APM, experimental results on the latest public dataset CQU-BPMDD show that the proposed APM significantly improves the classification accuracy, up to 84.47%. Experiments across different datasets CQU-BPDD demonstrate the generalization and robustness of APM. Code is available on https://github.com/Li-Ao-Git/apm.
[1] |
蔡骏,蒋立政. 高速公路沥青路面常见病害的成因分析及养护对策[J]. 工程与建设,2006,20(1):81−83
Cai Jun, Jiang Lizheng. Cause analysis and maintenance countermeasure of common distresss of expressway asphalt pavement [J] Engineering and Construction, 2006, 20(1): 81−83 (in Chinese)
|
[2] |
刘凡,王君锋,陈峙宇,等. 基于并行注意力UNet的裂缝检测方法[J]. 计算机研究与发展,2021,58(8):1718−1726 doi: 10.7544/issn1000-1239.2021.20210335
Liu Fan, Wang Junfeng, Chen Zhiyu, et al. Parallel attention based UNet for crack detection[J]. Journal of Computer Research and Development, 2021, 58(8): 1718−1726 (in Chinese) doi: 10.7544/issn1000-1239.2021.20210335
|
[3] |
罗晖,贾晨,李健. 基于改进YOLOv4的公路路面病害检测算法[J]. 激光与光电子学进展,2021,58(14):336−344
Luo Hui, Jia Chen, Li Jian. Road surface distress detection algorithm based on improved YOLOv4[J]. Laser & Optoelectronics Progress, 2021, 58(14): 336−344 (in Chinese)
|
[4] |
汪烨,陈骏武,夏鑫,等. 智能需求获取与建模研究综述[J]. 计算机研究与发展,2021,58(4):683−705 doi: 10.7544/issn1000-1239.2021.20200740
Wang Ye, Chen Junwu, Xia Xin, et al. Intelligent requirements elicitation and modeling: A literature review[J]. Journal of Computer Research and Development, 2021, 58(4): 683−705 (in Chinese) doi: 10.7544/issn1000-1239.2021.20200740
|
[5] |
Amhaz R, Chambon S, Idier J, et al. Automatic crack detection on two-dimensional pavement images: An algorithm based on minimal path selection[J]. IEEE Transactions on Intelligent Transportation Systems, 2016, 17(10): 2718−2729 doi: 10.1109/TITS.2015.2477675
|
[6] |
Koch C, Georgieva K, Kasireddy V, et al. A review on computer vision based defect detection and condition assessment of concrete and asphalt civil infrastructure[J]. Advanced Engineering Informatics, 2015, 29(2): 196−210 doi: 10.1016/j.aei.2015.01.008
|
[7] |
Kapela R, Śniatała P, Turkot A, et al. Asphalt surfaced pavement cracks detection based on histograms of oriented gradients[C]// Proc of the 22nd Int Conf Mixed Design of Integrated Circuits & Systems. Piscataway, NJ: IEEE, 2015: 579−584
|
[8] |
鞠卓亚,王志海. 基于选择性模式的贝叶斯分类算法[J]. 计算机研究与发展,2020,57(8):1605−1616 doi: 10.7544/issn1000-1239.2020.20200196
Ju Zhuoya, Wang Zhihai. A Bayesian classification algorithm based on selective patterns[J]. Journal of Computer Research and Development, 2020, 57(8): 1605−1616 (in Chinese) doi: 10.7544/issn1000-1239.2020.20200196
|
[9] |
付忠良. 不平衡多分类问题的连续AdaBoost算法研究[J]. 计算机研究与发展,2011,48(12):2326−2333
Fu Zhongliang. Real AdaBoost algorithm for multi-class and imbalanced classification problems[J]. Journal of Computer Research & Development, 2011, 48(12): 2326−2333 (in Chinese)
|
[10] |
Gopalakrishnan K, Khaitan S K, Choudhary A, et al. Deep convolutional neural networks with transfer learning for computer vision-based data-driven pavement distress detection[J]. Construction and Building Materials, 2017, 157: 322−330 doi: 10.1016/j.conbuildmat.2017.09.110
|
[11] |
Li Shengyuan, Zhao Xuefeng. Convolutional neural networks-based crack detection for real concrete surface[C]// Proc of the 23rd Conf on Sensors and Smart Structures Technologies for Civil, Mechanical, and Aerospace Systems. Bellingham, WA: SPIE, 2018, 10598: 955−961
|
[12] |
Cha Youngjin, Choi W, Büyüköztürk O. Deep learning-based crack damage detection using convo-lutional neural networks[J]. Computer‐Aided Civil and Infrastructure Engineering, 2017, 32(5): 361−378 doi: 10.1111/mice.12263
|
[13] |
Tang Wenhao, Huang Sheng, Zhao Qiming, et al. An iteratively optimized patch label inference network for automatic pavement distress detection[J]. IEEE Transactions on Intelligent Transportation Systems, 2021, 23(7): 8652−8661
|
[14] |
Huang Guixin, Huang Sheng, Huangfu Luwen, et al. Weakly supervised patch label inference network with image pyramid for pavement distresss recognition in the wild[C]// Proc of the 46th IEEE Int Conf on Acoustics, Speech and Signal Processing. Piscataway, NJ: IEEE, 2021: 7978−7982
|
[15] |
Talebi H, Milanfar P. Learning to resize images for computer vision tasks[C]// Proc of the 20th IEEE/CVF Int Conf on Computer Vision. Piscataway, NJ: IEEE, 2021: 497−506
|
[16] |
Wang Ziji, Turko R, Shaikh O, et al. CNN explainer: Learning convolutional neural networks with interactive visualization[J]. IEEE Transactions on Visualization and Computer Graphics, 2020, 27(2): 1396−1406
|
[17] |
Woo S, Park J, Lee J Y, et al. CBAM: Convolutional block attention module[C]//Proc of the 15th European Conf on Computer Vision. Berlin: Springer, 2018: 3−19
|
[18] |
Liu Huijun, Yang Chunhua, Li Ao, et al. Deep domain adaptation for pavement crack detection[J]. IEEE Transactions on Intelligent Transportation Systems, 2022, 24(2): 1669−1681
|
[19] |
Wang Weixing, Wang Mengfei, Li Hongxia, et al. Pavement crack image acquisition methods and crack extraction algorithms: A review[J]. Journal of Traffic and Transportation Engineering (English Edition), 2019, 6(6): 535−556 doi: 10.1016/j.jtte.2019.10.001
|
[20] |
Jenkins M D, Carr T A, Iglesias M I, et al. A deep convolutional neural network for semantic pixel-wise segmentation of road and pavement surface cracks[C]//Proc of the 26th European Signal Processing Conf. Piscataway, NJ: IEEE, 2018: 2120−2124
|
[21] |
Yang Fan, Zhang Lei, Yu Sijia, et al. Feature pyramid and hierarchical boosting network for pavement crack detection[J]. IEEE Transactions on Intelligent Transportation Systems, 2019, 21(4): 1525−1535
|
[22] |
Liu Wenjun, Huang Yuchun, Li Ying, et al. FPCNet: Fast pavement crack detection network based on encoder-decoder architecture[J]. arXiv preprint, arXiv:1907.02248, 2019
|
[23] |
Zhang Kaige, Zhang Yingtao, Cheng Hengda. CrackGAN: Pavement crack detection using partially accurate ground truths based on generative adversarial learning[J]. IEEE Transactions on Intelligent Transportation Systems, 2020, 22(2): 1306−1319
|
[24] |
Xiang Xuezhi, Zhang Yuqi, El Saddik A. Pavement crack detection network based on pyramid structure and attention mechanism[J]. IET Image Processing, 2020, 14(8): 1580−1586 doi: 10.1049/iet-ipr.2019.0973
|
[25] |
Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation[C]//Proc of the 28th IEEE Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2015: 3431−3440
|
[26] |
Zhang Lei, Yang Fan, Zhang Y D, et al. Road crack detection using deep convolutional neural network[C]//Proc of the 23rd IEEE Int Conf on Image Processing. Piscataway, NJ: IEEE, 2016: 3708−3712
|
[27] |
Krizhevsky A, Sutskever I, Hinton G E. Imagenet classification with deep convolutional neural networks[J]. Communications of the ACM, 2017, 60(6): 84−90 doi: 10.1145/3065386
|
[28] |
Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Proc of the 27th IEEE Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2014: 580−587
|
[29] |
Zeiler M D, Fergus R. Visualizing and understanding convolutional networks[C]//Proc of the 13th European Conf on Computer Vision. Berlin: Springer, 2014: 818−833
|
[30] |
Zeiler M D, Krishnan D, Taylor G W, et al. Deconvolutional networks[C]//Proc of the 23rd IEEE Computer Society Conf on Computer Vision and Pattern Recognition. Los Alamitos, CA: IEEE Computer Society, 2010: 2528−2535
|
[31] |
Lowe D G. Distinctive image features from scale-invariant keypoints[J]. International Journal of Computer Vision, 2004, 60(2): 91−110 doi: 10.1023/B:VISI.0000029664.99615.94
|
[32] |
Zeiler M D, Taylor G W, Fergus R. Adaptive deconvolutional networks for mid and high level feature learning[C]// Proc of the 13th IEEE Int Conf on Computer Vision. Piscataway, NJ: IEEE, 2011: 2018−2025
|
[33] |
Zhou Bolei, Khosla A, Lapedriza A, et al. Learning deep features for discriminative localization[C]// Proc of the 29th IEEE/CVF Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2016: 2921−2929
|
[34] |
Selvaraju R R, Cogswell M, Das A, et al. Grad-CAM: Visual explanations from deep networks via gradient-based localization[C]// Proc of the 15th IEEE Int Conf on Computer Vision. Piscataway, NJ: IEEE, 2017: 618−626
|
[35] |
Veit A, Wilber M J, Belongie S. Residual networks behave like ensembles of relatively shallow networks [J/OL]. Advances in Neural Information Processing Systems, 2016 [2023-03-18]. https://proceedings.neurips.cc/paper/2016/hash/37bc2f75bf1bcfe8450a1a41c200364c-Abstract.html
|
[36] |
Liu Liyuan, Jiang Haoming, He Pengcheng, et al. On the variance of the adaptive learning rate and beyond[J]. arXiv preprint, arXiv: 1908. 03265, 2019
|
[37] |
Zhang M, Lucas J, Ba J, et al. Lookahead optimizer: k steps forward, 1 step back[J/OL]. Advances in Neural Information Processing Systems, 2019 [2023-03-18].https://proceedings.neurips.cc/paper/2019/hash/90fd4f88f588ae64038134f1eeaa023f-Abstract.html
|
[38] |
You Yangs, Gitman I, Ginsburg B. Large batch training of convolutional networks[J]. arXiv preprint, arXiv: 1708. 03888, 2017
|
[39] |
He Kaiming, Zhang Xiangyu, Ren Shaoqing, et al. Deep residual learning for image recognition[C]//Proc of the 29th IEEE Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2016: 770−778
|
[40] |
Dosovitskiy A, Beyer L, Kolesnikov A, et al. An image is worth 16×16 words: Transformers for image recognition at scale[J]. arXiv preprint, arXiv: 2010. 11929, 2020
|
[41] |
Ding Xiaohan, Zhang Xiangyu, Ma Ningning, et al. RepVGG: Making VGG-style convnets great again[C]// Proc of the 34th IEEE/CVF Conf on Computer Vision and Pattern Recognition. Piscataway, NJ: IEEE, 2021: 13733−13742
|
[42] |
Tan M, Le Q. Efficientnet_v2: Smaller models and faster training[C]// Proc of the 38th Int Conf on Machine Learning. Bellevue, WA: ILMS, 2021: 10096−10106
|
[1] | Qian Zhongsheng, Huang Heng, Zhu Hui, Liu Jinping. Multi-Perspective Graph Contrastive Learning Recommendation Method with Layer Attention Mechanism[J]. Journal of Computer Research and Development, 2025, 62(1): 160-178. DOI: 10.7544/issn1000-1239.202330804 |
[2] | Zhang Jinyu, Ma Chenxi, Li Chao, Zhao Zhongying. Towards Lightweight Cross-Domain Sequential Recommendation via Tri-Branches Graph External Attention Network[J]. Journal of Computer Research and Development, 2024, 61(8): 1930-1944. DOI: 10.7544/issn1000-1239.202440197 |
[3] | Xie Jun, Wang Yuzhu, Chen Bo, Zhang Zehua, Liu Qin. Aspect-Based Sentiment Analysis Model with Bi-Guide Attention Network[J]. Journal of Computer Research and Development, 2022, 59(12): 2831-2843. DOI: 10.7544/issn1000-1239.20210708 |
[4] | Qian Zhongsheng, Yang Jiaxiu, Li Duanming, Ye Zulai. Event Recommendation Strategy Combining User Long-Short Term Interest and vent Influence[J]. Journal of Computer Research and Development, 2022, 59(12): 2803-2815. DOI: 10.7544/issn1000-1239.20210693 |
[5] | Sun Qian, Xue Leiqi, Gao Ling, Wang Hai, Wang Yuxiang. Selection of Network Defense Strategies Based on Stochastic Game and Tabu Search[J]. Journal of Computer Research and Development, 2020, 57(4): 767-777. DOI: 10.7544/issn1000-1239.2020.20190870 |
[6] | Xu Jinghang, Zuo Wanli, Liang Shining, Wang Ying. Causal Relation Extraction Based on Graph Attention Networks[J]. Journal of Computer Research and Development, 2020, 57(1): 159-174. DOI: 10.7544/issn1000-1239.2020.20190042 |
[7] | Sun Xiaowan, Wang Ying, Wang Xin, Sun Yudong. Aspect-Based Sentiment Analysis Model Based on Dual-Attention Networks[J]. Journal of Computer Research and Development, 2019, 56(11): 2384-2395. DOI: 10.7544/issn1000-1239.2019.20180823 |
[8] | Zhang Han, Guo Yuanbo, Li Tao. Domain Named Entity Recognition Combining GAN and BiLSTM-Attention-CRF[J]. Journal of Computer Research and Development, 2019, 56(9): 1851-1858. DOI: 10.7544/issn1000-1239.2019.20180733 |
[9] | Guo Chi, Wang Lina, Guan Yiping, Zhang Xiaoying. A Network Immunization Strategy Based on Dynamic Preference Scan[J]. Journal of Computer Research and Development, 2012, 49(4): 717-724. |
[10] | Wang Bailing, Fang Binxing, Yun Xiaochun, Zhang Hongli, Chen Bo, Liu Yixuan. A New Friendly Worm Propagation Strategy Based on Diffusing Balance Tree[J]. Journal of Computer Research and Development, 2006, 43(9): 1593-1602. |