高级检索
    李傲, 葛永新, 刘慧君, 杨春华, 周修庄. 内容感知的可解释性路面病害检测模型[J]. 计算机研究与发展, 2024, 61(3): 701-715. DOI: 10.7544/issn1000-1239.202220795
    引用本文: 李傲, 葛永新, 刘慧君, 杨春华, 周修庄. 内容感知的可解释性路面病害检测模型[J]. 计算机研究与发展, 2024, 61(3): 701-715. DOI: 10.7544/issn1000-1239.202220795
    Li Ao, Ge Yongxin, Liu Huijun, Yang Chunhua, Zhou Xiuzhuang. Content-Aware Explainable Pavement Distress Detection Model[J]. Journal of Computer Research and Development, 2024, 61(3): 701-715. DOI: 10.7544/issn1000-1239.202220795
    Citation: Li Ao, Ge Yongxin, Liu Huijun, Yang Chunhua, Zhou Xiuzhuang. Content-Aware Explainable Pavement Distress Detection Model[J]. Journal of Computer Research and Development, 2024, 61(3): 701-715. DOI: 10.7544/issn1000-1239.202220795

    内容感知的可解释性路面病害检测模型

    Content-Aware Explainable Pavement Distress Detection Model

    • 摘要: 针对实际场景中高分辨路面图像难以直接作为现有卷积神经网络(convolutional neural network,CNN)的输入、现有预处理及下采样算法无法有效感知并保留原始路面图像中低占比的病害区域信息等问题,借助于可视化解释的技术手段,设计了一种即插即用的图像内容自适应感知模块(adaptive perception module,APM),既平衡了高分辨路面图像与CNN输入限制,又能够自适应感知激活前景病害区域,从而实现高分辨路面图像中病害类型的快速准确检测,构建可信路面病害视觉检测软件系统. APM利用大卷积核和下采样残差操作降低原始图像分辨率并获取图像浅层特征表示;通过注意力机制自适应感知并激活图像中路面病害区域信息,过滤无关的背景信息. 利用联合学习的方式,无需额外监督信息完成对APM的训练. 通过可视化解释方法辅助选择和设计APM的具体结构,在最新公开数据集CQU-BPMDD上的实验结果表明:APM相比于现有的图像预处理采样算法均有明显提升,分类准确率最高为84.47%;在CQU-BPDD上的实验结果及APM决策效果可视化分析表明APM具备良好的泛化性与鲁棒性. 实验代码已开源:https://github.com/Li-Ao-Git/apm.

       

      Abstract: To address the challenges of using high-resolution pavement images as input for existing convolutional neural network models and the inability of existing preprocessing algorithms to effectively perceive and retain information from low-ratio distress regions in original pavement images, a novel architectural unit called adaptive perception module (APM) paying greater attention to pavement distress region is proposed with the help of visual interpretation techniques, which achieves a rapid and accurate detection of pavement distress in high-resolution images and could be used to build a software system for automatic detection of pavement distress based on computer vision. Firstly, big kernel convolution and residual operations are used to reduce the origin image resolution and get the low-level but rich feature representation. Secondly, attention mechanism is developed to perceive and activate the region of pavement distress and filter the irrelevant background pixel noise. By means of joint learning, APM training could be completed without additional cost. After the visual interpretation method is used to aid the selection and design of the specific structure of APM, experimental results on the latest public dataset CQU-BPMDD show that the proposed APM significantly improves the classification accuracy, up to 84.47%. Experiments across different datasets CQU-BPDD demonstrate the generalization and robustness of APM. Code is available on https://github.com/Li-Ao-Git/apm.

       

    /

    返回文章
    返回