高级检索

    DiffAD:基于差分卷积注意力的多类无监督异常检测方法

    DiffAD: Difference Convolution Attention Leveraged Multi-class Unsupervised Anomaly Detection Method

    • 摘要: 本文提出一种新颖的多类别无监督异常检测方法——DiffAD,以应对复杂工业场景下视觉异常检测所面临的标注匮乏与精度不足等挑战.该方法采用渐进式特征重建策略,其核心在于设计了一个特征重建组件DADE,该组件创新性地将重建过程划分为混沌、预细化和强细化三个阶段,有效提升了重建质量与过程稳定性.DADE的突出创新点在于融合差分卷积注意力与细节增强机制,通过整合差分卷积与多头自注意力机制,并借助残差密集连接,显著增强了对图像细微变化及高频信息的捕捉能力,进而提升了异常定位的精准度.在MVTec-AD、VisA、MVTec-3D和Uni-Medical四个代表性数据集上的广泛实验表明,DiffAD在图像级和像素级异常检测指标上整体表现显著优于现有主流方法,充分彰显了其在无监督视觉检测领域的实际应用价值与潜力.

       

      Abstract: This paper proposes a novel multi-class unsupervised anomaly detection method, DiffAD, to address the challenges of label scarcity and insufficient accuracy in visual anomaly detection within complex industrial scenarios. The method employs a progressive feature reconstruction strategy, with its core being the design of a feature reconstruc-tion component named DADE. This component innovatively divides the reconstruction process into three stages: chaotic, pre-refinement, and strong refinement, effectively enhancing reconstruction quality and process stability. The outstanding innovation of DADE lies in the integration of differential convolutional attention and a detail enhance-ment mechanism. By combining differential convolutions with multi-head self-attention mechanisms and leveraging residual dense connections, it significantly improves the ability to capture subtle variations and high-frequency information in images, thereby enhancing the precision of anomaly localization. Extensive experiments on four repre-sentative datasets, MVTec-AD, VisA, MVTec-3D, and Uni-Medical, demonstrate that DiffAD outperforms existing mainstream methods in both image-level and pixel-level anomaly detection metrics, comprehensively demonstrating its practical application value and potential in the field of unsupervised visual detection.

       

    /

    返回文章
    返回