基于重构难度感知的视频快照压缩成像方法

陈灿; 石鹏飞; 周超; 胡俊; 张登银

doi:10.7544/issn1000-1239.202440681

基于重构难度感知的视频快照压缩成像方法

Video Snapshot Compressive Imaging Method Based on Reconstruction Difficulty Perception

摘要

摘要: 人工智能应用需要有与之适配的高速视频成像新技术以便更好地感知周围环境，基于深度学习的快照压缩成像（snapshot compressive imaging，SCI）提供了一种具有前景的解决方案. 如何利用深度学习技术从观测值中重构高速视频是领域的前沿热点. 然而，现有重构方法注重挖掘先验信息以提升重构质量，忽略了掩码和图像纹理对重构难度的影响，使得重构质量仍有进一步提高的空间. 针对这个问题，提出一种基于重构难度感知的视频快照压缩成像方法（reconstruction difficulty perception-based SCI，RdpSCI）. 基于观测值中包含的信息由掩码和图像纹理共同决定的观察，所提方法提出挖掘掩码和图像纹理与重构难度之间的关联，引导深度网络进行重构. 特别地，提出一种改进残差密集连接网络模块（improved ResDNet，I-ResDNet），通过引入重构难度权重向量引导特征融合，并创新地在ResDNet中引入通道打乱操作，降低特征融合效果对于通道划分方式的依赖，能够在不显著增加模型参数量的情况下增强特征融合能力. 实验表明，RdpSCI相比于领域现有最优方法STFormer和EfficientSCI，在基准灰度数据集和基准彩色数据集上，重构质量分别有0.68 dB和0.54 dB的提升.

Abstract: Artificial intelligence applications require highly advanced high-speed video imaging technologies to perceive the surrounding environment better. Deep learning-based snapshot compressive imaging (SCI) offers a promising solution. How to reconstruct high-speed videos from observed data using deep learning techniques is a frontier hotspot in the field. However, existing reconstruction methods focus on mining prior information, neglecting the direct influence of masks and image textures on reconstruction difficulty. There is still room for further improvement in reconstruction quality. To address this issue, we propose a reconstruction difficulty perception-based SCI (RdpSCI) method. Based on the observation that masks and image textures jointly determine the information contained in the observed data, RdpSCI first proposes to explore the correlation between masks, image textures, and reconstruction difficulty, guiding deep networks for reconstruction. Specifically, it introduces an improved residual dense network (I-ResDNet) module, innovatively incorporating channel shuffling operations into ResDNet to reduce the dependency of feature fusion effects on channel partitioning methods. The proposed I-ResDNet also introduces a reconstruction difficulty weight vector to guide feature fusion, enhancing feature fusion capability without significantly increasing model parameters. Experiments show that compared to the state-of-the-art methods STFormer and EfficientSCI, RdpSCI achieves improvements of 0.68 dB and 0.54 dB in reconstruction quality on benchmark grayscale and colour datasets, respectively.

HTML全文

参考文献(36)

施引文献

资源附件(0)