A Survey on Algorithm Research of Scene Parsing Based on Deep Learning

Zhang Rui^1,2, Li Jintao¹

doi:10.7544/issn1000-1239.2020.20190513

¹(Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190)
²(Cambricon Tech. Ltd, Shanghai 200135) (zhangrui@ict.ac.cn)

Funds: This work was supported by the National Key Research and Development Program of China (2017YFA0700900, 2017YFA0700902, 2017YFA0700901, 2017YFB1003101), the National Natural Science Foundation of China (61432016, 61532016, 61672491, 61602441, 61602446, 61732002, 61702478, 61732007, 61732020), the Beijing Natural Science Foundation (JQ18013), the National Basic Research Program of China (973 Program) (2015CB358800), the National Science and Technology Major Projects of Hegaoji (2018ZX01031102), the Transformation and Transfer of Scientific and Technological Achievements of the Chinese Academy of Sciences (KFJ-HGZX-013), the Key Research Projects in Frontier Science of the Chinese Academy of Sciences (QYZDB-SSW-JSC001), the Strategic Priority Research Program of the Chinese Academy of Sciences (XDB32050200, XDC01020000), and the Standardization Research Project of the Chinese Academy of Sciences (BZ201800001).

CLC number: TP391
Publication date: 2020-03-31

Abstract: Scene parsing aims to predict the category of each pixel in a scene image. It is one of the fundamental problems in computer vision, is of great significance for analyzing and understanding scene images, and has a wide range of applications in fields such as autonomous driving, video surveillance, and augmented reality. In recent years, scene parsing based on deep learning has made breakthrough progress and achieves substantially higher segmentation accuracy than traditional scene parsing algorithms. In this survey, we first analyze and describe the three main difficulties of scene parsing: fine-grained segmentation, diverse scale variation, and strong spatial correlation. We then focus on the “convolutional-deconvolutional” framework adopted by most deep learning based scene parsing algorithms. On this basis, we review the deep learning based scene parsing algorithms proposed in recent years, which address the three difficulties with methods based on high-resolution semantic feature maps, multi-scale information, and spatial context, respectively. We then briefly introduce the commonly used public scene parsing datasets. Finally, we summarize deep learning based scene parsing algorithms and discuss directions for future research.
- scene parsing /
- image segmentation /
- deep learning /
- neural network /
- fully convolutional network
