Indoor Scene Understanding by Fusing Multi-View RGB-D Image Frames
Abstract
For intelligent robots, the ability to understand the surrounding environment correctly is both important and challenging, which makes scene understanding a key problem in the robotics community. In the future, more and more households will have service robots living with them. Such robots need to perceive and understand their surroundings reliably and autonomously, relying on on-board sensors and scene understanding algorithms. Specifically, a working robot has to recognize various objects and the relations between them in order to carry out tasks autonomously and interact intelligently with humans. The RGB-D (RGB and depth) sensors commonly used by robots to capture color and depth information have a limited field of view, so in large indoor spaces it is often impossible to capture the whole scene in a single image. Fortunately, a robot can move to different locations and acquire RGB-D images from multiple viewpoints that together cover the entire scene. For this situation, we propose an indoor scene understanding algorithm based on the fusion of multi-view RGB-D images. The algorithm detects objects and extracts object relations in each single RGB-D image, detects instance-level objects across multiple RGB-D frames, and constructs an object-relation-oriented topological map as the model of the whole scene. By dividing the RGB-D images into cells and extracting color histogram features from those cells, we find and associate the same objects across frames with an object instance detection algorithm based on the longest common subsequence, which overcomes the adverse effect of the RGB-D camera's viewpoint changes on image fusion. Finally, experimental results on the NYUv2 dataset demonstrate the effectiveness of the proposed algorithm.
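The following is a minimal sketch (not the authors' implementation) of the longest-common-subsequence idea described in the abstract: each detected object region is split into a grid of cells, a color histogram is extracted per cell, and two detections from different frames are associated as the same instance when the LCS of similar cells is long enough. The grid size, number of bins, histogram-intersection similarity measure, and all thresholds below are illustrative assumptions; the abstract does not specify them.

```python
import numpy as np

def cell_histograms(region, grid=(4, 4), bins=8):
    """Split an object region (H x W x 3 color image) into grid cells
    and return one normalized color histogram per cell, row-major.
    Grid size and bin count are assumed, not from the paper."""
    h, w = region.shape[:2]
    hists = []
    for i in range(grid[0]):
        for j in range(grid[1]):
            cell = region[i * h // grid[0]:(i + 1) * h // grid[0],
                          j * w // grid[1]:(j + 1) * w // grid[1]]
            hist, _ = np.histogram(cell, bins=bins, range=(0, 256))
            hists.append(hist / max(hist.sum(), 1))  # normalize; guard empty cells
    return hists

def similar(h1, h2, thresh=0.7):
    """Histogram intersection similarity; the cutoff is an assumption."""
    return np.minimum(h1, h2).sum() >= thresh

def lcs_length(cells_a, cells_b):
    """Classic dynamic-programming LCS, with histogram similarity
    standing in for element equality."""
    n, m = len(cells_a), len(cells_b)
    dp = np.zeros((n + 1, m + 1), dtype=int)
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            if similar(cells_a[i - 1], cells_b[j - 1]):
                dp[i, j] = dp[i - 1, j - 1] + 1
            else:
                dp[i, j] = max(dp[i - 1, j], dp[i, j - 1])
    return dp[n, m]

def same_instance(region_a, region_b, match_ratio=0.6):
    """Associate two detections from different frames as one object
    instance when the LCS covers enough of the cell sequence (the
    ratio is assumed). Matched instances would then become single
    nodes in the scene's topological map."""
    ca, cb = cell_histograms(region_a), cell_histograms(region_b)
    return lcs_length(ca, cb) >= match_ratio * min(len(ca), len(cb))
```

Because the LCS tolerates cells missing from either sequence, partial occlusion or a shifted viewpoint that hides some cells does not prevent a match, which is the property the abstract relies on for viewpoint-robust fusion.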