Abstract:
The task of generating human–scene interaction motions spans multiple disciplines, including computer vision, computer graphics, and robotics. Its primary goal is to use deep learning algorithms to model and learn the relationships between humans and scenes from large-scale interaction motion data, thereby generating diverse human motions that interact with indoor scenes or the objects within them, such as obstacle-avoiding navigation, human–chair interaction, and object grasping. Compared with traditional physics-based simulation approaches, data-driven methods for interaction motion generation do not rely on physics simulation engines, offering higher computational efficiency and stronger generalization capability. As a result, these methods hold broad application prospects in areas such as game design, film production, and human–computer interaction. Nevertheless, research on human–scene interaction motion generation has not yet been the subject of a systematic review or synthesis. To address this gap, this survey systematically organizes and explains the core advances in the field. Specifically, it first introduces data representations for 3D humans and scenes. Building on this foundation, it then summarizes the different types of interaction tasks and their associated technical challenges, and presents the key characteristics of relevant benchmark datasets and evaluation metrics. Finally, it highlights the limitations of existing technical approaches and discusses potential breakthrough directions and development paths for future research.