高级检索

    基于深度学习的2D虚拟人驱动技术综述

    Survey of 2D Virtual Human Driving Technology Based on Deep Learning

    • 摘要: 近年来,随着深度学习和多模态信息融合技术的快速发展,现有2D虚拟人驱动技术能够根据输入的视频、语音、表情和姿态等信息,驱动指定人物生成逼真的表情和动作,为虚拟娱乐、在线教育和智能交互等场景带来了极大的便利。综述系统地梳理了基于深度学习的2D虚拟人驱动技术的发展现状,概括了其原理与所采用的基础模型。同时,根据驱动部位及实现架构的不同,对驱动技术进行分类与概述,重点阐述并对比了近年来的主要方法,介绍了集成驱动的研究状况与发展前景。此外,还深入探讨了当前研究面临的生成逼真性、数据集质量、模型实时性等方面的技术挑战,旨在为相关领域研究人员提供较为全面的技术梳理与展望,帮助推动该领域的进一步发展。

       

      Abstract: In recent years, the rapid advancement of deep learning and multimodal information fusion has significantly propelled the development of 2D virtual human driving technologies. Existing methods can now generate highly realistic facial expressions and body movements for specified characters by leveraging diverse input data such as video, audio, facial cues, and poses. This remarkable progress offers substantial convenience for applications in virtual entertainment, online education, and intelligent interaction. This survey provides a systematic review of the current state and evolutionary trajectory of deep learning-based 2D virtual human driving methodologies. It thoroughly outlines the underlying principles and fundamental models commonly employed in this domain. The driving techniques are categorized and summarized according to both the target driving components (e.g., face, body) and the corresponding architectural frameworks, with a focused comparison and detailed elaboration of prominent methods developed in recent years. The research status and development prospects of holistic driving systems are also introduced. Furthermore, this paper delves into the critical technical challenges currently faced by the field, including generation realism, the quality of datasets, and the real-time performance of models. The aim of this survey is to offer a comprehensive technological overview and perspective for researchers in related fields, thereby contributing to the further advancement of this domain.

       

    /

    返回文章
    返回