• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
高级检索

多媒体内容理解的研究现状与展望

彭宇新, 綦金玮, 黄鑫

彭宇新, 綦金玮, 黄鑫. 多媒体内容理解的研究现状与展望[J]. 计算机研究与发展, 2019, 56(1): 183-208. DOI: 10.7544/issn1000-1239.2019.20180770
引用本文: 彭宇新, 綦金玮, 黄鑫. 多媒体内容理解的研究现状与展望[J]. 计算机研究与发展, 2019, 56(1): 183-208. DOI: 10.7544/issn1000-1239.2019.20180770
Peng Yuxin, Qi Jinwei, Huang Xin. Current Research Status and Prospects on Multimedia Content Understanding[J]. Journal of Computer Research and Development, 2019, 56(1): 183-208. DOI: 10.7544/issn1000-1239.2019.20180770
Citation: Peng Yuxin, Qi Jinwei, Huang Xin. Current Research Status and Prospects on Multimedia Content Understanding[J]. Journal of Computer Research and Development, 2019, 56(1): 183-208. DOI: 10.7544/issn1000-1239.2019.20180770
彭宇新, 綦金玮, 黄鑫. 多媒体内容理解的研究现状与展望[J]. 计算机研究与发展, 2019, 56(1): 183-208. CSTR: 32373.14.issn1000-1239.2019.20180770
引用本文: 彭宇新, 綦金玮, 黄鑫. 多媒体内容理解的研究现状与展望[J]. 计算机研究与发展, 2019, 56(1): 183-208. CSTR: 32373.14.issn1000-1239.2019.20180770
Peng Yuxin, Qi Jinwei, Huang Xin. Current Research Status and Prospects on Multimedia Content Understanding[J]. Journal of Computer Research and Development, 2019, 56(1): 183-208. CSTR: 32373.14.issn1000-1239.2019.20180770
Citation: Peng Yuxin, Qi Jinwei, Huang Xin. Current Research Status and Prospects on Multimedia Content Understanding[J]. Journal of Computer Research and Development, 2019, 56(1): 183-208. CSTR: 32373.14.issn1000-1239.2019.20180770

多媒体内容理解的研究现状与展望

基金项目: 国家自然科学基金项目(61771025,61532005)
详细信息
  • 中图分类号: TP391

Current Research Status and Prospects on Multimedia Content Understanding

  • 摘要: 随着多媒体和网络技术的迅猛发展,海量的图像、视频、文本、音频等多媒体数据快速涌现.这些不同媒体的数据在形式上多源异构,语义上相互关联.认知科学研究表明,人脑生理组织结构决定了其对外界的感知和认知过程是跨越多种感官信息的融合处理.如何对不同媒体的数据进行语义分析和关联建模以实现多媒体内容理解,成为了一个研究和应用的关键问题,受到了学术界和工业界的广泛关注.选取了多媒体内容理解的5个最新热点研究方向:图像细分类与检索、视频分类与目标检测、跨媒体检索、视觉描述与生成、视觉问答,分别阐述了它们的基本概念、代表性方法、研究现状等,并进一步阐述了多媒体内容理解面临的重要挑战,同时给出未来的发展趋势,旨在帮助读者全面了解多媒体内容理解的研究现状,吸引更多研究人员投身相关研究并为他们提供技术参考,推动该领域的进一步发展.
    Abstract: With the rapid development of multimedia and Internet technologies, a large amount of multimedia data has been rapidly emerging, such as image, video, text and audio. Data of different media types from multi-source is heterogeneous in the form but relevant in the semantic. As indicated in the research of cognitive science, the perception and cognition of the environment is through the fusion across different sensory organs of human, which is decided by the human brain’s organization structure. Therefore, it has been a key challenge to perform data semantic analysis and correlation modeling across different media types, for achieving comprehensive multimedia content understanding, which has drawn wide interests of both academic and industrial areas. In this paper, the basic concepts, representative methods and research status of 5 latest highlighting research topics of multimedia content understanding are referred, including fine-grained image classification and retrieval, video classification and object detection, cross-media retrieval, visual description and generation, and visual question answering. This paper further presents the major challenges of multimedia content understanding, as well as gives the development trend in the future. The goal of this paper is to help readers get a comprehensive understanding on the research status of multimedia content understanding, draw more attention of researchers to relevant research topics, and provide the technical insights to promote further development of this area.
  • 期刊类型引用(7)

    1. 李皎,张秀山,宁远航. 降低跨分片交易比例的区块链分片方法. 计算机应用. 2024(06): 1889-1896 . 百度学术
    2. 张驰骋,李雷孝,杜金泽,史建平. 可编辑区块链研究综述. 计算机工程与应用. 2024(18): 32-49 . 百度学术
    3. 孙林昆,蒋文保,郭阳楠,李春强. 基于密码累加器的无状态区块链性能优化. 计算机工程. 2023(02): 46-53 . 百度学术
    4. 姜承扬,庞俊,贾大宇,于明鹤,信俊昌,刘晨. 结合社区发现和局部恢复码的区块链扩容研究. 计算机工程与应用. 2023(05): 297-304 . 百度学术
    5. 邓文丽,方欢. 基于健康数据库的无状态区块链在医疗保健的应用. 哈尔滨商业大学学报(自然科学版). 2023(04): 408-412 . 百度学术
    6. 刘孝保,孙海彬,阴艳超,姚廷强,杨林. 面向制造业产业链图状区块链模型. 计算机集成制造系统. 2023(12): 4267-4281 . 百度学术
    7. 傅丽玉,陆歌皓,吴义明,罗娅玲. 区块链技术的研究及其发展综述. 计算机科学. 2022(S1): 447-461+666 . 百度学术

    其他类型引用(14)

计量
  • 文章访问数:  2404
  • HTML全文浏览量:  11
  • PDF下载量:  1279
  • 被引次数: 21
出版历程
  • 发布日期:  2018-12-31

目录

    /

    返回文章
    返回