• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Wang Yilei, Zhuo Yifan, Wu Yingjie, Chen Mingqin. Question Answering Algorithm on Image Fragmentation Information Based on Deep Neural Network[J]. Journal of Computer Research and Development, 2018, 55(12): 2600-2610. DOI: 10.7544/issn1000-1239.2018.20180606
Citation: Wang Yilei, Zhuo Yifan, Wu Yingjie, Chen Mingqin. Question Answering Algorithm on Image Fragmentation Information Based on Deep Neural Network[J]. Journal of Computer Research and Development, 2018, 55(12): 2600-2610. DOI: 10.7544/issn1000-1239.2018.20180606

Question Answering Algorithm on Image Fragmentation Information Based on Deep Neural Network

More Information
  • Published Date: November 30, 2018
  • Many fragmentation information is highly dispersed in different data sources, such as text, image, video and Web. They are characterized by structural disorder and content one-sided. Current researches implement the extraction, expression and understanding of multi-modal fragmentation information by constructing visual question answering (VQA) system. The VQA task is required to provide the correct answer to a given problem with a corresponding image. The aim of this paper is to design a complete framework and algorithm for image fragmentation information question answering under the basic background of visual question answering task. The main research includes image feature extraction, question text feature extraction, multi-modal feature fusion and answer reasoning. Deep neural network is constructed to extract features for representing images and problem information. Attention mechanism and variational inference method are combined to fusion two modal features of image and problem and reason answers. Experiment results show that the model can effectively extract and understand multi-modal fragmentation information, and improve the accuracy of VQA.
  • Related Articles

    [1]Tian Xuan, Wu Zhichao. Review of Knowledge Base Question Answering Based on Information Retrieval[J]. Journal of Computer Research and Development, 2025, 62(2): 314-335. DOI: 10.7544/issn1000-1239.202331013
    [2]Liu Mingyang, Wang Ruomei, Zhou Fan, Lin Ge. Video Question Answering Scheme Base on Multimodal Knowledge Active Learning[J]. Journal of Computer Research and Development, 2024, 61(4): 889-902. DOI: 10.7544/issn1000-1239.202221008
    [3]Bao Cuizhu, Ding Kai, Dong Jianfeng, Yang Xun, Xie Mande, Wang Xun. Research Progress of Video Question Answering Technologies[J]. Journal of Computer Research and Development, 2024, 61(3): 639-673. DOI: 10.7544/issn1000-1239.202220294
    [4]Chen Jinyin, Chen Yipeng, Chen Yiming, Zheng Haibin, Ji Shouling, Shi Jie, Cheng Yao. Fairness Research on Deep Learning[J]. Journal of Computer Research and Development, 2021, 58(2): 264-280. DOI: 10.7544/issn1000-1239.2021.20200758
    [5]Cheng Keyang, Wang Ning, Shi Wenxi, Zhan Yongzhao. Research Advances in the Interpretability of Deep Learning[J]. Journal of Computer Research and Development, 2020, 57(6): 1208-1217. DOI: 10.7544/issn1000-1239.2020.20190485
    [6]Zhang Yingying, Qian Shengsheng, Fang Quan, Xu Changsheng. Multi-Modal Knowledge-Aware Attention Network for Question Answering[J]. Journal of Computer Research and Development, 2020, 57(5): 1037-1045. DOI: 10.7544/issn1000-1239.2020.20190474
    [7]Zhang Rui, Li Jintao. A Survey on Algorithm Research of Scene Parsing Based on Deep Learning[J]. Journal of Computer Research and Development, 2020, 57(4): 859-875. DOI: 10.7544/issn1000-1239.2020.20190513
    [8]Yu Jun, Wang Liang, Yu Zhou. Research on Visual Question Answering Techniques[J]. Journal of Computer Research and Development, 2018, 55(9): 1946-1958. DOI: 10.7544/issn1000-1239.2018.20180168
    [9]Zhou Ye, Zhang Junping. Multi-Scale Deep Learning for Product Image Search[J]. Journal of Computer Research and Development, 2017, 54(8): 1824-1832. DOI: 10.7544/issn1000-1239.2017.20170197
    [10]Zhang Ruimao, Peng Jiefeng, Wu Yang, Lin Liang. The Semantic Knowledge Embedded Deep Representation Learning and Its Applications on Visual Understanding[J]. Journal of Computer Research and Development, 2017, 54(6): 1251-1266. DOI: 10.7544/issn1000-1239.2017.20171064
  • Cited by

    Periodical cited type(3)

    1. 邹芸竹,杜圣东,滕飞,李天瑞. 一种基于多模态深度特征融合的视觉问答模型. 计算机科学. 2023(02): 123-129 .
    2. 马雪景,王文焕,刘国巍. 人工神经网络在含噪图像边缘检测算法中的应用. 西安工程大学学报. 2021(02): 79-84 .
    3. 石乐义,朱红强,刘祎豪,刘佳. 基于相关信息熵和CNN-BiLSTM的工业控制系统入侵检测. 计算机研究与发展. 2019(11): 2330-2338 . 本站查看

    Other cited types(8)

Catalog

    Article views (2059) PDF downloads (677) Cited by(11)

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return