• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Yu Haitao, Yang Xiaoshan, Xu Changsheng. Antagonistic Video Generation Method Based on Multimodal Input[J]. Journal of Computer Research and Development, 2020, 57(7): 1522-1530. DOI: 10.7544/issn1000-1239.2020.20190479
Citation: Yu Haitao, Yang Xiaoshan, Xu Changsheng. Antagonistic Video Generation Method Based on Multimodal Input[J]. Journal of Computer Research and Development, 2020, 57(7): 1522-1530. DOI: 10.7544/issn1000-1239.2020.20190479

Antagonistic Video Generation Method Based on Multimodal Input

Funds: This work was supported by the National Key Research and Development Program of China (2018AAA0100604), the National Natural Science Foundation of China (61702511, 61720106006, 61728210, 61751211, U1836220, U1705262, 61872424), and the Research Program of National Laboratory of Pattern Recognition (Z-2018007).
More Information
  • Published Date: June 30, 2020
  • Video generation is an important and challenging task in the field of computer vision and multimedia. The existing video generation methods based on generative adversarial networks (GANs) usually lack an effective scheme to control the coherence of video. The realization of artificial intelligence algorithms that can automatically generate real video is an important indicator of more complete visual appearance information and motion information understanding.A new multi-modal conditional video generation model is proposed in this paper. The model uses pictures and text as input, gets the motion information of video through text feature coding network and motion feature decoding network, and generates video with coherence motion by combining the input images. In addition, the method predicts video frames by affine transformation of input images, which makes the generated model more controllable and the generated results more robust. The experimental results on SBMG (single-digit bouncing MNIST gifs), TBMG(two-digit bouncing MNIST gifs) and KTH(kungliga tekniska hgskolan human actions) datasets show that the proposed method performs better on both the target clarity and the video coherence than existing methods. In addition, qualitative evaluation and quantitative evaluation of SSIM(structural similarity index) and PSNR(peak signal to noise ratio) metrics demonstrate that the proposed multi-modal video frame generation network plays a key role in the generation process.
  • Related Articles

    [1]Zhang Qiang, Ye Ayong, Ye Guohua, Deng Huina, Chen Aimin. k-Anonymous Data Privacy Protection Mechanism Based on Optimal Clustering[J]. Journal of Computer Research and Development, 2022, 59(7): 1625-1635. DOI: 10.7544/issn1000-1239.20210117
    [2]Xu Yuming, Zhu Ningbo, Ouyang Aijia, and Li Kenli. A Double-Helix Structure Genetic Algorithm for Task Scheduling on Heterogeneous Computing Systems[J]. Journal of Computer Research and Development, 2014, 51(6): 1240-1252.
    [3]Hu Xinping, He Yuzhi, Ni Weiwei, and Zhang Yong. A Privacy-Preserving Data Publishing Method Based on Genetic Algorithm with Roulette Wheel[J]. Journal of Computer Research and Development, 2012, 49(11): 2432-2439.
    [4]Shen Haihua, Wang Pengyu, Wei Wenli, Guo Qi. A Coverage Directed Test Generation Platform for Microprocessors Using Genetic Approach[J]. Journal of Computer Research and Development, 2009, 46(10): 1612-1625.
    [5]Song Jinling, Liu Guohua, Huang Liming, Zhu Caiyun. Algorithms to Find the Set of Relevant Views and Quasi-Identifiers for K-Anonymity Method[J]. Journal of Computer Research and Development, 2009, 46(1): 77-88.
    [6]Li Qinghua, Yang Shida, and Ruan Youlin. Improving Optimization for Genetic Algorithms Based on Level Set[J]. Journal of Computer Research and Development, 2006, 43(9): 1624-1629.
    [7]Yi Yeqing, Lin Yaping, Lin Mu, Li Xiaolong, Wang Lei. Blind Source Separation Based on Genetic Algorithm[J]. Journal of Computer Research and Development, 2006, 43(2): 244-252.
    [8]Wang Fei, Liu Dayou, Wang Songxin. Research on Incremental Learning of Bayesian Network Structure Based on Genetic Algorithms[J]. Journal of Computer Research and Development, 2005, 42(9): 1461-1466.
    [9]Han Xuedong, Hong Bingrong, and Meng Wei. An Immune Genetic Algorithm Based on Vaccine Autonomous Obtaining and Updating[J]. Journal of Computer Research and Development, 2005, 42(5): 740-745.
    [10]Zhang Min, Lin Chuan, and Ma Shaoping. Dynamic Parameter Learning Approach for Information Retrieval with Genetic Algorithm[J]. Journal of Computer Research and Development, 2005, 42(3).

Catalog

    Article views (1199) PDF downloads (332) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return