Fan Zhengguang, Qu Dan, Yan Honggang, Zhang Wenlin. Joint Acoustic Modeling of Multi-Features Based on Deep Neural Networks[J]. Journal of Computer Research and Development, 2017, 54(5): 1036-1044. DOI: 10.7544/issn1000-1239.2017.20160031

Joint Acoustic Modeling of Multi-Features Based on Deep Neural Networks

  • Published Date: April 30, 2017
  • Different acoustic features carry complementary yet related information, which motivates a joint acoustic modeling method for multiple features based on deep neural networks (DNNs). As in DNN multimodal and multitask learning, the method shares some of the DNN hidden layers across the acoustic models built on different features, thereby linking them. Training the models jointly exploits the hidden explanatory factors that the learning tasks have in common, which makes it possible to transfer knowledge across tasks. To shorten training time, low-rank matrix factorization is used to reduce the number of model parameters. Finally, the recognition results obtained with the different acoustic features are combined with the recognizer output voting error reduction (ROVER) algorithm to further improve performance. Continuous speech recognition experiments on the TIMIT database show that joint acoustic modeling outperforms modeling each feature independently. In terms of phone error rate (PER), the ROVER-combined result based on the joint acoustic models yields a relative gain of 4.6% over the result based on the independent acoustic models.
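
The layer sharing and low-rank factorization described in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the feature names (`mfcc`, `plp`), all layer sizes, the choice to factorize only the output layer, and every function and class name below are assumptions made for illustration. Each feature stream gets its own input and output layers, while the hidden stack in between is a single set of weights used by both streams.

```python
import math
import random


def dense_params(n_in, n_out):
    """Parameter count of a full layer: weights plus biases."""
    return n_in * n_out + n_out


def low_rank_params(n_in, n_out, rank):
    """Parameter count when the weight matrix is factorized through `rank` units."""
    return n_in * rank + rank * n_out + n_out


def make_layer(n_in, n_out, rng):
    """Random weights (one row per output unit) and a zero bias; illustration only."""
    scale = 1.0 / math.sqrt(n_in)
    weights = [[rng.uniform(-scale, scale) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def affine(layer, x):
    weights, bias = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def sigmoid(xs):
    return [1.0 / (1.0 + math.exp(-v)) for v in xs]


def softmax(xs):
    m = max(xs)
    exps = [math.exp(v - m) for v in xs]
    total = sum(exps)
    return [e / total for e in exps]


class JointAcousticModel:
    """Feature-specific input and output layers around one shared hidden stack."""

    def __init__(self, feature_dims, hidden, n_shared, n_states, rank, seed=0):
        rng = random.Random(seed)
        self.inputs = {n: make_layer(d, hidden, rng) for n, d in feature_dims.items()}
        # One list of hidden layers, reused by every feature stream.
        self.shared = [make_layer(hidden, hidden, rng) for _ in range(n_shared)]
        # Low-rank output layer: hidden -> rank (linear, bias left at zero) -> n_states.
        self.bottleneck = {n: make_layer(hidden, rank, rng) for n in feature_dims}
        self.outputs = {n: make_layer(rank, n_states, rng) for n in feature_dims}

    def forward(self, name, x):
        h = sigmoid(affine(self.inputs[name], x))
        for layer in self.shared:
            h = sigmoid(affine(layer, h))
        z = affine(self.bottleneck[name], h)  # no nonlinearity between the two factors
        return softmax(affine(self.outputs[name], z))


# Hypothetical feature streams and toy sizes, chosen only to keep the demo fast.
dims = {"mfcc": 39, "plp": 39}
model = JointAcousticModel(dims, hidden=16, n_shared=2, n_states=10, rank=4)
post_mfcc = model.forward("mfcc", [0.1] * 39)
post_plp = model.forward("plp", [0.2] * 39)
print(len(post_mfcc), len(post_plp))

# Illustrative parameter counts for one 1024 -> 2000 layer, full versus rank 128.
print(dense_params(1024, 2000), low_rank_params(1024, 2000, 128))
```

In a trainable version, gradients from both feature streams would update the shared layers, which is where any transfer between the tasks would come from. The per-stream posteriors would then be decoded separately and the resulting hypotheses combined, for example with ROVER as the abstract describes.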