Shi Wenhua, Ni Yongjing, Zhang Xiongwei, Zou Xia, Sun Meng, Min Gang. Deep Neural Network Based Monaural Speech Enhancement with Sparse Non-Negative Matrix Factorization[J]. Journal of Computer Research and Development, 2018, 55(11): 2430-2438. DOI: 10.7544/issn1000-1239.2018.20170580

Deep Neural Network Based Monaural Speech Enhancement with Sparse Non-Negative Matrix Factorization

  • Published Date: October 31, 2018
  • This paper proposes a monaural speech enhancement method that combines a deep neural network (DNN) with sparse non-negative matrix factorization (SNMF). The method exploits the sparsity of speech signals in the time-frequency (T-F) domain and the spectral-preservation property that DNNs exhibit in speech enhancement. It aims to reduce the distortion that conventional non-negative matrix factorization (NMF) methods introduce at low signal-to-noise ratios (SNR) and for unvoiced components, which lack spectral structure. First, the magnitude spectrogram of the noisy speech is decomposed by NMF with a sparsity constraint to obtain the coding coefficients over the speech and noise dictionaries, which are pre-trained independently. Wiener filtering is then used to obtain the separated speech and noise. Next, a DNN models the non-linear function that maps the log magnitude spectrum of the Wiener-filtered speech to the target clean speech. Evaluations are conducted on the IEEE dataset, with both stationary and non-stationary noise types selected to demonstrate the effectiveness of the method. The experimental results show that the proposed method effectively suppresses noise while preserving the speech components of the corrupted signal, and that it outperforms the baseline methods in terms of perceptual quality and log-spectral distortion.
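
The first two stages described in the abstract (sparse NMF coding over fixed dictionaries, then Wiener-style separation) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The KL-divergence cost with an L1 penalty, the multiplicative update rule, the squared-magnitude mask, and all names and parameter values (`sparse_nmf_activations`, `wiener_separate`, `log_magnitude`, `mu`, `n_iter`) are assumptions chosen for illustration, and the DNN mapping stage is omitted.

```python
import numpy as np

EPS = 1e-10  # guards against division by zero and log(0)


def sparse_nmf_activations(V, W, mu=0.1, n_iter=100, seed=0):
    """Estimate non-negative activations H such that V ~= W @ H, with W fixed.

    Assumed objective: KL(V || W H) + mu * sum(H), minimized with the
    standard multiplicative update for H.
    """
    rng = np.random.default_rng(seed)
    H = rng.random((W.shape[1], V.shape[1])) + EPS
    col_sums = W.sum(axis=0)[:, None]  # W^T 1, shape (rank, 1)
    for _ in range(n_iter):
        ratio = V / (W @ H + EPS)
        H *= (W.T @ ratio) / (col_sums + mu)
    return H


def wiener_separate(V, W_speech, W_noise, mu=0.1, n_iter=100):
    """Split the noisy magnitude spectrogram V into speech and noise estimates."""
    W = np.hstack([W_speech, W_noise])  # concatenated pre-trained dictionaries
    H = sparse_nmf_activations(V, W, mu=mu, n_iter=n_iter)
    k = W_speech.shape[1]
    S = W_speech @ H[:k]  # speech reconstruction
    N = W_noise @ H[k:]   # noise reconstruction
    # Wiener-style gain on squared magnitudes (an assumed mask form).
    mask = S**2 / (S**2 + N**2 + EPS)
    return mask * V, (1.0 - mask) * V


def log_magnitude(X):
    """Log magnitude spectrum, the feature the abstract feeds to the DNN."""
    return np.log(X + EPS)


if __name__ == "__main__":
    # Random placeholders standing in for real dictionaries and spectrograms.
    rng = np.random.default_rng(1)
    W_s = rng.random((257, 40))  # hypothetical speech dictionary
    W_n = rng.random((257, 20))  # hypothetical noise dictionary
    V = rng.random((257, 50))    # hypothetical noisy magnitude spectrogram
    speech, noise = wiener_separate(V, W_s, W_n)
    features = log_magnitude(speech)
    print(features.shape)
```

In this sketch the speech and noise estimates sum back to the noisy spectrogram by construction, since the two masks add to one. A DNN regressor trained on pairs of such log-magnitude features and clean-speech log spectra would then play the role of the mapping stage.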