• 中国精品科技期刊
  • CCF推荐A类中文期刊
  • 计算领域高质量科技期刊T1类
Advanced Search
Shi Wenhua, Ni Yongjing, Zhang Xiongwei, Zou Xia, Sun Meng, Min Gang. Deep Neural Network Based Monaural Speech Enhancement with Sparse Non-Negative Matrix Factorization[J]. Journal of Computer Research and Development, 2018, 55(11): 2430-2438. DOI: 10.7544/issn1000-1239.2018.20170580
Citation: Shi Wenhua, Ni Yongjing, Zhang Xiongwei, Zou Xia, Sun Meng, Min Gang. Deep Neural Network Based Monaural Speech Enhancement with Sparse Non-Negative Matrix Factorization[J]. Journal of Computer Research and Development, 2018, 55(11): 2430-2438. DOI: 10.7544/issn1000-1239.2018.20170580

Deep Neural Network Based Monaural Speech Enhancement with Sparse Non-Negative Matrix Factorization

More Information
  • Published Date: October 31, 2018
  • In this paper, a monaural speech enhancement method combining deep neural network (DNN) with sparse non-negative matrix factorization (SNMF) is proposed. This method takes advantage of the sparse characteristic of speech signal in time-frequency (T-F) domain and the spectral preservation characteristic of DNN presented in speech enhancement, aiming to resolve the distortion problem introduced by low SNR situation and unvoiced components without structure characteristics in conventional non-negative matrix factorization (NMF) method. Firstly, the magnitude spectrogram matrix of noisy speech is decomposed by NMF with sparse constraint to obtain the corresponding coding matrix coefficients of speech and noise dictionary. The speech and noise dictionary are pre-trained independently. Then Wiener filtering method is used to get the separated speech and noise. DNN is employed to model the non-linear function which maps the log magnitude spectrum of the separated speech from Wiener filter to the target clean speech. Evaluations are conducted on the IEEE dataset, both stationary and non-stationary types of noise are selected to demonstrate the effectiveness of the proposed method. The experimental results show that the proposed method could effectively suppress the noise and preserve the speech component from the corrupted speech signal. It has better performance than the baseline methods in terms of perceptual quality and log-spectral distortion.
  • Related Articles

    [1]Zhao Xiaoyang, Li Zhongnian, Wang Wenyu, Xu Xinzheng. ADIC: An Adaptive Disentangled CNN Classifier for Interpretable Image Recognition[J]. Journal of Computer Research and Development, 2023, 60(8): 1754-1767. DOI: 10.7544/issn1000-1239.202330231
    [2]Xie Kunpeng, Yi Dezhi, Liu Yiqing, Liu Hang, He Xinyu, Gong Cheng, Lu Ye. SAF-CNN:A Sparse Acceleration Framework of Convolutional Neural Network forEmbedded FPGAs[J]. Journal of Computer Research and Development, 2023, 60(5): 1053-1072. DOI: 10.7544/issn1000-1239.202220735
    [3]Yin Zhaoxia, Guo Hongnian, Du Yang, Ma Wenjing, Lü Wanli, Zhang Xinpeng. Multi-Domain Reversible Data Hiding in JPEG Images and Payload Distribution Algorithm[J]. Journal of Computer Research and Development, 2022, 59(8): 1831-1840. DOI: 10.7544/issn1000-1239.20210411
    [4]Liu Yanxiao, Wu Ping, Sun Qindong. Secret Image Sharing Schemes Based on Region Convolution Neural Network[J]. Journal of Computer Research and Development, 2021, 58(5): 1065-1074. DOI: 10.7544/issn1000-1239.2021.20200898
    [5]Wang Yangyang, He Hongjie, Chen Fan, Zhang Shanjun. Reversible Data Hiding in JPEG Images Based on Distortion-Extension Cost[J]. Journal of Computer Research and Development, 2020, 57(11): 2271-2282. DOI: 10.7544/issn1000-1239.2020.20200434
    [6]Xing Xinying, Ji Junzhong, Yao Yao. Brain Networks Classification Based on an Adaptive Multi-Task Convolutional Neural Networks[J]. Journal of Computer Research and Development, 2020, 57(7): 1449-1459. DOI: 10.7544/issn1000-1239.2020.20190186
    [7]Lü Guohao, Luo Siwei, Huang Yaping, Jiang Xinlan. A Novel Regularization Method Based on Convolution Neural Network[J]. Journal of Computer Research and Development, 2014, 51(9): 1891-1900. DOI: 10.7544/issn1000-1239.2014.20140266
    [8]Zhang Zhan, Liu Guangjie, Dai Yuewei, Wang Zhiquan. A Self-Adaptive Image Steganography Algorithm Based on Cover-Coding and Markov Model[J]. Journal of Computer Research and Development, 2012, 49(8): 1668-1675.
    [9]Yang Chunfang, Liu Fenlin, and Luo Xiangyang. Histograms Difference and Quantitative Steganalysis of JPEG Steganography Based on Relative Entropy[J]. Journal of Computer Research and Development, 2011, 48(8): 1563-1569.
    [10]Zheng Qingfang, Gao Wen. Adaptive Skin Detection in JPEG Compressed Images[J]. Journal of Computer Research and Development, 2006, 43(7): 1194-1200.

Catalog

    Article views (1161) PDF downloads (523) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return