高级检索

    车载环境下基于样本熵的语音端点检测方法

    Voice Activity Detection Based on Sample Entropy in Car Environments

    • 摘要: 在语音处理中一个关键性问题是如何准确找到语音的起止位置,目前提出许多的语音端点检测算法不能得到理想的检测结果.由于样本熵是近似熵的改进算法,提出车载环境下基于样本熵的语音端点检测方法,并采用模糊C均值聚类算法和贝叶斯信息判决算法进行样本熵特征门限估计,以及使用双门限法进行语音端点检测.在TIMIT连续语音库上的实验表明,车载噪声环境下,样本熵法和近似熵法的检测正确率均远高于谱熵法和能量谱熵法,而样本熵法相对于近似熵法具有更好的检测效果,特别是当信噪比小于等于0dB时,样本熵法的检测性能优于近似熵法近10%.因此,样本熵法在车载智能语音领域具有很好的应用前景,能够为车载导航提供准确的语音端点检测技术.

       

      Abstract: One of the key issues in practical speech processing is to precisely locate endpoints of the input utterance to be free of non-speech regions. Although lots of studies have been performed to solve this problem, the operation of existing voice activity detection (VAD) algorithms is still far away from ideal. This paper proposes a robust feature for VAD method in car environments based on sample entropy (SampEn) which is an improved algorithm of approximate entropy (ApEn). In addition, we adopt fuzzy C means clustering algorithm and Bayesian information criterion algorithm to estimate the thresholds of the SampEn characteristic, and use dual thresholds method for VAD. Experiments on the TIMIT continuous speech database show that, in the car noise environments, the detection accuracy of SampEn and ApEn are both much higher than that of spectral entropy (SE) and energy spectral entropy (ESE). SampEn method has better detection performance than ApEn, especially when the SNR is not more than 0dB, and SampEn method detection performance is superior to ApEn nearly 10%. Therefore, the SampEn method has a good application prospect in automotive field and can provide accurate VAD techniques for car navigation.

       

    /

    返回文章
    返回