𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Combination of GMM-based speech estimation method and temporal domain SVD-based speech enhancement for noise robust speech recognition

✍ Scribed by Masakiyo Fujimoto; Yasuo Ariki


Publisher
John Wiley and Sons
Year
2007
Tongue
English
Weight
743 KB
Volume
38
Category
Article
ISSN
0882-1666

No coin nor oath required. For personal study only.

✦ Synopsis


Abstract

This study proposes a speech recognition method which is made robust to noise by combining speech signal estimation based on GMM and speech enhancement based on SVD in the temporal domain. Conventional speech signal estimation based on GMM has the problems that the time dependence of the noise is not considered and the performance is degraded in a low‐SNR environment. As regards the first problem, successive updating of the mean noise vector is performed in this study to follow the time variation of the noise. As regards the second problem, an attempt is made to improve performance by improving the SNR beforehand by means of speech enhancement based on SVD in the time domain. Furthermore, in speech enhancement based on SVD in the time domain, the over‐subtraction factor for the noise component is introduced in order to minimize the effect of noise, and adaptive determination of the factor is considered. The proposed method is evaluated using the AURORA2 database, and it is shown that the speech recognition accuracy is improved compared to conventional speech signal estimation based on GMM. © 2007 Wiley Periodicals, Inc. Syst Comp Jpn, 38(3): 23–38, 2007; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/scj.20487