𝔖 Bobbio Scriptorium
✦   LIBER   ✦

New temporal features for robust speech recognition with emphasis on microphone variations

✍ Scribed by Jia-lin Shen; Wen L. Hwang


Publisher
Elsevier Science
Year
1999
Tongue
English
Weight
131 KB
Volume
13
Category
Article
ISSN
0885-2308

No coin nor oath required. For personal study only.

✦ Synopsis


Although the delta and RASTA methods have been widely used in extracting the temporal properties of stationary features for robust speech recognition, there is still a need to investigate new temporal features for better performance. In this paper, we present two new temporal features for robust processing of speech signals with emphasis on microphone variations. First, the temporal feature is derived from a bank of RASTA-like filters, in which the parameters of each filter in this bank are estimated according to the statistical properties of the speech signals.

Secondly, a parametrized temporal filter (called a PTF) is proposed. The filter can be described by four parameters: the passband, the beginning transition, the ending transition and the smoothness of the magnitude of the filter response. Together, these parameters determine the magnitude of the frequency response of the PTF, and a transformation algorithm is then used to derive the temporal coefficients with real and causal characteristics. The discriminative ability of the PTF features can be further enhanced using the minimum classification error (MCE) algorithm. Experimental results show that the RASTA features are inferior to the PTF features both in quiet conditions and in the presence of microphone variations.