Hidden Markov model training with contaminated speech material for distant-talking speech recognition
- Authors: Matassoni, Marco; Omologo, Maurizio; Giuliani, Diego; Svaizer, Piergiorgio
- Publisher
- Academic Press
- Year
- 2002
- Language
- English
- Size
- 280 KB
- Volume
- 16
- Category
- Article
- ISSN
- 0885-2308
Synopsis
A challenging scenario is addressed in which a distant-talking speech recognizer, supported by model adaptation, operates in a noisy office environment. Both a single far microphone and a microphone array are investigated as input.
In addition to the benefits from the application of microphone array processing, system robustness is improved by training hidden Markov models (HMMs) with a contaminated version of a clean corpus. This artificial corpus is produced by exploiting information extracted from "real world" acoustic scenarios. The resulting models are then used as a starting point for unsupervised incremental adaptation.
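The contamination idea described above is commonly realized by convolving clean utterances with a room impulse response measured in the target environment and adding background noise at a controlled SNR. The following is a minimal sketch of that procedure, not the authors' actual pipeline; the function name, synthetic signals, and parameter choices are illustrative assumptions.

```python
import numpy as np

def contaminate(clean, rir, noise, snr_db):
    """Simulate distant-talking speech: convolve clean speech with a
    room impulse response (RIR), then add background noise at a target
    SNR. All arguments and names here are illustrative, not from the paper."""
    # Reverberate the clean signal by filtering it with the RIR.
    reverberant = np.convolve(clean, rir)[: len(clean)]
    noise = noise[: len(reverberant)]
    # Scale the noise so the reverberant-to-noise power ratio equals snr_db.
    sig_pow = np.mean(reverberant ** 2)
    noise_pow = np.mean(noise ** 2)
    scale = np.sqrt(sig_pow / (noise_pow * 10.0 ** (snr_db / 10.0)))
    return reverberant + scale * noise

# Synthetic stand-ins for a clean utterance, a measured office RIR,
# and recorded background noise (16 kHz, 1 s).
rng = np.random.default_rng(0)
clean = np.sin(2 * np.pi * 440 * np.arange(16000) / 16000)
rir = np.exp(-np.arange(800) / 200.0) * rng.standard_normal(800)
noise = rng.standard_normal(16000)

noisy = contaminate(clean, rir, noise, snr_db=10.0)
```

Models trained on such artificially contaminated data then serve as the starting point for the unsupervised incremental adaptation discussed in the synopsis.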
Experimental results show that the improvements in recognition accuracy due to multiple microphones, HMM training on contaminated speech, and incremental adaptation are additive on a connected digits task. Moreover, the results show that unsupervised incremental adaptation benefits from starting with models trained on contaminated speech. A final contribution of the paper concerns the influence of speech activity detection accuracy, which appears to be relevant when moving towards real applications.
SIMILAR PUBLICATIONS
This paper describes, and evaluates on a large scale, the lattice-based framework for discriminative training of large vocabulary speech recognition systems based on Gaussian mixture hidden Markov models (HMMs). This paper concentrates on the maximum mutual information estimation (MMIE) criterion wh…
A discrete wavelet transform algorithm successively decomposes the input data set. It generates intermediate representations at graded resolutions and leads to a transform domain within which information is multiply resolved in terms of the time-frequency localization of the comp…
This paper describes a speech recognizer based on an HMM representation of quantized articulatory features and presents experimental results for its evaluation. Traditional schemes for HMM representation of speech have attempted to model a set of disjoint time segments. In order to create a more rob…