Dynamic Bayesian networks for multi-band automatic speech recognition

✍ Scribed by Khalid Daoudi; Dominique Fohr; Christophe Antoine

Publisher: Elsevier Science
Year: 2003
Tongue: English
Weight: 292 KB
Volume: 17
Category: Article
ISSN: 0885-2308
DOI: 10.1016/s0885-2308(03)00011-1

✦ Synopsis

This paper presents a new approach to multi-band automatic speech recognition that overcomes many limitations of classical multi-band systems. The principle of this approach is to build a speech model in the time-frequency domain using the formalism of dynamic Bayesian networks. In contrast to classical multi-band modeling, this formalism leads to a probabilistic speech model that allows communication between the different sub-bands; consequently, no recombination step is required in recognition. We develop efficient learning and decoding algorithms for both isolated and continuous speech recognition. We present illustrative experiments on isolated and connected digit recognition tasks. These experiments show that this new approach is very promising in the field of noisy speech recognition.

📜 SIMILAR VOLUMES

Bayesian network structures and inference techniques for automatic speech recognition

✍ Geoffrey Zweig 📂 Article 📅 2003 🏛 Elsevier Science 🌐 English ⚖ 350 KB

This paper describes the theory and implementation of Bayesian networks in the context of automatic speech recognition. Bayesian networks provide a succinct and expressive graphical language for factoring joint probability distributions, and we begin by presenting the structures that are appropriate

Dynamic-gain-tilt-free S-band optical amplifiers employing silica-based phosphorous/alumina-codoped EDFs designed for multi-wavelength photonic networks

✍ Motoki Kakui; Masahiro Takagi; Shinji Endo; Shinji Ishikawa; Masayuki Shigematsu 📂 Article 📅 2005 🏛 Elsevier Science 🌐 English ⚖ 463 KB