k-TSS language models in speech recognition systems
Authors: I. Torres; A. Varona
- Publisher: Elsevier Science
- Year: 2001
- Language: English
- File size: 405 KB
- Volume: 15
- Category: Article
- ISSN: 0885-2308
Abstract
The aim of this work is to show that stochastic regular grammars can generate accurate language models which are well integrated, allocated and handled in a continuous speech recognition system. For this purpose, a syntactic version of the well-known n-gram model, called k-testable language in the strict sense (k-TSS), is used. The complete definition of a k-TSS stochastic finite state automaton is provided in the paper. One of the difficulties arising in representing a language model through a stochastic finite state network is that the recursive schema involved in the smoothing procedure must be adapted to the finite state formalism to achieve an efficient implementation of the backing-off mechanism. Applying this syntactic back-off smoothing technique to k-TSS language modelling yields a self-contained smoothed model that integrates several k-TSS automata into a single smoothed and integrated model, which is also fully defined in the paper. The proposed formulation leads to a very compact representation of the model parameters learned at training time: the probability distribution and the model structure. The dynamic expansion of this structure at decoding time allows efficient integration into a continuous speech recognition system using a one-step decoding procedure.
An experimental evaluation of the proposed formulation was carried out on two Spanish corpora. These experiments showed that regular grammars generate accurate language models (k-TSS) that can be efficiently represented and managed in real speech recognition systems, even for high values of k, leading to very good system performance.
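To make the abstract's idea concrete, the following is a minimal sketch of the k-gram counting and recursive back-off that a k-TSS model is built on: states are the last (k-1) observed words, transitions carry discounted maximum-likelihood probabilities, and unseen transitions follow a back-off arc to the next-lower-order state. All function names are illustrative; this uses a simple absolute-discounting scheme whose back-off weight is not fully renormalised, unlike the paper's exact syntactic formulation.

```python
from collections import defaultdict

def train_counts(sentences, k):
    """Collect transition counts for every order 1..k.
    A state of order o is the tuple of the last (o-1) words."""
    counts = {o: defaultdict(lambda: defaultdict(int)) for o in range(1, k + 1)}
    for words in sentences:
        # pad with sentence-begin symbols, close with a sentence-end symbol
        seq = ["<s>"] * (k - 1) + list(words) + ["</s>"]
        for i in range(k - 1, len(seq)):
            for order in range(1, k + 1):
                state = tuple(seq[i - order + 1:i])
                counts[order][state][seq[i]] += 1
    return counts

def prob(counts, state, word, discount=0.5):
    """Back-off probability of `word` leaving `state`:
    discounted ML estimate at the longest matching state; the mass
    freed by discounting is pushed through a back-off arc to the
    shorter state (normalisation simplified for brevity)."""
    order = len(state) + 1
    seen = counts[order].get(state)
    if not seen:
        # state never observed in training: follow the back-off arc
        return prob(counts, state[1:], word, discount) if state else 1e-9
    total = sum(seen.values())
    if word in seen:
        return (seen[word] - discount) / total
    if not state:
        return 1e-9  # out-of-vocabulary floor at the unigram level
    # mass freed by discounting, spread over the lower-order model
    alpha = discount * len(seen) / total
    return alpha * prob(counts, state[1:], word, discount)
```

For example, with `counts = train_counts([["a", "b"], ["a", "c"]], 2)`, the seen transition probability `prob(counts, ("a",), "b")` is 0.25, while the unseen `prob(counts, ("a",), "</s>")` is obtained through the back-off arc to the unigram state. Representing these back-off arcs as ordinary transitions in the automaton is what lets the paper keep a single self-contained smoothed network rather than separate models per order.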