✦ LIBER ✦

Statistical methods in multi-speaker automatic speech recognition

✍ Scribed by Boyer, A. ;Di Martino, J. ;Divoux, P. ;Haton, J. P. ;Mari, J. F. ;Smaili, K.

Publisher: John Wiley and Sons
Year: 1990
Tongue: English
Weight: 632 KB
Volume: 6
Category: Article
ISSN: 8755-0024
DOI: 10.1002/asm.3150060302

No coin nor oath required. For personal study only.

✦ Synopsis

Automatic speech recognition and understanding (ASR) plays an important role in the framework of man-machine communication. Substantial industrial developments are at present in progress in this area. Hotteker, after 40 years or so of efforts several fundamental questions remain open. This paper is concerned with a comparative study of four different methods for multi-speaker word recognition: (i) clustering of acoustic templates, (ii) comparison with a finite state automaton, (iii) dynamic programming and vector quantization, (iv) stochastic Markov sources. In order to make things comparable, the four methods were tested with the same material made up of the ten digits (0 to 9) pronounced four times by 60 different speakers (30 males and 30 females). We will distinguish in our experiments between multispeaker systems (capable of recognizing words pronounced by speakers that have been used during the training phase of the system) and speaker-independent systems (capable of recognizing words pronounced by speakers totally unknown to the system). Half of the corpus (15 male and 15 female) were used for training, and the remaining part for test. hEI \ ORDS Automatic speech recognition Multi-speaker Markov models Dynamic programming Clustering

📜 SIMILAR VOLUMES

Predictive speaker adaptation in speech

Predictive speaker adaptation in speech recognition

✍ Stephen Cox 📂 Article 📅 1995 🏛 Elsevier Science 🌐 English ⚖ 172 KB

A major problem with most speaker adaptation schemes is that they rely on the speaker providing at least one example of each acoustic unit (word, phone, triphone, etc.) in the vocabulary in order to adapt the appropriate model. Rapid adaptation is difficult to achieve and some sounds may never be ad

Automatic speech recognition in machine-

Automatic speech recognition in machine-aided translation

✍ P.F. Brown; S.F. Chen; S.A. Della Pietra; V.J. Della Pietra; A.S. Kehler; R.L. M 📂 Article 📅 1994 🏛 Elsevier Science 🌐 English ⚖ 368 KB

Modelling error recovery and repair in a

Modelling error recovery and repair in automatic speech recognition

✍ C. Baber; K.S. Hone 📂 Article 📅 1993 🏛 Elsevier Science ⚖ 682 KB

While automatic speech recognition (ASR) has achieved some level of success, it often fails to live up to its hype. One of the principal reasons for this apparent failure is the prevalence of "recognition errors". This makes error correction a topic of increasing importance to ASR system development

Automatic selection of phonetically dist

Automatic selection of phonetically distributed sentence sets for speaker adaptation with application to large vocabulary Mandarin speech recognition

✍ Jia-lin Shen; Hsin-min Wang; Ren-yuan Lyu; Lin-shan Lee 📂 Article 📅 1999 🏛 Elsevier Science 🌐 English ⚖ 163 KB

This paper presents an approach of automatic selection of phonetically distributed sentence sets for speaker adaptation, and applies the concept to the task of Mandarin speech recognition with very large vocabulary. This is a different approach to the adaptation data selection problem. A computer al

Comparing Multi-layer Perceptrons and Ra

Comparing Multi-layer Perceptrons and Radial Basis Functions networks in speaker recognition

✍ M.W. Mak; W.G. Allen; G.G. Sexton 📂 Article 📅 1993 🏛 Elsevier Science ⚖ 443 KB

We have compared the performance of Multi-layer Perceptrons networks (MLP) and Radial Basis Function networks (RBF) in the task of speaker identification. The experiments are carried out on 400 utterances ( 10 digits, in English) from 10 speakers. LPC-derived Cepstrum Coefficients are used as the sp

Technical Note: theoretical and simulati

Technical Note: theoretical and simulation approaches to error correction strategies in automatic speech recognition

✍ W.A. Ainsworth 📂 Article 📅 1993 🏛 Elsevier Science ⚖ 109 KB