๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

Predictive speaker adaptation in speech recognition

โœ Scribed by Stephen Cox


Publisher
Elsevier Science
Year
1995
Tongue
English
Weight
172 KB
Volume
9
Category
Article
ISSN
0885-2308

No coin nor oath required. For personal study only.

โœฆ Synopsis


A major problem with most speaker adaptation schemes is that they rely on the speaker providing at least one example of each acoustic unit (word, phone, triphone, etc.) in the vocabulary in order to adapt the appropriate model. Rapid adaptation is difficult to achieve and some sounds may never be adapted because they are never heard. In this paper, a technique of adapting all the speech models to a new speaker's voice when he has given an incomplete set of the vocabulary is presented. The technique is based upon using the training-set to obtain estimates of correlations between sounds. Given some sounds from a new speaker at recognition time, these correlations are used to obtain estimates of unheard sounds which are used to adapt the speech models. The technique was applied to a database of 104 speakers speaking the English alphabet. When speakers spoke half of the vocabulary for enrollment prior to recognition, the technique gave a 78% decrease in error.


๐Ÿ“œ SIMILAR VOLUMES


N-Best-based unsupervised speaker adapta
โœ Tomoko Matsui; Sadaoki Furui ๐Ÿ“‚ Article ๐Ÿ“… 1998 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 251 KB

This paper proposes an instantaneous speaker adaptation method that uses N-best decoding for continuous mixture-density hidden-Markovmodel-based speech-recognition systems. This method is effective even for speakers whose decoding using speaker-independent (SI) models are error-prone and for whom sp

Statistical methods in multi-speaker aut
โœ Boyer, A. ;Di Martino, J. ;Divoux, P. ;Haton, J. P. ;Mari, J. F. ;Smaili, K. ๐Ÿ“‚ Article ๐Ÿ“… 1990 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 632 KB

Automatic speech recognition and understanding (ASR) plays an important role in the framework of man-machine communication. Substantial industrial developments are at present in progress in this area. Hotteker, after 40 years or so of efforts several fundamental questions remain open. This paper is

Speaker-independent speech recognition b
โœ Tetsuo Kosaka; Shoichi Matsunaga; Shigeki Sagayama ๐Ÿ“‚ Article ๐Ÿ“… 1996 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 231 KB

We have already proposed the application of tree-structured speaker clustering to supervised speaker adaptation. This paper proposes its application to unsupervised speaker adaptation and speakerindependent (SI) speech recognition. This clustering involves the selection of a speaker cluster from amo