𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Weighted finite-state transducers in speech recognition

✍ Scribed by Mehryar Mohri; Fernando Pereira; Michael Riley


Book ID
102566878
Publisher
Elsevier Science
Year
2002
Tongue
English
Weight
215 KB
Volume
16
Category
Article
ISSN
0885-2308

No coin nor oath required. For personal study only.

✦ Synopsis


We survey the use of weighted finite-state transducers (WFSTs) in speech recognition. We show that WFSTs provide a common and natural representation for hidden Markov models (HMMs), context-dependency, pronunciation dictionaries, grammars, and alternative recognition outputs. Furthermore, general transducer operations combine these representations flexibly and efficiently. Weighted determinization and minimization algorithms optimize their time and space requirements, and a weight pushing algorithm distributes the weights along the paths of a weighted transducer optimally for speech recognition.

As an example, we describe a North American Business News (NAB) recognition system built using these techniques that combines the HMMs, full cross-word triphones, a lexicon of 40 000 words, and a large trigram grammar into a single weighted transducer that is only somewhat larger than the trigram word grammar and that runs NAB in real-time on a very simple decoder. In another example, we show that the same techniques can be used to optimize lattices for second-pass recognition. In a third example, we show how general automata operations can be used to assemble lattices from different recognizers to improve recognition performance.


πŸ“œ SIMILAR VOLUMES


Springer Handbook of Speech Processing |
✍ Benesty, Jacob; Sondhi, M. Mohan; Huang, Yiteng (Arden) πŸ“‚ Article πŸ“… 2008 πŸ› Springer Berlin Heidelberg 🌐 German βš– 952 KB

From common consumer products such as cell phones and MP3 players to more sophisticated projects such as human-machine interfaces and responsive robots, speech technologies are now everywhere. Many think that it is just a matter of time before more applications of the science of speech become inesca