𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Automatic Speech Recognition: The Development of the SPHINX System

✍ Scribed by Kai-Fu Lee (auth.)


Publisher
Springer US
Year
1989
Tongue
English
Leaves
215
Series
The Springer International Series in Engineering and Computer Science 62
Edition
1
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


Speech Recognition has a long history of being one of the difficult problems in Artificial Intelligence and Computer Science. As one goes from problem solving tasks such as puzzles and chess to perceptual tasks such as speech and vision, the problem characteristics change dramatically: knowledge poor to knowledge rich; low data rates to high data rates; slow response time (minutes to hours) to instantaneous response time. These characteristics taken together increase the computational complexity of the problem by several orders of magnitude. Further, speech provides a challenging task domain which embodies many of the requirements of intelligent behavior: operate in real time; exploit vast amounts of knowledge, tolerate errorful, unexpected unknown input; use symbols and abstractions; communicate in natural language and learn from the environment. Voice input to computers offers a number of advantages. It provides a natural, fast, hands free, eyes free, location free input medium. However, there are many as yet unsolved problems that prevent routine use of speech as an input device by non-experts. These include cost, real time response, speaker independence, robustness to variations such as noise, microphone, speech rate and loudness, and the ability to handle non-grammatical speech. Satisfactory solutions to each of these problems can be expected within the next decade. Recognition of unrestricted spontaneous continuous speech appears unsolvable at present. However, by the addition of simple constraints, such as clarification dialog to resolve ambiguity, we believe it will be possible to develop systems capable of accepting very large vocabulary continuous speechdictation.

✦ Table of Contents


Front Matter....Pages i-xv
Introduction....Pages 1-16
Hidden Markov Modeling of Speech....Pages 17-43
Task and Databases....Pages 45-50
The Baseline SPHINX System....Pages 51-62
Adding Knowledge....Pages 63-89
Finding a Good Unit of Speech....Pages 91-114
Learning and Adaptation....Pages 115-127
Summary of Results....Pages 129-136
Conclusion....Pages 137-144
Back Matter....Pages 145-207

✦ Subjects


Signal, Image and Speech Processing;Artificial Intelligence (incl. Robotics)


πŸ“œ SIMILAR VOLUMES


Automatic Speech Recognition
✍ Renals Steve, King Simon. πŸ“‚ Library 🌐 English

In William J. Hardcastle, John Laver, and Fiona E. Gibbon, editors, Handbook of Phonetic Sciences, chapter 22 (Wiley Blackwell, 2010, ISBN: 978-1-4051-4590-9) β€” on pp. 804β€”838.<div class="bb-sep"></div>Speech recognitionβ€”the transcription of an acoustic speech signal into a string of wordsβ€”is a hard

The HARPY Speech Recognition System [Ph.
✍ Lowerre, Bruce T. πŸ“‚ Library πŸ“… 1976 πŸ› Carnegie-Mellon University 🌐 English

The Harpy connected speech recognition system is the result of an attempt to understand the relative importance of various design choices of two earlier speech recognition systems developed at Carnegie-Mellon University: The Hearsay-1 system and the Dragon system. Knowledge is represented in the Hea

Automatic Fingerprint Recognition System
✍ Nalini Ratha, Ruud Bolle πŸ“‚ Library πŸ“… 2003 πŸ› Springer 🌐 English

An authoritative survey of intelligent fingerprint-recognition concepts, technology, and systems is given. Editors and contributors are the leading researchers and applied R&D developers of this personal identification (biometric security) topic and technology. Biometrics and pattern recognition res

Automatic Fingerprint Recognition System
✍ Simon A. Cole (auth.), Nalini Ratha, Ruud Bolle (eds.) πŸ“‚ Library πŸ“… 2004 πŸ› Springer-Verlag New York 🌐 English

<p><P></P><P>Contrary to perhaps common belief, even after several decades of research, automatic fingerprint recognition is not a solved problem. New fingerprint sensing technologies, algorithmic advances, and abundant computing power continue to drive advances in this area and to open up new realm

Automatic Speech and Speaker Recognition
✍ L. R. Rabiner, B.-H. Juang, C.-H. Lee (auth.), Chin-Hui Lee, Frank K. Soong, Kul πŸ“‚ Library πŸ“… 1996 πŸ› Springer US 🌐 English

<p>Research in the field of automatic speech and speaker recognition has made a number of significant advances in the last two decades, influenced by advances in signal processing, algorithms, architectures, and hardware. These advances include: the adoption of a statistical pattern recognition para