The domain of speech processing has come to the point where researchers and engineers are concerned with how speech technology can be applied to new products, and how this technology will transform our future. One important problem is to improve robustness of speech processing under adverse cond
Robustness in Automatic Speech Recognition: Fundamentals and Applications
β Scribed by Jean-Claude Junqua, Jean-Paul Haton (auth.)
- Publisher
- Springer US
- Year
- 1996
- Tongue
- English
- Leaves
- 456
- Series
- The Kluwer International Series in Engineering and Computer Science 341
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
Foreword Looking back the past 30 years. we have seen steady progress made in the area of speech science and technology. I still remember the excitement in the late seventies when Texas Instruments came up with a toy named "Speak-and-Spell" which was based on a VLSI chip containing the state-of-the-art linear prediction synthesizer. This caused a speech technology fever among the electronics industry. Particularly. applications of automatic speech recognition were rigorously attemptΒ ed by many companies. some of which were start-ups founded just for this purpose. Unfortunately. it did not take long before they realized that automatic speech recΒ ognition technology was not mature enough to satisfy the need of customers. The fever gradually faded away. In the meantime. constant efforts have been made by many researchers and engiΒ neers to improve the automatic speech recognition technology. Hardware capabilities have advanced impressively since that time. In the past few years. we have been witnessing and experiencing the advent of the "Information Revolution." What might be called the second surge of interest to comΒ mercialize speech technology as a natural interface for man-machine communication began in much better shape than the first one. With computers much more powerful and faster. many applications look realistic this time. However. there are still tremendous practical issues to be overcome in order for speech to be truly the most natural interface between humans and machines.
β¦ Table of Contents
Front Matter....Pages i-xxx
Front Matter....Pages 1-1
Nature and Perception of Speech Sounds....Pages 3-35
Background on Speech Analysis....Pages 37-71
Fundamentals of Automatic Speech Recognition....Pages 73-124
Front Matter....Pages 125-125
Speaker Variability and Specificity....Pages 127-153
Dealing with Noisy Speech and Channel Distortions....Pages 155-189
Front Matter....Pages 191-191
The Current Technology and Its Limits: An Overview....Pages 193-206
Towards Robust Speech Analysis....Pages 207-231
On the Use of a Robust Speech Representation....Pages 233-272
ASR of Noisy, Stressed, and Channel Distorted Speech....Pages 273-323
Word-Spotting and Rejection....Pages 325-345
Spontaneous Speech....Pages 347-369
On the use of Knowledge in ASR....Pages 371-392
Application Domain, Human Factors, and Dialogue....Pages 393-428
Back Matter....Pages 429-440
β¦ Subjects
Signal, Image and Speech Processing; Electrical Engineering
π SIMILAR VOLUMES
<p>The need for automatic speech recognition systems to be robust with respect to changes in their acoustical environment has become more widely appreciated in recent years, as more systems are finding their way into practical applications. Although the issue of environmental robustness has received
<p>Robust Automatic Speech Recognition: A Bridge to Practical Applications establishes a solid foundation for automatic speech recognition that is robust against acoustic environmental distortion. It provides a thorough overview of classical and modern noise-and reverberation robust techniques that
<p>Automatic speech recognition (ASR) systems are finding increasing use in everyday life. Many of the commonplace environments where the systems are used are noisy, for example users calling up a voice search system from a busy cafeteria or a street. This can result in degraded speech recordings an
<p>Automatic speech recognition (ASR) systems are finding increasing use in everyday life. Many of the commonplace environments where the systems are used are noisy, for example users calling up a voice search system from a busy cafeteria or a street. This can result in degraded speech recordings an
Automatic speech recognition (ASR) systems are finding increasing use in everyday life. Many of the commonplace environments where the systems are used are noisy, for example users calling up a voice search system from a busy cafeteria or a street. This can result in degraded speech recordings and a