The domain of speech processing has come to the point where researchers and engineers are concerned with how speech technology can be applied to new products, and how this technology will transform our future. One important problem is to improve robustness of speech processing under adverse cond
Robust automatic speech recognition : a bridge to practical applications
โ Scribed by Deng, Li; Gong, Yifan; Haeb-Umbach, Reinhold; Li, Jinyu
- Publisher
- Academic Press
- Year
- 2016
- Tongue
- English
- Leaves
- 298
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
โฆ Synopsis
Robust Automatic Speech Recognition: A Bridge to Practical Applications establishes a solid foundation for automatic speech recognition that is robust against acoustic environmental distortion. It provides a thorough overview of classical and modern noise-and reverberation robust techniques that have been developed over the past thirty years, with an emphasis on practical methods that have been proven to be successful and which are likely to be further developed for future applications. The strengths and weaknesses of robustness-enhancing speech recognition techniques are carefully analyzed. The book covers noise-robust techniques designed for acoustic models which are based on both Gaussian mixture models and deep neural networks. In addition, a guide to selecting the best methods for practical applications is provided. The reader will:
- Gain a unified, deep and systematic understanding of the state-of-the-art technologies for robust speech recognition
- Learn the links and relationship between alternative technologies for robust speech recognition
- Be able to use the technology analysis and categorization detailed in the book to guide future technology development
- Be able to develop new noise-robust methods in the current era of deep learning for acoustic modeling in speech recognition
- The first book that provides a comprehensive review on noise and reverberation robust speech recognition methods in the era of deep neural networks
- Connects robust speech recognition techniques to machine learning paradigms with rigorous mathematical treatment
- Provides elegant and structural ways to categorize and analyze noise-robust speech recognition techniques
- Written by leading researchers who have been actively working on the subject matter in both industrial and academic organizations for many years
โฆ Table of Contents
Content:
Front Matter,Copyright,About the Authors,List of Figures,List of Tables,Acronyms,NotationsEntitled to full textChapter 1 - Introduction, Pages 1-7
Chapter 2 - Fundamentals of speech recognition, Pages 9-40
Chapter 3 - Background of robust speech recognition, Pages 41-63
Chapter 4 - Processing in the feature and model domains, Pages 65-106
Chapter 5 - Compensation with prior knowledge, Pages 107-136
Chapter 6 - Explicit distortion modeling, Pages 137-170
Chapter 7 - Uncertainty processing, Pages 171-186
Chapter 8 - Joint model training, Pages 187-202
Chapter 9 - Reverberant speech recognition, Pages 203-238
Chapter 10 - Multi-channel processing, Pages 239-260
Chapter 11 - Summary and future directions, Pages 261-280
Index, Pages 281-286
๐ SIMILAR VOLUMES
<p>Foreword Looking back the past 30 years. we have seen steady progress made in the area of speech science and technology. I still remember the excitement in the late seventies when Texas Instruments came up with a toy named "Speak-and-Spell" which was based on a VLSI chip containing the state-of-t
<p><P>Speech recognition technology is being increasingly employed in human-machine interfaces. A remaining problem however is the robustness of this technology to non-native accents, which still cause considerable difficulties for current systems.<BR>In this book, methods to overcome this problem a
<p>The need for automatic speech recognition systems to be robust with respect to changes in their acoustical environment has become more widely appreciated in recent years, as more systems are finding their way into practical applications. Although the issue of environmental robustness has received
<p>Automatic speech recognition (ASR) systems are finding increasing use in everyday life. Many of the commonplace environments where the systems are used are noisy, for example users calling up a voice search system from a busy cafeteria or a street. This can result in degraded speech recordings an