286 pages.<br/>Doctor of Philosophy in Computer Science<br/>University of California, Berkeley , 2002<br/>Professor Nelson Morgan, Dr. Lokendra Shastri, Cochairs<br/> <br/>Current-generation automatic speech recognition (ASR) systems assume that words are readily decomposable into constituent phonet
Speech Recognition Using Articulatory and Excitation Source Features
β Scribed by K. Sreenivasa Rao, Manjunath K E (auth.)
- Publisher
- Springer International Publishing
- Year
- 2017
- Tongue
- English
- Leaves
- 100
- Series
- SpringerBriefs in Electrical and Computer Engineering
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
This book discusses the contribution of articulatory and excitation source information in discriminating sound units. The authors focus on excitation source component of speech -- and the dynamics of various articulators during speech production -- for enhancement of speech recognition (SR) performance. Speech recognition is analyzed for read, extempore, and conversation modes of speech. Five groups of articulatory features (AFs) are explored for speech recognition, in addition to conventional spectral features. Each chapter provides the motivation for exploring the specific feature for SR task, discusses the methods to extract those features, and finally suggests appropriate models to capture the sound unit specific knowledge from the proposed features. The authors close by discussing various combinations of spectral, articulatory and source features, and the desired models to enhance the performance of SR systems.
β¦ Table of Contents
Front Matter....Pages i-xi
Introduction....Pages 1-6
Literature Review....Pages 7-15
Articulatory Features for Phone Recognition....Pages 17-46
Excitation Source Features for Phone Recognition....Pages 47-63
Articulatory and Excitation Source Features for Phone Recognition in Read, Extempore and Conversation Modes of Speech....Pages 65-79
Summary and Conclusion....Pages 81-84
Back Matter....Pages 85-92
β¦ Subjects
Signal, Image and Speech Processing;Language Translation and Linguistics;Computational Linguistics
π SIMILAR VOLUMES
<p>This book discusses the contribution of excitation source information in discriminating language. The authors focus on the excitation source component of speech for enhancement of language identification (LID) performance. Language specific features are extracted using two different modes: (i) Im
<p>βEmotion Recognition Using Speech Featuresβ provides coverage of emotion-specific features present in speech. The author also discusses suitable models for capturing emotion-specific information for distinguishing different emotions. The content of this book is important for designing and develop
ΠΠ·Π΄Π°ΡΠ΅Π»ΡΡΡΠ²ΠΎ Springer, 2013, -134 pp.<br/>During production of speech human beings impose emotional cues on the sequence of sound units to convey the intended message. Speech without emotional information is unnatural and monotonous. Most of the existing speech systems are able to process studio rec
<p>In this brief, the authors discuss recently explored spectral (sub-segmental and pitch synchronous) and prosodic (global and local features at word and syllable levels in different parts of the utterance) features for discerning emotions in a robust manner. The authors also delve into the complem
<p><P><EM>Incorporating Knowledge Sources into Statistical Speech Recognition</EM> offers solutions for enhancing the robustness of a statistical automatic speech recognition (ASR) system by incorporating various additional knowledge sources while keeping the training and recognition effort feasible