Speech Recognition Using Articulatory and Excitation Source Features

✍ Written by K. Sreenivasa Rao, Manjunath K E (auth.)

Publisher: Springer International Publishing
Year: 2017
Language: English
Pages: 100
Series: SpringerBriefs in Electrical and Computer Engineering
Edition: 1
Category: Library

✦ Synopsis

This book discusses the contribution of articulatory and excitation source information to discriminating sound units. The authors focus on the excitation source component of speech, and on the dynamics of the various articulators during speech production, to enhance speech recognition (SR) performance. Speech recognition is analyzed for the read, extempore, and conversation modes of speech. Five groups of articulatory features (AFs) are explored for speech recognition, in addition to conventional spectral features. Each chapter provides the motivation for exploring a specific feature for the SR task, discusses methods for extracting that feature, and suggests appropriate models for capturing sound-unit-specific knowledge from the proposed features. The authors close by discussing various combinations of spectral, articulatory, and source features, and the models suited to enhancing the performance of SR systems.

✦ Table of Contents

Front Matter (pp. i-xi)
Introduction (pp. 1-6)
Literature Review (pp. 7-15)
Articulatory Features for Phone Recognition (pp. 17-46)
Excitation Source Features for Phone Recognition (pp. 47-63)
Articulatory and Excitation Source Features for Phone Recognition in Read, Extempore and Conversation Modes of Speech (pp. 65-79)
Summary and Conclusion (pp. 81-84)
Back Matter (pp. 85-92)

✦ Subjects

Signal, Image and Speech Processing; Language Translation and Linguistics; Computational Linguistics

📜 SIMILAR VOLUMES

📁 A Syllable, Articulatory-Feature, and Stress-Accent Model of Speech Recognition

✍ Chang Shuangyu. 📂 Library 🌐 English

286 pages. Doctor of Philosophy in Computer Science, University of California, Berkeley, 2002. Professor Nelson Morgan and Dr. Lokendra Shastri, Cochairs. Current-generation automatic speech recognition (ASR) systems assume that words are readily decomposable into constituent phonet

📁 Language Identification Using Excitation Source Features

✍ K. Sreenivasa Rao, Dipanjan Nandi (auth.) 📂 Library 📅 2015 🏛 Springer International Publishing 🌐 English

This book discusses the contribution of excitation source information in discriminating language. The authors focus on the excitation source component of speech for enhancement of language identification (LID) performance. Language specific features are extracted using two different modes: (i) Im

📁 Emotion Recognition using Speech Features

✍ K. Sreenivasa Rao, Shashidhar G. Koolagudi (auth.) 📂 Library 📅 2013 🏛 Springer-Verlag New York 🌐 English

“Emotion Recognition Using Speech Features” provides coverage of emotion-specific features present in speech. The author also discusses suitable models for capturing emotion-specific information for distinguishing different emotions. The content of this book is important for designing and develop

📁 Emotion Recognition using Speech Features

✍ Krothapalli S.R., Koolagudi S.G. 📂 Library 🌐 English

Publisher: Springer, 2013, 134 pp. During the production of speech, human beings impose emotional cues on the sequence of sound units to convey the intended message. Speech without emotional information is unnatural and monotonous. Most of the existing speech systems are able to process studio rec

📁 Robust Emotion Recognition using Spectral and Prosodic Features

✍ K. Sreenivasa Rao, Shashidhar G. Koolagudi (auth.) 📂 Library 📅 2013 🏛 Springer-Verlag New York 🌐 English

In this brief, the authors discuss recently explored spectral (sub-segmental and pitch synchronous) and prosodic (global and local features at word and syllable levels in different parts of the utterance) features for discerning emotions in a robust manner. The authors also delve into the complem

📁 Incorporating Knowledge Sources into Statistical Speech Recognition

✍ Wolfgang Minker, Satoshi Nakamura, Konstantin Markov, Sakriani Sakti (auth.) 📂 Library 📅 2009 🏛 Springer US 🌐 English

Incorporating Knowledge Sources into Statistical Speech Recognition offers solutions for enhancing the robustness of a statistical automatic speech recognition (ASR) system by incorporating various additional knowledge sources while keeping the training and recognition effort feasible