A Syllable, Articulatory-Feature, and Stress-Accent Model of Speech Recognition

✍ Scribed by Chang Shuangyu.

Tongue: English
Leaves: 286
Category: Library

✦ Synopsis

Doctor of Philosophy in Computer Science
University of California, Berkeley, 2002
Professor Nelson Morgan, Dr. Lokendra Shastri, Cochairs

Current-generation automatic speech recognition (ASR) systems assume that words are readily decomposable into constituent phonetic components ("phonemes").
A detailed linguistic dissection of state-of-the-art speech recognition systems indicates that the conventional phonemic "beads-on-a-string" approach is of limited utility, particularly with respect to informal, conversational material. The study shows that there is a significant gap between the observed data and the pronunciation models of current ASR systems. It also shows that many important factors affecting recognition performance are not modeled explicitly in these systems.
Motivated by these findings, this dissertation analyzes spontaneous speech with respect to three important, but often neglected, components of speech (at least with respect to English ASR). These components are articulatory-acoustic features (AFs), the syllable, and stress accent. Analysis results provide evidence for an alternative approach to speech modeling, one in which the syllable assumes pre-eminent status and is melded to the lower as well as the higher tiers of linguistic representation through the incorporation of prosodic information such as stress accent. Using concrete examples and statistics from spontaneous speech material, it is shown that there exists a systematic relationship between the realization of AFs and stress accent in conjunction with syllable position. This relationship can be used to provide an accurate and parsimonious characterization of pronunciation variation in spontaneous speech. An approach to automatically extract AFs from the acoustic signal is also developed, as is a system for the automatic stress-accent labeling of spontaneous speech.
Based on the results of these studies, a syllable-centric, multi-tier model of speech recognition is proposed. The model explicitly relates AFs, phonetic segments, and syllable constituents to a framework for lexical representation, and incorporates stress-accent information into recognition. A test-bed implementation of the model is developed using a fuzzy-based approach for combining evidence from various AF sources and a pronunciation-variation modeling technique using AF-variation statistics extracted from data.

✦ Subjects

Languages and Linguistics; English Language; English Phonology and Phonetics; Theoretical Phonology and Phonetics of English

📜 SIMILAR VOLUMES

📁 Speech Recognition Using Articulatory and Excitation Source Features

✍ K. Sreenivasa Rao, Manjunath K E (auth.) 📂 Library 📅 2017 🏛 Springer International Publishing 🌐 English

This book discusses the contribution of articulatory and excitation source information in discriminating sound units. The authors focus on excitation source component of speech -- and the dynamics of various articulators during speech production -- for enhancement of speech recognition (SR) perfo

📁 Emotion Recognition using Speech Features

✍ K. Sreenivasa Rao, Shashidhar G. Koolagudi (auth.) 📂 Library 📅 2013 🏛 Springer-Verlag New York 🌐 English

“Emotion Recognition Using Speech Features” provides coverage of emotion-specific features present in speech. The author also discusses suitable models for capturing emotion-specific information for distinguishing different emotions. The content of this book is important for designing and develop

📁 Emotion Recognition using Speech Features

✍ Krothapalli S.R., Koolagudi S.G. 📂 Library 🌐 English

Publisher: Springer, 2013, 134 pp. During production of speech, human beings impose emotional cues on the sequence of sound units to convey the intended message. Speech without emotional information is unnatural and monotonous. Most of the existing speech systems are able to process studio rec

📁 Speech Enhancement, Modeling and Recognition - Algorithms, Applns.

✍ S. Ramakrishnan 📂 Library 🌐 English

📁 A Theory of Stress and Accent

✍ Shosuke Haraguchi 📂 Library 📅 1991 🏛 De Gruyter Mouton 🌐 English

📁 A Theory of Stress and Accent

✍ Shosuke Haraguchi 📂 Library 📅 1990 🏛 Foris Publications 🌐 English