Phonological Parsing in Speech Recognition describes a novel recognizer that is designed to exploit variant cues by parsing the input utterance into syllables and other suprasegmental constituents using phrase-structure parsing techniques. The unique matrix view of parsing on engineering grounds all
Phonological Parsing in Speech Recognition
Scribed by Kenneth W. Church (auth.)
- Publisher: Springer US
- Year: 1987
- Tongue: English
- Leaves: 271
- Series: The Kluwer International Series in Engineering and Computer Science 38
- Edition: 1
- Category: Library
Synopsis
It is well known that phonemes have different acoustic realizations depending on the context. Thus, for example, the phoneme /t/ is typically realized with a heavily aspirated strong burst at the beginning of a syllable, as in the word Tom, but without a burst at the end of a syllable in a word like cat. Variation such as this is often considered to be problematic for speech recognition:

(1) "In most systems for sentence recognition, such modifications must be viewed as a kind of 'noise' that makes it more difficult to hypothesize lexical candidates given an input phonetic transcription. To see that this must be the case, we note that each phonological rule [in a certain example] results in irreversible ambiguity - the phonological rule does not have a unique inverse that could be used to recover the underlying phonemic representation for a lexical item. For example, ... schwa vowels could be the first vowel in a word like 'about' or the surface realization of almost any English vowel appearing in a sufficiently destressed word. The tongue flap [ɾ] could have come from a /t/ or a /d/." [65, pp. 548-549]

This view of allophonic variation is representative of much of the speech recognition literature, especially during the late 1970s. One can find similar statements by Cole and Jakimik [22] and by Jelinek [50].
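The "no unique inverse" point in the quoted passage can be made concrete with a small sketch. The code below is a toy illustration under stated assumptions, not the recognizer described in the book: the rule table, the function name `underlying_candidates`, and the use of ARPAbet-style symbols (DX for the flap, AX for schwa) are all choices made here for illustration, and the list of vowels that may reduce to schwa is deliberately partial.

```python
from itertools import product

# Hypothetical, simplified table: each surface phone maps to the underlying
# phonemes it might realize. ARPAbet-style labels are an assumption made for
# this sketch; the vowel list for AX is partial and illustrative only.
SURFACE_TO_UNDERLYING = {
    "DX": ["D", "T"],                     # a flap may come from /d/ or /t/
    "AX": ["AE", "AH", "AX", "EH", "IH"],  # schwa may be a reduced vowel
}


def underlying_candidates(surface):
    """Return every underlying phoneme sequence consistent with `surface`.

    Phones missing from the table are assumed to map only to themselves.
    """
    options = [SURFACE_TO_UNDERLYING.get(phone, [phone]) for phone in surface]
    return [list(candidate) for candidate in product(*options)]


# One surface form with a flap is consistent with two underlying forms,
# which is the ambiguity between words such as "rider" and "writer".
for candidate in underlying_candidates(["R", "AY", "DX", "ER"]):
    print(" ".join(candidate))
```

Each ambiguous surface phone multiplies the number of candidates, which is why a recognizer that only inverts such rules faces a growing set of lexical hypotheses.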
Table of Contents
Front Matter
Introduction
Representation of Segments
Allophonic Rules
An Alternative: Phrase-Structure Rules
Parser Implementation
Phonotactic Constraints
When Phonotactic Constraints are Not Enough
Robustness Issues
Conclusion
Back Matter
Subjects
Signal, Image and Speech Processing; Phonology; Artificial Intelligence (incl. Robotics); Computational Linguistics
SIMILAR VOLUMES
This book is a revised version of my doctoral thesis which was submitted in April 1993. The main extension is a chapter on evaluation of the system described in Chapter 8 as this is clearly an issue which was not treated in the original version. This required the collection of data, the develop
Chapters in the first part of the book cover all the essential speech processing techniques for building robust, automatic speech recognition systems: the representation for speech signals and the methods for speech-features extraction, acoustic and language modeling, efficient algorithms for search
This book reports recent research on mechanisms of normal formulation and control in speaking and in language disorders such as stuttering, aphasia and verbal dyspraxia. The theoretical claim is that such disorders result both from deficits in a component of the language production system and int