✦ LIBER ✦

Speech recognition of Mandarin monosyllables

✍ Scribed by Tze Fen Li

Publisher: Elsevier Science
Year: 2003
Tongue: English
Weight: 199 KB
Volume: 36
Category: Article
ISSN: 0031-3203
DOI: 10.1016/s0031-3203(03)00135-3

✦ Synopsis

The nonlinear expansion and contraction of syllable pronunciations, together with their sequential, time-varying features, greatly complicate automatic speech recognition. Each syllable is represented by a sequence of linear predictive coding cepstra (LPCC) vectors. Even when the same speaker utters the same syllable, the duration of the stable parts of the LPCC vector sequence changes every time. Therefore, the stable parts are contracted so that the compressed speech waveform always has the same length. We propose five different simple techniques to contract the stable parts of the sequence of LPCC vectors. A simplified Bayes decision rule with a weighted variance is used to classify 408 speaker-dependent Mandarin syllables. The recognition rate is 94.36%, compared with 79.78% obtained using hidden Markov models (HMM), and a recognition rate of 98.16% is achieved within the top 3 candidates. The features proposed in this paper to represent the syllables are simple and easy to extract. Feature extraction and classification are much faster than with the HMM or any other known technique.
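
The synopsis does not say which five contraction techniques the paper uses or how the variance is weighted, so the following Python sketch is only an illustration of the general idea, not the author's method. It assumes one plausible contraction (averaging frames within equal-sized segments) and a plain diagonal-Gaussian Bayes rule with equal priors and a variance floor. The function names, `target_len`, and `var_floor` are all hypothetical.

```python
import numpy as np


def contract_to_fixed_length(frames, target_len):
    """Contract a (n_frames, n_coeffs) sequence to target_len vectors.

    Assumption: frames are grouped into target_len contiguous segments of
    near-equal size and each segment is replaced by its mean vector.
    """
    frames = np.asarray(frames, dtype=float)
    n = frames.shape[0]
    if n < target_len:
        raise ValueError("sequence is shorter than target_len")
    # Integer arithmetic keeps every segment non-empty when n >= target_len.
    edges = [(i * n) // target_len for i in range(target_len + 1)]
    return np.stack(
        [frames[edges[i]:edges[i + 1]].mean(axis=0) for i in range(target_len)]
    )


def fit_class_stats(examples_by_label, var_floor=1e-3):
    """Estimate a per-class mean and (floored) variance for each feature.

    examples_by_label maps a syllable label to a list of fixed-length
    feature arrays, e.g. outputs of contract_to_fixed_length.
    """
    stats = {}
    for label, examples in examples_by_label.items():
        x = np.stack([np.ravel(e) for e in examples])
        stats[label] = (x.mean(axis=0), np.maximum(x.var(axis=0), var_floor))
    return stats


def classify(feature, stats):
    """Return the label with the highest diagonal-Gaussian likelihood.

    With equal priors, this is the label minimizing
    sum(log(var) + (x - mean)**2 / var).
    """
    x = np.ravel(feature)

    def score(label):
        mean, var = stats[label]
        return float(np.sum(np.log(var) + (x - mean) ** 2 / var))

    return min(stats, key=score)
```

In use, every training and test utterance would first be contracted to the same `target_len`, so that sequences of different durations yield feature vectors of identical size before the class statistics are estimated and compared.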

📜 SIMILAR VOLUMES

Tone recognition of polysyllabic words in Mandarin speech

✍ Lih-Cherng Liu; Wu-Ji Yang; Hsiao-Chuan Wang; Yueh-Chin Chang 📂 Article 📅 1989 🏛 Elsevier Science 🌐 English ⚖ 728 KB

Modeling partial pronunciation variations for spontaneous Mandarin speech recognition

✍ Yi Liu; Pascale Fung 📂 Article 📅 2003 🏛 Elsevier Science 🌐 English ⚖ 797 KB

The high error rate in spontaneous speech recognition is due in part to the poor modeling of pronunciation variations. An analysis of acoustic data reveals that pronunciation variations include both complete changes and partial changes. Complete changes are the replacement of a canonical phoneme by

A Mandarin e-learning system based on speech recognition and evaluation

✍ Yue Ming; Zongshan Bai 📂 Article 📅 2009 🏛 John Wiley and Sons 🌐 English ⚖ 626 KB

The use of tree-trellis search for large-vocabulary Mandarin polysyllabic word speech recognition

✍ Eng-Fong Huang; Frank K. Soong; Hsiao-Chuan Wang 📂 Article 📅 1994 🏛 Elsevier Science 🌐 English ⚖ 483 KB

In this paper, we propose the use of a tree-trellis search scheme for the task of large vocabulary Mandarin polysyllabic word recognition. Usually, the task of large vocabulary word recognition is computationally intractable by whole-word based approach. We convert this task into a tree network sear

Interpolation of n-gram and mutual-information based trigger pair language models for Mandarin speech recognition

✍ Z. GuoDong; L. KimTeng 📂 Article 📅 1999 🏛 Elsevier Science 🌐 English ⚖ 191 KB

While n-gram modeling is simple and dominant in speech recognition, it can only capture the short-distance context dependency within an n-word window where currently the largest practical n for natural language is three. However, many of the context dependencies in natural language occur beyond a th

Automatic selection of phonetically distributed sentence sets for speaker adaptation with application to large vocabulary Mandarin speech recognition

✍ Jia-lin Shen; Hsin-min Wang; Ren-yuan Lyu; Lin-shan Lee 📂 Article 📅 1999 🏛 Elsevier Science 🌐 English ⚖ 163 KB

This paper presents an approach of automatic selection of phonetically distributed sentence sets for speaker adaptation, and applies the concept to the task of Mandarin speech recognition with very large vocabulary. This is a different approach to the adaptation data selection problem. A computer al