๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

Design of speech recognition system: problems and solutions

โœ Scribed by Takao Watanabe


Publisher
John Wiley and Sons
Year
1999
Tongue
English
Weight
551 KB
Volume
30
Category
Article
ISSN
0882-1666

No coin nor oath required. For personal study only.

โœฆ Synopsis


This paper discusses a number of problems that are to be solved for the purpose of practicable implementation of speech recognition systems, and a method of speech recognition is proposed using the authors concept of demisyllable units. In the proposed implementation of demisyllable-based speaker-independent continuous speech recognition, such improved robustness features as speaker adaptation using spectral interpolation mapping, environment adaptation using the REALISE method, and a method of unknown input rejection based on likelihood correction using syllable recognition were provided. Further, acceleration of likelihood calculation using a tree-structured probability distribution was proposed, as well as cost reduction by acceleration of continuous speech recognition using the bundle search. Software development was also described. The proposed method is expected to contribute to the practicability of speech recognition, although many problems, particularly those related to system robustness, still remain to be solved.


๐Ÿ“œ SIMILAR VOLUMES


Chip design of MFCC extraction for speec
โœ Jia-Ching Wang; Jhing-Fa Wang; Yu-Sheng Weng ๐Ÿ“‚ Article ๐Ÿ“… 2002 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 342 KB

The mel frequency cepstral coefficient (MFCC) is one of the most important features required among various kinds of speech applications. In this paper, the first chip for speech features extraction based on MFCC algorithm is proposed. The chip is implemented as an intellectual property, which is sui

Evaluation of word confidence for speech
โœ Manhung Siu; Herbert Gish ๐Ÿ“‚ Article ๐Ÿ“… 1999 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 195 KB

Confidence measures enable us to assess the output of a speech recognition system. The confidence measure provides us with an estimate of the probability that a word in the recognizer output is either correct or incorrect. In this paper we discuss ways in which to quantify the performance of confide

Comparison of continuous speech recognit
โœ Atsuhiko Kai; Seiichi Nakagawa ๐Ÿ“‚ Article ๐Ÿ“… 1998 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 594 KB

This paper describes speech recognition systems for dealing with spontaneous speech, in which an unknownword processing method based on subword sequence decoding is employed. We propose an efficient algorithm for unknown-word processing that employs an independent process of subword sequence decodin