๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

State clustering in hidden Markov model-based continuous speech recognition

โœ Scribed by S.J. Young; P.C. Woodland


Publisher
Elsevier Science
Year
1994
Tongue
English
Weight
535 KB
Volume
8
Category
Article
ISSN
0885-2308

No coin nor oath required. For personal study only.

โœฆ Synopsis


A key problem in the use of context-dependent hidden Markov models is the need to balance the desired model complexity with the amount of available training data. This paper describes a method which uses a simple agglomerative algorithm to cluster and tie acoustically similar states. The main properties of the algorithm are explored using phone recognition on the TIMIT database where it is shown that there is an optimum between the clustering extrema of an untied context-dependent system and a fully tied monophone system. At this optimum, phone recognition performance was (76.7 %) correct and (72 \cdot 3 %) accuracy. The use of state-tying in the HTK continuous speech recognition system is then described and results are presented using the Resource Management database. The average error rate across the Feb '89, Oct ' 89 and Feb '91 test sets was less than 4.3% and this was achieved without cross-word triphones. Genderdependent models were also compared to gender-independent models but found to give little improvement.


๐Ÿ“œ SIMILAR VOLUMES


Keyword detection in conversational spee
โœ R.C. Rose ๐Ÿ“‚ Article ๐Ÿ“… 1995 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 213 KB

This paper describes a set of modeling techniques for detecting a small vocabulary of keywords in running conversational speech. The techniques are applied in the context of a hidden Markov model (HMM) based continuous speech recognition (CSR) approach to keyword spotting. The word spotting task is

Hidden Markov model-based speech recogni
โœ R Singh; K Davis; P V.S Rao ๐Ÿ“‚ Article ๐Ÿ“… 1997 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 633 KB

A discrete wavelet transform algorithm segregates the operand data set sequentially. It generates computational intermediates which represent it at graded resolutions and leads to a reciprocal domain within which information is multiply resolved in terms of the timefrequency localization of the comp

Context-dependent connectionist probabil
โœ Horacio Franco; Michael Cohen; Nelson Morgan; David Rumelhart; Victor Abrash ๐Ÿ“‚ Article ๐Ÿ“… 1994 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 457 KB

In this paper we present a training method and a network architecture for estimating context-dependent observation probabilities in the framework of a hybrid hidden Markov model (HMM)/multi layer perceptron (MLP) speaker-independent continuous speech recognition system. The context-dependent modelin

A study on the use of bi-directional con
โœ Qiang Huo; Chorkin Chan ๐Ÿ“‚ Article ๐Ÿ“… 1996 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 122 KB

In this paper, by using the formulation of the missing-data problem, a general framework for statistical acoustic modelling of speech is presented. With the motivation of utilizing bi-directional contextual dependence in acoustic modelling, a bi-directional hidden Markov modelling approach for speec