๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

Hidden Markov model-based speech recognition with intermediate wavelet transform domains

โœ Scribed by R Singh; K Davis; P V.S Rao


Publisher
Elsevier Science
Year
1997
Tongue
English
Weight
633 KB
Volume
11
Category
Article
ISSN
0885-2308

No coin nor oath required. For personal study only.

โœฆ Synopsis


A discrete wavelet transform algorithm segregates the operand data set sequentially. It generates computational intermediates which represent it at graded resolutions and leads to a reciprocal domain within which information is multiply resolved in terms of the timefrequency localization of the component wavelet basis vectors. Based on this we introduce the concept of wavelet subtransform domains and show that these can be used to selectively enhance acoustic events in a speech signal. Enhancement of phoneme classes improves segmentation and recognition performance for reasons that we have pointed out in the paper. As an experimental verification, we use subtransform domains to design a preprocessor for a Hindi database and evaluate the subsequent recognition performances using hidden Markov models and two standard parametrizations.


๐Ÿ“œ SIMILAR VOLUMES


State clustering in hidden Markov model-
โœ S.J. Young; P.C. Woodland ๐Ÿ“‚ Article ๐Ÿ“… 1994 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 535 KB

A key problem in the use of context-dependent hidden Markov models is the need to balance the desired model complexity with the amount of available training data. This paper describes a method which uses a simple agglomerative algorithm to cluster and tie acoustically similar states. The main proper

Keyword detection in conversational spee
โœ R.C. Rose ๐Ÿ“‚ Article ๐Ÿ“… 1995 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 213 KB

This paper describes a set of modeling techniques for detecting a small vocabulary of keywords in running conversational speech. The techniques are applied in the context of a hidden Markov model (HMM) based continuous speech recognition (CSR) approach to keyword spotting. The word spotting task is

Hidden Markov model training with contam
โœ Matassoni, Marco (author);Omologo, Maurizio (author);Giuliani, Diego (author);Sv ๐Ÿ“‚ Article ๐Ÿ“… 2002 ๐Ÿ› Academic Press ๐ŸŒ English โš– 280 KB

A challenging scenario is addressed in which a distant-talking speech recognizer operates in a noisy office environment with model adaptation. The use of a single far microphone as well as that of a microphone array input are investigated. In addition to the benefits from the application of microph