๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

A parallel processing algorithm for speech recognition using markov random fields

โœ Scribed by Hideki Noda; Mehdi N. Shirazi; Bing Zhang


Publisher
John Wiley and Sons
Year
1994
Tongue
English
Weight
770 KB
Volume
25
Category
Article
ISSN
0882-1666

No coin nor oath required. For personal study only.

โœฆ Synopsis


Abstract

This paper proposes a new method in which the speech recognition processing is executed framewise on the time axis by local parallel operations using the Markov random fields (MRF). There have not been many studies presented concerning the parallel execution of the speech processing. On the hand, it is anticipated that parallel processing algorithms for the recognition process proposed in this paper will be very useful in highโ€performance continuous speech recognition systems, for example, where a strong computational power is required.

The essence of parallel execution is to estimate the optimal state sequence by a parallel process based on the iterated conditional modes (ICM) for the given model parameters and the sequence of observed values. The local probability for the state sequence is indispensable for this purpose. It is shown that the local probability can be derived by representing the generation probability of the state sequence in a HMM (hidden Markov model) as a Gibbs distribution and calculating its conditional distribution.

The foregoing property implies that the oneโ€sided Markov chain used in HMM can be converted into a twoโ€sided Markov chain in the oneโ€dimensional MRF. Through the speakerโ€independent digit speech recognition experiment, it is shown that the proposed parallel processing algorithm has recognition performance comparable to that of the Viterbi algorithm.


๐Ÿ“œ SIMILAR VOLUMES


A fast algorithm for parallel model comb
โœ Tai-Hwei Hwang; Hsiao-Chuan Wang ๐Ÿ“‚ Article ๐Ÿ“… 2000 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 223 KB

Based on the log-normal assumption, parallel model combination (PMC) provides an effective method to adapt the cepstral means and variances of speech models for noisy speech recognition. In addition, the log-add method has been derived to adapt the mean by ignoring the cepstral variance during the p

A study on the use of bi-directional con
โœ Qiang Huo; Chorkin Chan ๐Ÿ“‚ Article ๐Ÿ“… 1996 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 122 KB

In this paper, by using the formulation of the missing-data problem, a general framework for statistical acoustic modelling of speech is presented. With the motivation of utilizing bi-directional contextual dependence in acoustic modelling, a bi-directional hidden Markov modelling approach for speec

Using a fast RLS adaptive algorithm for
โœ C. Papaodysseus; G. Roussopoulos; A. Panagopoulos ๐Ÿ“‚ Article ๐Ÿ“… 2005 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 83 KB

In this paper, a new method is presented that offers efficient computation of Linear Prediction Coefficients (LPC) via a new Recursive Least Squares (RLS) adaptive filtering algorithm. This method can be successfully used in speech coding and processing. The introduced algorithm is numerically robus