Speech recognition using various sequential networks
β Scribed by Seiichi Nakagawa; Isao Hayakawa
- Publisher
- John Wiley and Sons
- Year
- 1992
- Tongue
- English
- Weight
- 710 KB
- Volume
- 23
- Category
- Article
- ISSN
- 0882-1666
No coin nor oath required. For personal study only.
β¦ Synopsis
Abstract
In this paper, we describe two neural networkβbased approaches to speech recognition. One is a purely neural network approach that approximates a multipleorder hidden Markov model (HMM). It is an extension of a sequential network and is called the neural Markov model (NMM). The other approach is a combination of sequential networks and the DP matching (DTW) method.
NMM is based on the idea that a sequential network is similar to a hidden Markov model (HMM) with one state; we attempt to approximate an HMM with several states by concatenating sequential networks.
For this purpose, we tried two approaches: (1) using only neural networks, and (2) applying the DP matching method to the output of the networks. The speakerβindependent recognition rate using the first approach was 65.4 percent for nine syllables that included voiced plosive consonants. The second method produced a 88.1 percent recognition rate for artificial word data consisting of two concatenated syllables. These results outperformed those obtained with the HMM.
π SIMILAR VOLUMES
We have combined an echo state network (ESN) with a competitive state machine framework to create a classification engine called the predictive ESN classifier. We derive the expressions for training the predictive ESN classifier and show that the model was significantly more noise robust compared to
Complex spatial filtering has been applied to the problem of speech recognition. Our initial experiment concerning the recognition of the numbers one to ten spoken by one person on four consecutive occasions, is described and modifications to the sys tem suggested. Possible applications of a success