𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Speech recognition using various sequential networks

✍ Scribed by Seiichi Nakagawa; Isao Hayakawa


Publisher
John Wiley and Sons
Year
1992
Tongue
English
Weight
710 KB
Volume
23
Category
Article
ISSN
0882-1666

No coin nor oath required. For personal study only.

✦ Synopsis


Abstract

In this paper, we describe two neural network‐based approaches to speech recognition. One is a purely neural network approach that approximates a multipleorder hidden Markov model (HMM). It is an extension of a sequential network and is called the neural Markov model (NMM). The other approach is a combination of sequential networks and the DP matching (DTW) method.

NMM is based on the idea that a sequential network is similar to a hidden Markov model (HMM) with one state; we attempt to approximate an HMM with several states by concatenating sequential networks.

For this purpose, we tried two approaches: (1) using only neural networks, and (2) applying the DP matching method to the output of the networks. The speaker‐independent recognition rate using the first approach was 65.4 percent for nine syllables that included voiced plosive consonants. The second method produced a 88.1 percent recognition rate for artificial word data consisting of two concatenated syllables. These results outperformed those obtained with the HMM.


πŸ“œ SIMILAR VOLUMES


Speech recognition using fractals
✍ Erik L.J Bohez; T.R Senevirathne πŸ“‚ Article πŸ“… 2001 πŸ› Elsevier Science 🌐 English βš– 591 KB
Automatic speech recognition using a pre
✍ Mark D. Skowronski; John G. Harris πŸ“‚ Article πŸ“… 2007 πŸ› Elsevier Science 🌐 English βš– 842 KB

We have combined an echo state network (ESN) with a competitive state machine framework to create a classification engine called the predictive ESN classifier. We derive the expressions for training the predictive ESN classifier and show that the model was significantly more noise robust compared to

Speech recognition using optical filteri
✍ D. Leverington πŸ“‚ Article πŸ“… 1969 πŸ› Elsevier Science βš– 690 KB

Complex spatial filtering has been applied to the problem of speech recognition. Our initial experiment concerning the recognition of the numbers one to ten spoken by one person on four consecutive occasions, is described and modifications to the sys tem suggested. Possible applications of a success