Performance comparison of neural network architectures for speaker-independent phoneme recognition
β Scribed by Satoru Nakamura; Hidefumi Sawai
- Publisher
- John Wiley and Sons
- Year
- 1992
- Tongue
- English
- Weight
- 790 KB
- Volume
- 23
- Category
- Article
- ISSN
- 0882-1666
No coin nor oath required. For personal study only.
β¦ Synopsis
Abstract
We applied several types of timeβdelay neural networks (TDNNs), generally used for speakerβdependent and multispeaker speech recognition, to speakerβindependent speech recognition and compared their performance. Six or 12 speakers were used to train each network, and recognition experiments for voiced stops /b, d, g/ were performed in open speaker mode. The best recognition rates were 91.3 percent and 93.6 percent, using six and 12 training speakers, respectively. We found that constructing modular networks, such as modular TDNN with each network corresponding to a speaker, is effective in terms of decreasing the number of training iterations needed, showing slightly better performance than with a single TDNN with a comparable network capacity. This is because the modular networks make use of limited capacity effectively. On the other hand, a single TDNN with an increased number of hidden units showed a recognition rate comparable to that of the modular TDNN.
π SIMILAR VOLUMES
We propose a new model for speaker-independent vowel recognition which uses the flexibility of the dynamic linking that results from the synchronization of oscillating neural units. The system consists of an input layer and three neural layers, which are referred to as the A-, B-and C-centers. The i
Neural networks were trained to classify single 20 ms frames of vowels using either perceptually-based spectral representations or LPC spectra as input. Classification performance was compared with performance of several distance measures using nearest-neighbor and mean-distance decision criteria. T
Strategies that have been developed to extend NN prediction methods to accommodate right-censored data include methods due to Faraggi-Simon, Liestol-Andersen-Andersen, and a modiΓΏcation of the Buckley-James method. In a Monte Carlo simulation study, we evaluated the performance of all three NN metho
Artificial Neural Networks (ANN) have been used in the past to predict the performance of printed circuit heat exchangers (PCHE) with satisfactory accuracy. Typically published literature has focused on optimizing ANN using a training dataset to train the network and a testing dataset to evaluate it