## Abstract A human expert is able to determine phoneme boundaries through spectrogram reading with high accuracy using one's knowledge and strategy. The authors developed a phoneme segmentation expert system which simulates a human expert spectrogram reading process using knowledge and strategy. T
Phoneme recognition expert system using spectrogram reading knowledge and neural networks
โ Scribed by Yasuhiro Komori; Takeshi Kawabata; Kaichiro Hatazaki; Kiyohiro Shikano
- Publisher
- John Wiley and Sons
- Year
- 1990
- Tongue
- English
- Weight
- 900 KB
- Volume
- 21
- Category
- Article
- ISSN
- 0882-1666
No coin nor oath required. For personal study only.
โฆ Synopsis
Abstract
A human expert spectrogram reader is able to recognize phonemes in highโaccuracy performing phoneme segmentation and phoneme identification simultaneously using one's spectrogram reading knowledge. Spectrogram reading knowledge consists mainly of two parts: a strategic part for phoneme segmentation and identification and a patternโmatching part. There are several knowledgeโbased approaches in which all knowledge is implemented as rules. However, it is especially difficult to describe the pattern matching part of the knowledge as rules and to extract acoustic features automatically for phoneme identification. Here, we construct a phoneme recognition expert system which consists of two parts: (1) ruleโbased phoneme segmentation, and (2) neural networkโbased phoneme identification for knowledge such as pattern matching. This paper presents the architecture of the phoneme recognition expert system with its experimental result tested on Japanese consonants. The experimental result shows that 90.8 percent of the phonemes were segmented correctly and 92.4 percent of the phonemes were identified correctly within the correct segments, which means 83.9 percent of the phonemes were correctly recognized both in segmentation and identification.
๐ SIMILAR VOLUMES
09 Combustion (burners, combustion systems) NO, reduction, but less effective for NzO reduction; the optimum value of the ratio of primary/ total air was -0.8 for minimizing NO, emissions. Limestone injection at calcium-sulfur molar ratios >2:1 made it possible to remove >90% of SOz formed, although
Recent years have witnessed the increasing application of artificial intelligence techniques, specifically, knowledge-based systems, artificial neural networks, and pattern recognition, to biotechnological processes. Although progress has been made in simple control applications, more work is needed
## Abstract This paper presents a speech synthesis system using the neural network. While there have been a large number of speech synthesis systems developed, most of them are ruleโbased systems for a restricted range of languages, describing the expert knowledges for the language in the form of r