Chip design of MFCC extraction for speech recognition
โ Scribed by Jia-Ching Wang; Jhing-Fa Wang; Yu-Sheng Weng
- Publisher
- Elsevier Science
- Year
- 2002
- Tongue
- English
- Weight
- 342 KB
- Volume
- 32
- Category
- Article
- ISSN
- 0167-9260
No coin nor oath required. For personal study only.
โฆ Synopsis
The mel frequency cepstral coefficient (MFCC) is one of the most important features required among various kinds of speech applications. In this paper, the first chip for speech features extraction based on MFCC algorithm is proposed. The chip is implemented as an intellectual property, which is suitable to be adopted in a speech recognition system on a chip. The computational complexity and memory requirement of MFCC algorithm are analyzed in detail and improved greatly. The hybrid table look-up scheme is presented to deal with the elementary function value in the MFCC algorithm. Fixed-point arithmetic is adopted to reduce the cost under the accuracy studies of finite word length effect. Finally, the area-efficient design is implemented successfully into the single Xilinx XC4062XL FPGA.
๐ SIMILAR VOLUMES
Seven classes of design guidelines are described for interfaces which use speech recognition. The guidelines concern: (i) allocation of function within complex systems; (ii) parallel processing of speech with other modalities; (iii) design of command vocabulary; (iv) choice of command syntax; (v) us
This paper discusses a number of problems that are to be solved for the purpose of practicable implementation of speech recognition systems, and a method of speech recognition is proposed using the authors concept of demisyllable units. In the proposed implementation of demisyllable-based speaker-in
Confidence measures enable us to assess the output of a speech recognition system. The confidence measure provides us with an estimate of the probability that a word in the recognizer output is either correct or incorrect. In this paper we discuss ways in which to quantify the performance of confide