We investigate a statistical model for integrating narrowband cues in speech. The model is inspired by two ideas in human speech perception: (i) Fletcher's hypothesis (1953) that independent detectors, working in narrow frequency bands, account for the robustness of auditory strategies, and (ii) Mil
Union: a model for partial temporal corruption of speech
Scribed by Ji Ming; F. Jack Smith
- Publisher: Elsevier Science
- Year: 2001
- Language: English
- File size: 139 KB
- Volume: 15
- Category: Article
- ISSN: 0885-2308
Synopsis
This paper proposes a new statistical approach, namely the probabilistic union model, for speech recognition subjected to unknown burst noise during the utterance. The model combines the local temporal information based on the union of random events, to reduce the dependence of the model on information about the noise. This paper describes the theory of the model, and an implementation based on hidden Markov modeling techniques. For the evaluation, we used the TIDIGITS database for both isolated and connected digit recognition. The utterances were corrupted by various types of abrupt noise with unknown, time-varying characteristics. The experimental results indicate that the new model offers robustness to partial duration corruption, requiring no prior knowledge about the noise. A combination of the proposed model and conventional noise-reduction techniques is discussed, which has been shown to be potentially capable of dealing with a mixture of stationary noise and random, abrupt noise.
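As a rough illustration of the "union of random events" idea mentioned in the synopsis, the sketch below contrasts the probability that all of several independent events occur (a product, which collapses when one term is near zero) with the probability that at least one occurs (a union). This is only the textbook identity for independent events, not the paper's actual formulation; the function names and the per-segment scores are hypothetical.

```python
from math import prod


def joint_probability(probs):
    """P(A1 and ... and AN), assuming the events are mutually independent."""
    return prod(probs)


def union_probability(probs):
    """P(A1 or ... or AN), assuming the events are mutually independent.

    Uses the complement rule: 1 - P(no event occurs).
    """
    return 1.0 - prod(1.0 - p for p in probs)


# Hypothetical per-segment match probabilities for one utterance.
clean = [0.9, 0.8, 0.85]
# Same utterance with the middle segment wiped out by a noise burst (assumed value).
corrupted = [0.9, 1e-6, 0.85]

print(joint_probability(clean), joint_probability(corrupted))
print(union_probability(clean), union_probability(corrupted))
```

With these made-up numbers the joint score drops by several orders of magnitude once a single segment is corrupted, while the union score stays close to its clean value, which is the intuition behind combining segments in a way that does not require knowing where the noise occurred.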
SIMILAR VOLUMES
This paper presents a model for the representation and retrieval of structured documents considering their temporal properties. The purpose of this model is to serve as a platform for the development of digital library applications. Thus, it consists of both a new data model and a query
We estimate the union premium for young men over a period of declining unionization (1980-87) through a procedure which identifies the alternative sources of the endogeneity of union status. While we estimate the average increase in wages resulting from union employment to be in excess of 20% we find
We consider an adaptation of the well-known logistic equation in mathematical ecology in which the population is assumed to diffuse and for which the average growth rate is a function of some specified delayed argument. Using a combination of analytical and numerical techniques, we investigate the e
In this paper we report our recent research whose goal is to improve the performance of a novel speech recognizer based on an underlying statistical hidden dynamic model of phonetic reduction in the production of conversational speech. We have developed a path-stack search algorithm which efficientl