In this paper we report our recent research whose goal is to improve the performance of a novel speech recognizer based on an underlying statistical hidden dynamic model of phonetic reduction in the production of conversational speech. We have developed a path-stack search algorithm which efficientl
A statistical model for robust integration of narrowband cues in speech
β Scribed by Lawrence K. Saul; Mazin G. Rahim; Jont B. Allen
- Publisher
- Elsevier Science
- Year
- 2001
- Tongue
- English
- Weight
- 363 KB
- Volume
- 15
- Category
- Article
- ISSN
- 0885-2308
No coin nor oath required. For personal study only.
β¦ Synopsis
We investigate a statistical model for integrating narrowband cues in speech. The model is inspired by two ideas in human speech perception: (i) Fletcher's hypothesis (1953) that independent detectors, working in narrow frequency bands, account for the robustness of auditory strategies, and (ii) Miller and Nicely's analysis (1955) that perceptual confusions in noisy bandlimited speech are correlated with phonetic features. We apply the model to detecting the phonetic feature [+/-sonorant] that distinguishes vowels, approximants, and nasals (sonorants) from stops, fricatives, and affricates (obstruents). The model is represented by a multilayer probabilistic network whose binary hidden variables indicate sonorant cues from different parts of the frequency spectrum. We derive the Expectation-Maximization algorithm for estimating the model's parameters and evaluate its performance on clean and corrupted speech.
π SIMILAR VOLUMES
A new statistical nonlinear model of GaAs FET MMICs which allows the representation of distance-dependent technological parameter variations by means of equivalent circuit parameters, and an automatic extraction procedure, are presented. The capability to reproduce statistical distribution has been
A chance encounter between members of a random repertoire and a molecular target is characteristic of different biological systems, including the immune and olfactory pathways as well as combinatorial libraries. In such systems, the affinity between the target and members of the repertoire is distri
## Abstract Most devices based on shape memory alloys experience large rotations and moderate or finite strains. This motivates the development of finiteβstrain constitutive models together with the appropriate computational counterparts. To this end, in the present paper a threeβdimensional finite
Many biological processes, from cellular metabolism to population dynamics, are characterized by particular allometric scaling (power-law) relationships between size and rate. Although such allometric relationships may be under genetic determination, their precise genetic mechanisms have not been cl