
A mixed-level switching dynamic system for continuous speech recognition

✍ Scribed by Jeff Ma; Li Deng


Publisher: Elsevier Science
Year: 2004
Language: English
File size: 246 KB
Volume: 18
Category: Article
ISSN: 0885-2308


✦ Synopsis


A two-level mixture linear dynamic system model, with frame-level switching parameters in the observation equation and segment-level switching parameters in the target-directed state equation, is developed and evaluated. The main contributions of this work are: (1) a new framework for dealing with mixed-level switching in the dynamic system and (2) the novel use of piecewise linear functions, enabled by the introduction of frame-level switching, to approximate the nonlinear function between the hidden vocal-tract-resonance space and the observable acoustic space. The approximation is accomplished by the frame-dependent switching parameters in the observation equation. In this paper, in a self-contained manner, we highlight the key algorithmic differences from the earlier model, which has only a single level of segment-level switching that is synchronous between the state and observation equations. A series of speech recognition experiments is carried out to evaluate this new model using a subset of Switchboard conversational speech data. The experimental results show that the approximation accuracy improves as the number of switching-parameter values increases. The speech recognizer built from the new mixed-level switching dynamic system model, evaluated with an N-best re-scoring paradigm, shows a moderate word error rate reduction compared with using either single-level switching or no switching parameters.
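The two switching levels described in the synopsis can be illustrated with a small scalar sketch. It is not the authors' model: the state update x_next = phi * x + (1 - phi) * target, the use of tanh as a stand-in for the nonlinear resonance-to-acoustic mapping, the deterministic choice of the linear piece from the state value, and all function names and constants are assumptions made for illustration only.

```python
import math

def target_directed_step(x, target, phi):
    # Assumed noise-free scalar state update that pulls x toward the target.
    return phi * x + (1.0 - phi) * target

def simulate(x0, targets, phi, frames_per_segment):
    # Segment-level switching: the target changes only at segment boundaries.
    x, trajectory = x0, []
    for target in targets:
        for _ in range(frames_per_segment):
            x = target_directed_step(x, target, phi)
            trajectory.append(x)
    return trajectory

def fit_pieces(f, lo, hi, n_pieces):
    # Interpolate f at equally spaced breakpoints; one (slope, intercept) per piece.
    width = (hi - lo) / n_pieces
    pieces = []
    for i in range(n_pieces):
        left, right = lo + i * width, lo + (i + 1) * width
        slope = (f(right) - f(left)) / (right - left)
        pieces.append((slope, f(left) - slope * left))
    return pieces

def observe(x, pieces, lo, hi):
    # Frame-level switching (simplified): pick the piece covering the current state.
    n = len(pieces)
    idx = max(0, min(n - 1, int((x - lo) / (hi - lo) * n)))
    slope, intercept = pieces[idx]
    return slope * x + intercept

def max_error(f, pieces, lo, hi, trajectory):
    # Worst-case gap between the nonlinear map and its piecewise linear stand-in.
    return max(abs(f(x) - observe(x, pieces, lo, hi)) for x in trajectory)

LO, HI = -3.0, 3.0  # hypothetical range of the hidden state
trajectory = simulate(0.0, [2.0, -1.0, 1.5], phi=0.8, frames_per_segment=20)
errors = {
    n: max_error(math.tanh, fit_pieces(math.tanh, LO, HI, n), LO, HI, trajectory)
    for n in (1, 4, 16)
}
for n, err in errors.items():
    print(f"{n:2d} pieces: max abs error = {err:.4f}")
```

In this toy setting the worst-case error shrinks as the number of linear pieces grows, which mirrors the synopsis's qualitative claim about more switching-parameter values. The model in the article is probabilistic, so a deterministic lookup like `observe` should be read only as a way to see the piecewise linear idea.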


📜 SIMILAR VOLUMES


A three level costate prediction method
✍ M. Hassan; R. Hurteau; M.G. Singh; A. Titli 📂 Article 📅 1978 🏛 Elsevier Science 🌐 English ⚖ 315 KB

In this paper a continuous time version of a previous discrete systems optimisation algorithm is developed. The new algorithm uses prediction of costates within a three level structure to provide an efficient organisation of both the storage and the computation. The algorithm which applies to both l