𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Adaptive Fusion of Speech and Lip Information for Robust Speaker Identification

โœ Scribed by T Wark; S Sridharan


Publisher: Elsevier Science
Year: 2001
Tongue: English
Weight: 269 KB
Volume: 11
Category: Article
ISSN: 1051-2004


✦ Synopsis


This paper compares techniques for asynchronous fusion of speech and lip information for robust speaker identification. In any fusion system, the central challenge is to determine the optimal way to combine all information sources under varying conditions. We propose a new method for estimating confidence levels that allows intelligent fusion of the audio and visual data. We describe a secondary classification system in which secondary classifiers approximate the estimation errors of the output likelihoods from the primary classifiers. The error estimates are combined with a dispersion measure, giving an adaptive fusion strategy that depends on the level of data degradation at the time of testing. We compare the performance of this fusion system with two other approaches to linear fusion and show that the use of secondary classifiers is an effective technique for improving classification performance. Identification experiments are performed on the M2VTS multimodal database [26], with encouraging results.
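
The general idea of adaptive linear fusion can be illustrated with a minimal sketch. It is not the authors' method: the synopsis does not give their formulas, and the secondary classifiers are left out entirely. The function names `dispersion` and `fuse`, the particular dispersion measure (gap between the best score and the mean of the remaining scores), and the weight rule are all assumptions made for illustration. Each modality is assumed to supply one log-likelihood per enrolled speaker.

```python
def dispersion(scores):
    """Hypothetical confidence measure: how far the best score stands
    above the mean of the remaining scores (always >= 0).
    Requires at least two scores."""
    ranked = sorted(scores, reverse=True)
    rest = ranked[1:]
    return ranked[0] - sum(rest) / len(rest)


def fuse(audio_ll, visual_ll):
    """Linear fusion of per-speaker log-likelihoods.

    The audio weight alpha grows with the audio dispersion relative to
    the visual dispersion (an assumed rule, not taken from the paper).
    Returns (index of the best speaker, alpha).
    """
    d_a = dispersion(audio_ll)
    d_v = dispersion(visual_ll)
    total = d_a + d_v
    # Fall back to equal weights when neither modality discriminates.
    alpha = d_a / total if total > 0 else 0.5
    fused = [alpha * a + (1.0 - alpha) * v
             for a, v in zip(audio_ll, visual_ll)]
    best = max(range(len(fused)), key=fused.__getitem__)
    return best, alpha


# Clean audio: the audio scores separate the speakers sharply.
print(fuse([-10.0, -50.0, -60.0], [-30.0, -28.0, -31.0]))
# Degraded audio: flat audio scores, so the visual scores decide.
print(fuse([-40.0, -40.0, -40.0], [-30.0, -10.0, -35.0]))
```

In this sketch a modality whose scores are nearly flat across speakers, as might happen with noisy audio, receives little weight, which mirrors the synopsis's goal of adapting the fusion to the level of data degradation at test time.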

