𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Gender Gates for Telephone-Based Automatic Speaker Recognition

✍ Scribed by Pierre Castellano; Stefan Slomka; Peter Barger


Publisher
Elsevier Science
Year
1997
Tongue
English
Weight
331 KB
Volume
7
Category
Article
ISSN
1051-2004

No coin nor oath required. For personal study only.

✦ Synopsis


The present work demonstrates a need for enhancing text-independent, telephone based, automatic speaker recognition systems with a gender gate. A range of gender gates and speech parameter types are proposed for this problem. These gates and parameters are also investigated in the context of speech degraded by coding and reverberation. It is found that the performance of the most accurate gender gates and speech parameters is similar for uncoded, coded, and reverberated speech. However, the most accurate gender gates and speech parameter types differ slightly across the three scenarios. The most robust all-round gender gates consist of two Mahalanobis distance classifiers with fused outputs or pitch fused to the output of one such classifier. The best all-round speech parameters were reflection and Mel-based cepstrum coefficients.


πŸ“œ SIMILAR VOLUMES


AMIRAL: A Block-Segmental Multirecognize
✍ Corinne Fredouille; Jean-FranΓ§ois Bonastre; Teva Merlin πŸ“‚ Article πŸ“… 2000 πŸ› Elsevier Science 🌐 English βš– 323 KB

In the wide domain of automatic speech recognition, extracting the relevant information carried by the speech signal is far from easy. Diversity, redundancy, and variability, the main characteristics of the speech signal, make this task particularly difficult. The work reported here presents a multi

N-Best-based unsupervised speaker adapta
✍ Tomoko Matsui; Sadaoki Furui πŸ“‚ Article πŸ“… 1998 πŸ› Elsevier Science 🌐 English βš– 251 KB

This paper proposes an instantaneous speaker adaptation method that uses N-best decoding for continuous mixture-density hidden-Markovmodel-based speech-recognition systems. This method is effective even for speakers whose decoding using speaker-independent (SI) models are error-prone and for whom sp

Automatic selection of phonetically dist
✍ Jia-lin Shen; Hsin-min Wang; Ren-yuan Lyu; Lin-shan Lee πŸ“‚ Article πŸ“… 1999 πŸ› Elsevier Science 🌐 English βš– 163 KB

This paper presents an approach of automatic selection of phonetically distributed sentence sets for speaker adaptation, and applies the concept to the task of Mandarin speech recognition with very large vocabulary. This is a different approach to the adaptation data selection problem. A computer al