𝔖 Bobbio Scriptorium
✦   LIBER   ✦

Segmental Approaches for Automatic Speaker Verification

✍ Scribed by Dijana Petrovska-Delacrétaz; Jan Černocký; Jean Hennebert; Gérard Chollet


Publisher
Elsevier Science
Year
2000
Tongue
English
Weight
196 KB
Volume
10
Category
Article
ISSN
1051-2004

No coin nor oath required. For personal study only.

✦ Synopsis


Speech is composed of different sounds (acoustic segments). Speakers differ in their pronunciation of these sounds. The segmental approaches described in this paper are meant to exploit these differences for speaker verification purposes. For such approaches, the speech is divided into different classes, and the speaker modeling is done for each class. The speech segmentation applied is based on automatic language independent speech processing tools that provide a segmentation of the speech requiring neither phonetic nor orthographic transcriptions of the speech data. Two different speaker modeling approaches, based on multilayer perceptrons (MLPs) and on Gaussian mixture models (GMMs), are studied. The MLPbased segmental systems have performance comparable to that of the global MLP-based systems, and in the mismatched train-test conditions slightly better results are obtained with the segmental MLP system. The segmental GMM systems gave poorer results than the equivalent global GMM systems.


📜 SIMILAR VOLUMES


AMIRAL: A Block-Segmental Multirecognize
✍ Corinne Fredouille; Jean-François Bonastre; Teva Merlin 📂 Article 📅 2000 🏛 Elsevier Science 🌐 English ⚖ 323 KB

In the wide domain of automatic speech recognition, extracting the relevant information carried by the speech signal is far from easy. Diversity, redundancy, and variability, the main characteristics of the speech signal, make this task particularly difficult. The work reported here presents a multi

Predictive models for speaker verificati
✍ E. Ambikairajah; M. Keane; A. Kelly; L. Kilmartin; G. Tattersall 📂 Article 📅 1993 🏛 Elsevier Science 🌐 English ⚖ 632 KB
Score Normalization for Text-Independent
✍ Roland Auckenthaler; Michael Carey; Harvey Lloyd-Thomas 📂 Article 📅 2000 🏛 Elsevier Science 🌐 English ⚖ 295 KB

This paper discusses several aspects of score normalization for textindependent speaker verification. The theory of score normalization is explained using Bayes' theorem and detection error trade-off plots. Based on the theory, the world, cohort, and zero normalization techniques are explained. A no

Gender Gates for Telephone-Based Automat
✍ Pierre Castellano; Stefan Slomka; Peter Barger 📂 Article 📅 1997 🏛 Elsevier Science 🌐 English ⚖ 331 KB

The present work demonstrates a need for enhancing text-independent, telephone based, automatic speaker recognition systems with a gender gate. A range of gender gates and speech parameter types are proposed for this problem. These gates and parameters are also investigated in the context of speech