In the wide domain of automatic speech recognition, extracting the relevant information carried by the speech signal is far from easy. Diversity, redundancy, and variability, the main characteristics of the speech signal, make this task particularly difficult. The work reported here presents a multi
Segmental Approaches for Automatic Speaker Verification
✍ Scribed by Dijana Petrovska-Delacrétaz; Jan Černocký; Jean Hennebert; Gérard Chollet
- Publisher: Elsevier Science
- Year: 2000
- Language: English
- File size: 196 KB
- Volume: 10
- Category: Article
- ISSN: 1051-2004
✦ Synopsis
Speech is composed of different sounds (acoustic segments), and speakers differ in how they pronounce these sounds. The segmental approaches described in this paper are meant to exploit these differences for speaker verification. In such approaches, the speech is divided into different classes, and the speaker is modeled separately for each class. The segmentation relies on automatic, language-independent speech processing tools and requires neither phonetic nor orthographic transcriptions of the speech data. Two speaker modeling approaches are studied, one based on multilayer perceptrons (MLPs) and one based on Gaussian mixture models (GMMs). The MLP-based segmental systems perform comparably to the global MLP-based systems, and under mismatched train-test conditions the segmental MLP system gives slightly better results. The segmental GMM systems gave poorer results than the equivalent global GMM systems.
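The per-class modeling idea in the synopsis can be illustrated with a small sketch. It is not the authors' system. It assumes that class labels for each frame are already available (in the paper they come from automatic segmentation tools), it replaces each GMM with a single diagonal Gaussian to stay short, and it scores with a log-likelihood ratio against a background ("world") model, a common choice that the synopsis does not confirm. All function names and the toy data are hypothetical.

```python
import math
from collections import defaultdict

def fit_diag_gaussian(frames, var_floor=1e-3):
    """Fit a diagonal Gaussian (a 1-component stand-in for a GMM)."""
    n, dim = len(frames), len(frames[0])
    mean = [sum(f[d] for f in frames) / n for d in range(dim)]
    var = [max(sum((f[d] - mean[d]) ** 2 for f in frames) / n, var_floor)
           for d in range(dim)]
    return mean, var

def log_likelihood(frame, model):
    """Log density of one frame under a diagonal Gaussian."""
    mean, var = model
    return sum(-0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
               for x, m, v in zip(frame, mean, var))

def train_segmental(frames, labels):
    """Train one model per segment class (assumed labels, see lead-in)."""
    by_class = defaultdict(list)
    for frame, label in zip(frames, labels):
        by_class[label].append(frame)
    return {label: fit_diag_gaussian(fs) for label, fs in by_class.items()}

def segmental_score(frames, labels, speaker_models, world_models):
    """Average log-likelihood ratio; each frame uses its own class's models."""
    total, count = 0.0, 0
    for frame, label in zip(frames, labels):
        if label in speaker_models and label in world_models:
            total += (log_likelihood(frame, speaker_models[label])
                      - log_likelihood(frame, world_models[label]))
            count += 1
    return total / count if count else 0.0

# Toy 2-D "features" with two segment classes (0 and 1); purely illustrative.
speaker_frames = [[0.0, 0.1], [0.2, -0.1], [-0.1, 0.0],
                  [5.0, 5.1], [5.2, 4.9], [4.9, 5.0]]
speaker_labels = [0, 0, 0, 1, 1, 1]
world_frames = [[1.0, 1.0], [-1.0, -1.0], [1.5, -1.5], [-1.5, 1.5],
                [4.0, 4.0], [6.0, 6.0], [3.5, 6.5], [6.5, 3.5]]
world_labels = [0, 0, 0, 0, 1, 1, 1, 1]

speaker_models = train_segmental(speaker_frames, speaker_labels)
world_models = train_segmental(world_frames, world_labels)

# Score a trial from the claimed speaker and a trial from an impostor.
target_score = segmental_score([[0.1, 0.0], [5.1, 5.0]], [0, 1],
                               speaker_models, world_models)
impostor_score = segmental_score([[1.2, 1.1], [6.1, 6.0]], [0, 1],
                                 speaker_models, world_models)
accept_target = target_score > 0.0      # threshold 0.0 is an arbitrary choice
accept_impostor = impostor_score > 0.0
```

A global system in this sketch would amount to calling the same functions with every frame given the same label, which makes the two setups easy to compare on identical data.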
📜 SIMILAR VOLUMES
This paper discusses several aspects of score normalization for text-independent speaker verification. The theory of score normalization is explained using Bayes' theorem and detection error trade-off plots. Based on the theory, the world, cohort, and zero normalization techniques are explained. A no
The present work demonstrates a need for enhancing text-independent, telephone-based, automatic speaker recognition systems with a gender gate. A range of gender gates and speech parameter types are proposed for this problem. These gates and parameters are also investigated in the context of speech