๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

Approaches to Speaker Detection and Tracking in Conversational Speech

โœ Scribed by Robert B. Dunn; Douglas A. Reynolds; Thomas F. Quatieri


Publisher
Elsevier Science
Year
2000
Tongue
English
Weight
257 KB
Volume
10
Category
Article
ISSN
1051-2004

No coin nor oath required. For personal study only.

โœฆ Synopsis


Two approaches to detecting and tracking speakers in multispeaker audio are described. Both approaches use an adapted Gaussian mixture model, universal background model (GMM-UBM) speaker detection system as the core speaker recognition engine. In one approach, the individual log-likelihood ratio scores, which are produced on a frame-by-frame basis by the GMM-UBM system, are used to first partition the speech file into speaker homogenous regions and then to create scores for these regions. We refer to this approach as internal segmentation. Another approach uses an external segmentation algorithm, based on blind clustering, to partition the speech file into speaker homogenous regions. The adapted GMM-UBM system then scores each of these regions as in the single-speaker recognition case. We show that the external segmentation system outperforms the internal segmentation system for both detection and tracking. In addition, we show how different components of the detection and tracking algorithms contribute to the overall system performance.


๐Ÿ“œ SIMILAR VOLUMES


Local Normalization and Delayed Decision
โœ Johan Koolwaaij; Lou Boves ๐Ÿ“‚ Article ๐Ÿ“… 2000 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 268 KB

This paper describes A2RT's speaker detection and tracking system and its performance on the 1999 NIST speaker recognition evaluation data. The system does not consist of concatenated modules such as, for instance, silence-speech detection, handset and gender detection, and finally speaker detection

The ELISA Systems for the NIST'99 Evalua
๐Ÿ“‚ Article ๐Ÿ“… 2000 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 145 KB

This article presents the text-independent speaker detection and tracking systems developed by the members of the ELISA Consortium for the NIST'99 speaker recognition evaluation campaign. ELISA is a consortium grouping researchers of several laboratories sharing software modules, resources and exper

Approaches to Adulteration Detection in
โœ Briandet, R; Kemsley, E K; Wilson, R H ๐Ÿ“‚ Article ๐Ÿ“… 1996 ๐Ÿ› John Wiley and Sons ๐ŸŒ English โš– 623 KB

Fourier transform infrared (FTIR) spectroscopy is examined as a rapid alternative to wet chemistry methods for the detection of adulteration of freezedried instant coffees. Spectra have been collected of pure coffees, and of samples adulterated with glucose, starch or chicory in the range 20-100 g k