๐”– Bobbio Scriptorium
โœฆ   LIBER   โœฆ

A phone-based approach to non-linguistic speech feature identification

โœ Scribed by Lori F Lamel; Jean-Luc Gauvain


Publisher
Elsevier Science
Year
1995
Tongue
English
Weight
99 KB
Volume
9
Category
Article
ISSN
0885-2308

No coin nor oath required. For personal study only.

โœฆ Synopsis


In this paper we present a general approach to identifying nonlinguistic speech features from the recorded signal using phone-based acoustic likelihoods. The basic idea is to process the unknown speech signal by feature-specific phone model sets in parallel, and to hypothesize the feature value associated with the model set having the highest likelihood. This technique is shown to be effective for textindependent gender, speaker and language identification. Textindependent speaker identification accuracies of 98โ€ข8% on TIMIT (168 speakers) and 99โ€ข2% on BREF (65 speakers), were obtained with one utterance per speaker, and 100% with two utterances for both corpora. Experiments in which speaker-specific models were estimated without using the phonetic transcriptions for the TIMIT speakers had the same identification accuracies as those obtained with the use of the transcriptions. French/English language identification is better than 99% with 2 s of read, laboratory speech. For spontaneous telephone speech from the OGI corpus, the language can be identified as French or English with 82% accuracy with 10 s of speech. The ten language identification rate using the OGI corpus was 59โ€ข7% with 10 s of signal.


๐Ÿ“œ SIMILAR VOLUMES


Development of an approach to automatic
โœ Yonghong Yan; Etienne Barnard; Ronald A. Cole ๐Ÿ“‚ Article ๐Ÿ“… 1996 ๐Ÿ› Elsevier Science ๐ŸŒ English โš– 219 KB

An Automatic Language Identification (LID) approach is presented. The baseline LID system consists of three parts: (1) hidden Markov model (HMM) based context-independent phone recognizers, (2) language identification score generators and (3) a linear language classifier. The system exploits languag