✦ LIBER ✦

Text- and speech-based phonotactic models for spoken language identification of Basque and Spanish

✍ Scribed by Víctor G. Guijarrubia; M. Inés Torres

Book ID: 103879763
Publisher: Elsevier Science
Year: 2010
Tongue: English
Weight: 975 KB
Volume: 31
Category: Article
ISSN: 0167-8655
DOI: 10.1016/j.patrec.2009.11.014

No coin nor oath required. For personal study only.

✦ Synopsis

This paper presents a series of spoken language identification experiments involving Spanish and Basque. Spanish and Basque are both official languages in the Basque Country, a region located in northern Spain. We focused our research on the study of several phonotactic-based methodologies, analysing at the same time the performance of phonotactic models trained from text and speech samples and the use of phone and phone sequences as decoding units. Although we focus mainly on Spanish-Basque identification, the analysis is later extended to English, so that more generic conclusions can be drawn. From the bilingual results, we can conclude that the text-based phonotactic models can perform similarly to the audio-based ones when applied to read speech. Moreover, when using task-specific information it is also possible to achieve a high accuracy. The use of phone sequences as decoding units results, in most of the cases, in a decrease in performance and appears to be useful when constraining the phone decoders to those sequences. Similar conclusions can be drawn from the trilingual experiments.

📜 SIMILAR VOLUMES

Interpolation of n-gram and mutual-infor

Interpolation of n-gram and mutual-information based trigger pair language models for Mandarin speech recognition

✍ Z. GuoDong; L. KimTeng 📂 Article 📅 1999 🏛 Elsevier Science 🌐 English ⚖ 191 KB

While n-gram modeling is simple and dominant in speech recognition, it can only capture the short-distance context dependency within an n-word window where currently the largest practical n for natural language is three. However, many of the context dependencies in natural language occur beyond a th

Hidden Markov model-based approach for g

Hidden Markov model-based approach for generation of Pitman shorthand language symbols for consonants and vowels from spoken english

✍ G. Hemantha Kumar; M. Ravishankar; P. Nagabushan; Basavaraj S. Anami 📂 Article 📅 2006 🏛 Indian Academy of Sciences 🌐 English ⚖ 310 KB

[IEEE 2010 7th International Symposium o

[IEEE 2010 7th International Symposium on Chinese Spoken Language Processing (ISCSLP) - Tainan, Taiwan (2010.11.29-2010.12.3)] 2010 7th International Symposium on Chinese Spoken Language Processing - Combining HMM spectrum models and ANN prosody models for speech synthesis of syllable prominent languages

✍ Gu, Hung-Yan; Lai, Ming-Yen; Tsai, Sung-Feng 📂 Article 📅 2010 🏛 IEEE ⚖ 115 KB

[IEEE 2011 UkSim 13th International Conf

[IEEE 2011 UkSim 13th International Conference on Computer Modelling and Simulation (UKSim) - Cambridge, United Kingdom (2011.03.30-2011.04.1)] 2011 UkSim 13th International Conference on Computer Modelling and Simulation - Considerations to Spoken Language Recognition for Text-to-Speech Applications

✍ Rafieee, M. Saadeq; Jafari, Somayeh; Ahmadi, Hesamoddin Shahriari; Jafari, Masum 📂 Article 📅 2011 🏛 IEEE ⚖ 773 KB

[Lecture Notes in Computer Science] Spee

[Lecture Notes in Computer Science] Speech and Computer Volume 8113 || Application of l 1 Estimation of Gaussian Mixture Model Parameters for Language Identification

✍ Železný, Miloš; Habernal, Ivan; Ronzhin, Andrey 📂 Article 📅 2013 🏛 Springer International Publishing ⚖ 158 KB

[Lecture Notes in Computer Science] Comp

[Lecture Notes in Computer Science] Computer Processing of Oriental Languages. Language Technology for the Knowledge-based Economy Volume 5459 || CRF Models for Tamil Part of Speech Tagging and Chunking

✍ Li, Wenjie; Mollá-Aliod, Diego 📂 Article 📅 2009 🏛 Springer Berlin Heidelberg ⚖ 337 KB