
Text- and speech-based phonotactic models for spoken language identification of Basque and Spanish

✍ Scribed by Víctor G. Guijarrubia; M. Inés Torres


Book ID: 103879763
Publisher: Elsevier Science
Year: 2010
Tongue: English
Weight: 975 KB
Volume: 31
Category: Article
ISSN: 0167-8655


✦ Synopsis


This paper presents a series of spoken language identification experiments involving Spanish and Basque, both official languages of the Basque Country, a region in northern Spain. We focused our research on several phonotactic methodologies, comparing the performance of phonotactic models trained from text samples with that of models trained from speech samples, and examining the use of phones and phone sequences as decoding units. Although we focus mainly on Spanish-Basque identification, the analysis is later extended to English so that more general conclusions can be drawn. From the bilingual results, we conclude that text-based phonotactic models can perform similarly to audio-based ones when applied to read speech. Moreover, high accuracy can also be achieved when task-specific information is used. Using phone sequences as decoding units decreases performance in most cases and appears to be useful when the phone decoders are constrained to those sequences. Similar conclusions can be drawn from the trilingual experiments.
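
As a rough illustration of the phonotactic idea summarised above, and not the authors' actual system, the sketch below trains one phone bigram model per language and assigns an utterance to the language whose model gives its phone sequence the highest log-likelihood. The add-one smoothing, the function names, and the tiny phone strings are all assumptions made up for this example; a real system would obtain the sequences from a phone decoder (speech-based models) or from phonetically transcribed text (text-based models).

```python
import math
from collections import Counter

def train_bigram(sequences):
    """Count phone bigrams and their left contexts over training sequences."""
    bigrams, contexts, vocab = Counter(), Counter(), set()
    for seq in sequences:
        padded = ["<s>"] + list(seq) + ["</s>"]
        vocab.update(padded)
        for prev, cur in zip(padded, padded[1:]):
            bigrams[(prev, cur)] += 1
            contexts[prev] += 1
    return bigrams, contexts, vocab

def log_likelihood(seq, model, vocab_size):
    """Add-one smoothed bigram log-likelihood of a phone sequence."""
    bigrams, contexts, _ = model
    padded = ["<s>"] + list(seq) + ["</s>"]
    total = 0.0
    for prev, cur in zip(padded, padded[1:]):
        prob = (bigrams[(prev, cur)] + 1) / (contexts[prev] + vocab_size)
        total += math.log(prob)
    return total

def identify(seq, models):
    """Return the language whose phonotactic model scores the sequence highest."""
    # Use one shared vocabulary size so the scores are comparable across models.
    vocab_size = len(set().union(*(m[2] for m in models.values())))
    return max(models, key=lambda lang: log_likelihood(seq, models[lang], vocab_size))

# Hypothetical toy phone sequences, invented purely for illustration.
toy_data = {
    "spanish": [["k", "a", "s", "a"], ["p", "e", "r", "o"], ["m", "e", "s", "a"]],
    "basque": [["e", "tx", "e"], ["tx", "a", "k", "u", "r"], ["e", "tz", "i"]],
}
models = {lang: train_bigram(seqs) for lang, seqs in toy_data.items()}

print(identify(["e", "tx", "e"], models))
print(identify(["m", "e", "s", "a"], models))
```

Under this framing, text-based and speech-based training differ only in where the training sequences come from, while the scoring step stays the same.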


📜 SIMILAR VOLUMES


Interpolation of n-gram and mutual-infor
✍ Z. GuoDong; L. KimTeng 📂 Article 📅 1999 🏛 Elsevier Science 🌐 English ⚖ 191 KB

While n-gram modeling is simple and dominant in speech recognition, it can only capture the short-distance context dependency within an n-word window where currently the largest practical n for natural language is three. However, many of the context dependencies in natural language occur beyond a th