<p><p>This updated book expands upon prosody for recognition applications of speech processing. It includes importance of prosody for speech processing applications; builds on why prosody needs to be incorporated in speech processing applications; and presents methods for extraction and representati
Automatic Speech Recognition and Translation for Low Resource Languages
β Scribed by L. Ashok Kumar; D. Karthika Renuka; Bharathi Raja Chakravarthi; Thomas Mandl
- Publisher
- WILEY
- Year
- 2024
- Tongue
- English
- Leaves
- 646
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
Automatic Speech Recognition and Translation for Low Resource Languages contains groundbreaking research from experts and researchers sharing innovative solutions that address language challenges in low-resource environments. The book begins by delving into the fundamental concepts of ASR and translation, providing readers with a solid foundation for understanding the subsequent chapters. It then explores the intricacies of low-resource languages, analyzing the factors that contribute to their challenges and the significance of developing tailored solutions to overcome them.
β¦ Table of Contents
Cover
Table of Contents
Series Page
Title Page
Copyright Page
Dedication Page
Foreword
Preface
Acknowledgement
1 A Hybrid Deep Learning Model for Emotion Conversion in Tamil Language
1.1 Introduction
1.2 Dataset Collection and Database Preparation
1.3 Pre-Trained CNN Architectural Models
1.4 Proposed Method for Emotion Transformation
1.5 Synthesized Speech Evaluation
1.6 Conclusion
References
2 Attention-Based End-to-End Automatic Speech Recognition System for Vulnerable Individuals in Tamil
2.1 Introduction
2.2 Related Work
2.3 Dataset Description
2.4 Implementation
2.5 Results and Discussion
2.6 Conclusion
References
3 Speech-Based Dialect Identification for Tamil
3.1 Introduction
3.2 Literature Survey
3.3 Proposed Methodology
3.4 Experimental Setup and Results
3.5 Conclusion
References
4 Language Identification Using Speech Denoising Techniques: A Review
4.1 Introduction
4.2 Speech Denoising and Language Identification
4.3 The Noisy Speech Signal is Denoised Using Temporal and Spectral Processing
4.4 The Denoised Signal is Classified to Identify the Language Spoken Using Recent Machine Learning Algorithm
4.5 Conclusion
References
5 Domain Adaptation-Based Self-Supervised ASR Models for Low-Resource Target Domain
5.1 Introduction
5.2 Literature Survey
5.3 Dataset Description
5.4 Self-Supervised ASR Model
5.5 Domain Adaptation for Low-Resource Target Domain
5.6 Implementation of Domain Adaptation on wav2vec2 Model for Low-Resource Target Domain
5.7 Results Analysis
5.8 Conclusion
Acknowledgements
References
6 ASR Models from Conventional Statistical Models to Transformers and Transfer Learning
6.1 Introduction
6.2 Preprocessing
6.3 Feature Extraction
6.4 Generative Models for ASR
6.5 Discriminative Models for ASR
6.6 Deep Architectures for Low-Resource Languages
6.7 The DNN-HMM Hybrid System
6.8 Summary
References
7 Syllable-Level Morphological Segmentation of Kannada and Tulu Words
7.1 Introduction
7.2 Related Work
7.3 Corpus Construction and Annotation
7.4 Methodology
7.5 Experiments and Results
7.6 Conclusion and Future Work
References
8 A New Robust Deep Learning-Based Automatic Speech Recognition and Machine Transition Model for Tamil and Gujarati
8.1 Introduction
8.2 Literature Survey
8.3 Proposed Architecture
8.4 Experimental Setup
8.5 Results
8.6 Conclusion
References
9 Forensic Voice Comparison Approaches for Low-Resource Languages
9.1 Introduction
9.2 Challenges of Forensic Voice Comparison
9.3 Motivation
9.4 Review on Forensic Voice Comparison Approaches
9.5 Low-Resource Language Datasets
9.6 Applications of Forensic Voice Comparison
9.7 Future Research Scope
9.8 Conclusion
References
10 CoRePooLβCorpus for Resource-Poor Languages: Badaga Speech Corpus
10.1 Introduction
10.2 CoRePooL
10.3 Benchmarking
10.4 Conclusion
Acknowledgement
References
11 Bridging the Linguistic Gap: A Deep Learning-Based Image-to-Text Converter for Ancient Tamil with Web Interface
11.1 Introduction
11.2 The Historical Significance of Ancient Tamil Scripts
11.3 Realization Process
11.4 Dataset Preparation
11.5 Convolution Neural Network
11.6 Webpage with Multilingual Translator
11.7 Results and Discussions
11.8 Conclusion and Future Work
References
12 Voice Cloning for Low-Resource Languages: Investigating the Prospects for Tamil
12.1 Introduction
12.2 Literature Review
12.3 Dataset
12.4 Methodology
12.5 Results and Discussion
12.6 Conclusion
References
13 Transformer-Based Multilingual Automatic Speech Recognition (ASR) Model for Dravidian Languages
13.1 Introduction
13.2 Literature Review
13.3 Dataset Description
13.4 Methodology
13.5 Experimentation Results and Analysis
13.6 Conclusion
References
14 Language Detection Based on Audio for Indian Languages
14.1 Introduction
14.2 Literature Review
14.3 Language Detector System
14.4 Experiments and Outcomes
14.5 Conclusion
References
15 Strategies for Corpus Development for Low-Resource Languages: Insights from Nepal
15.1 Low-Resource Languages and the Constraints
15.2 Language Resources Map for the Languages of Nepal
15.3 Unicode Inception and Advent in Nepal
15.4 Speech and Translation Initiatives
15.5 Corpus Development EffortsβSharing Our Experiences
15.6 Constraints to Competitive Language Technology Research for Nepali and Nepalβs Languages
15.7 Roadmap for the Future
15.8 Conclusion
References
16 Deep Neural Machine Translation (DNMT): Hybrid Deep Learning Architecture-Based English-to-Indian Language Translation
16.1 Introduction
16.2 Literature Survey
16.3 Background
16.4 Proposed System
16.5 Experimental Setup and Results Analysis
16.6 Conclusion and Future Work
References
17 Multiview Learning-Based Speech Recognition for Low-Resource Languages
17.1 Introduction
17.2 Approaches of Information Fusion in ASR
17.3 Partition-Based Multiview Learning
17.4 Data Augmentation Techniques
17.5 Conclusion
References
18 Automatic Speech Recognition Based on Improved Deep Learning
18.1 Introduction
18.2 Literature Review
18.3 Proposed Methodology
18.4 Results and Discussion
18.5 Conclusion
References
19 Comprehensive Analysis of State-of-the-Art Approaches for Speaker Diarization
19.1 Introduction
19.2 Generic Model of Speaker Diarization System
19.3 Review of Existing Speaker Diarization Techniques
19.4 Challenges
19.5 Applications
19.6 Conclusion
References
20 Spoken Language Translation in Low-Resource Language
20.1 Introduction
20.2 Related Work
20.3 MT Algorithms
20.4 Dataset Collection
20.5 Conclusion
References
Index
End User License Agreement
π SIMILAR VOLUMES
Automatic Speech Translation introduces recent results of Japanese research and development in speech translation and speech recognition. Topics covered include: fundamental concepts of speech recognition; speech pattern representation; phoneme-based HMM phoneme recognition; continuous speech recogn
In William J. Hardcastle, John Laver, and Fiona E. Gibbon, editors, Handbook of Phonetic Sciences, chapter 22 (Wiley Blackwell, 2010, ISBN: 978-1-4051-4590-9) β on pp. 804β838.<div class="bb-sep"></div>Speech recognitionβthe transcription of an acoustic speech signal into a string of wordsβis a hard
<p>Machine Translation and Transliteration involving Related, Low-resource Languages discusses an important aspect of natural language processing that has received lesser attention: translation and transliteration involving related languages in a low-resource setting. This is a very relevant real-wo
<p>This book covers language modeling and automatic speech recognition for inflective languages (e.g. Slavic languages), which represent roughly half of the languages spoken in Europe. These languages do not perform as well as English in speech recognition systems and it is therefore harder to devel