Dimension-based Quality Modeling of Transmitted Speech

✍ Scribed by Marcel Wältermann (auth.)

Publisher: Springer-Verlag Berlin Heidelberg
Year: 2013
Tongue: English
Leaves: 206
Series: T-Labs Series in Telecommunication Services
Edition: 1
Category: Library

No coin nor oath required. For personal study only.

✦ Synopsis

In this book, speech transmission quality is modeled on the basis of perceptual dimensions. The author identifies those dimensions that are relevant for today's public-switched and packet-based telecommunication systems, regarding the complete transmission path from the mouth of the speaker to the ear of the listener. Both narrowband (300-3400 Hz) as well as wideband (50-7000 Hz) speech transmission is taken into account. A new analytical assessment method is presented that allows the dimensions to be rated by non-expert listeners in a direct way. Due to the efficiency of the test method, a relatively large number of stimuli can be assessed in auditory tests. The test method is applied in two auditory experiments. The book gives the evidence that this test method provides meaningful and reliable results. The resulting dimension scores together with respective overall quality ratings form the basis for a new parametric model for the quality estimation of transmitted speech based on the perceptual dimensions. In a two-step model approach, instrumental dimension models estimate dimension impairment factors in a first step. The resulting dimension estimates are combined by a Euclidean integration function in a second step in order to provide an estimate of the total impairment.

✦ Table of Contents

Front Matter....Pages i-xii
Introduction....Pages 1-3
A Dimension-Based Approach to Mouth-to-Ear Speech Transmission Quality....Pages 5-61
Quality Feature Space of Transmitted Speech....Pages 63-93
Direct Scaling of Speech Quality Dimensions....Pages 95-113
Instrumental Dimension-Based Speech Quality Modeling....Pages 115-162
Conclusions and Future Work....Pages 163-167
Back Matter....Pages 169-203

✦ Subjects

Communications Engineering, Networks;Information Systems and Communication Service;Acoustics

📜 SIMILAR VOLUMES

Quality of life modelling on the basis o

📁 Quality of life modelling on the basis of qualitative and quantitative data

✍ Křupka, Jiří; Kašparová, Miloslava; Mandys, Jan; Jirava, Pavel 📂 Library 📅 2012 🏛 Intech 🌐 English

Deep Learning Based Speech Quality Predi

📁 Deep Learning Based Speech Quality Prediction

✍ Gabriel Mittag 📂 Library 📅 2022 🏛 Springer 🌐 English

This book presents how to apply recent machine learning (deep learning) methods for the task of speech quality prediction. The author shows how recent advancements in machine learning can be leveraged for the task of speech quality prediction and provides an in-depth analysis of the suitabilit

Quality of Synthetic Speech: Perceptual

📁 Quality of Synthetic Speech: Perceptual Dimensions, Influencing Factors, and Instrumental Assessment

✍ Florian Hinterleitner (auth.) 📂 Library 📅 2017 🏛 Springer Singapore 🌐 English

This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency a

Dimension-Based Quality Analysis and Pre

📁 Dimension-Based Quality Analysis and Prediction for Videotelephony

✍ Falk Ralph Schiffner 📂 Library 📅 2021 🏛 Springer International Publishing;Springer 🌐 English

This book provides an in-depth investigation of the quality relevant perceptual video space in the domain of videotelephony. The author presents an extensive investigation and quality modeling of the underlying video quality dimensions and the overall quality. The author examines the underlyin

Dimension-based Quality Analysis and Pre

📁 Dimension-based Quality Analysis and Prediction for Videotelephony

✍ Falk Schiffner 📂 Library 📅 2020 🏛 Springer Nature 🌐 English

This book provides an in-depth investigation of the quality relevant perceptual video space in the domain of videotelephony. The author presents an extensive investigation and quality modeling of the underlying video quality dimensions and the overall quality. The author examines the underlying q

Speech Prosody in Speech Synthesis: Mode

📁 Speech Prosody in Speech Synthesis: Modeling and generation of prosody for high quality and flexible speech synthesis

✍ Keikichi Hirose, Jianhua Tao (eds.) 📂 Library 📅 2015 🏛 Springer-Verlag Berlin Heidelberg 🌐 English

The volume addresses issues concerning prosody generation in speech synthesis, including prosody modeling, how we can convey para- and non-linguistic information in speech synthesis, and prosody control in speech synthesis (including prosody conversions). A high level of quality has already be