Dimension-based Quality Modeling of Transmitted Speech
Scribed by Marcel Wältermann (auth.)
- Publisher
- Springer-Verlag Berlin Heidelberg
- Year
- 2013
- Tongue
- English
- Leaves
- 206
- Series
- T-Labs Series in Telecommunication Services
- Edition
- 1
- Category
- Library
✦ Synopsis
In this book, speech transmission quality is modeled on the basis of perceptual dimensions. The author identifies the dimensions that are relevant for today's public-switched and packet-based telecommunication systems, considering the complete transmission path from the mouth of the speaker to the ear of the listener. Both narrowband (300-3400 Hz) and wideband (50-7000 Hz) speech transmission are taken into account. A new analytical assessment method is presented that allows non-expert listeners to rate the dimensions directly. Because the test method is efficient, a relatively large number of stimuli can be assessed in auditory tests. The method is applied in two auditory experiments, and the book provides evidence that it yields meaningful and reliable results. The resulting dimension scores, together with the corresponding overall quality ratings, form the basis for a new parametric model that estimates the quality of transmitted speech from the perceptual dimensions. The model follows a two-step approach: in the first step, instrumental dimension models estimate dimension impairment factors; in the second step, a Euclidean integration function combines these estimates into an estimate of the total impairment.
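The second step of the model described in the synopsis can be illustrated with a minimal sketch. It assumes that the "Euclidean integration function" is a plain, unweighted Euclidean norm over the per-dimension impairment estimates; the book may define it differently (for example with weights or scaling). The function name, dimension names, and values below are hypothetical and chosen only for illustration.

```python
import math


def total_impairment(dimension_impairments):
    """Combine per-dimension impairment estimates into one value.

    Assumption: an unweighted Euclidean norm, sqrt(sum(I_d ** 2)).
    The exact integration function used in the book is not given
    in the synopsis.
    """
    return math.sqrt(sum(i ** 2 for i in dimension_impairments.values()))


# Hypothetical dimension names and impairment values (not from the book).
estimates = {"dimension_a": 3.0, "dimension_b": 4.0, "dimension_c": 12.0}

print(total_impairment(estimates))  # 13.0
```

Under this assumption, the largest single impairment dominates the total, and several small impairments add up to less than their arithmetic sum.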
✦ Table of Contents
Front Matter (pages i-xii)
Introduction (pages 1-3)
A Dimension-Based Approach to Mouth-to-Ear Speech Transmission Quality (pages 5-61)
Quality Feature Space of Transmitted Speech (pages 63-93)
Direct Scaling of Speech Quality Dimensions (pages 95-113)
Instrumental Dimension-Based Speech Quality Modeling (pages 115-162)
Conclusions and Future Work (pages 163-167)
Back Matter (pages 169-203)
✦ Subjects
Communications Engineering, Networks; Information Systems and Communication Service; Acoustics
SIMILAR VOLUMES
This book presents how to apply recent machine learning (deep learning) methods for the task of speech quality prediction. The author shows how recent advancements in machine learning can be leveraged for the task of speech quality prediction and provides an in-depth analysis of the suitabilit
This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency a
This book provides an in-depth investigation of the quality relevant perceptual video space in the domain of videotelephony. The author presents an extensive investigation and quality modeling of the underlying video quality dimensions and the overall quality. The author examines the underlyin
The volume addresses issues concerning prosody generation in speech synthesis, including prosody modeling, how we can convey para- and non-linguistic information in speech synthesis, and prosody control in speech synthesis (including prosody conversions). A high level of quality has already be