<span>This book evaluates the impact of relevant factors affecting the results of speech quality assessment studies carried out in crowdsourcing. The author describes how these factors relate to the test structure, the effect of environmental background noise, and the influence of language differenc
Quality of Synthetic Speech: Perceptual Dimensions, Influencing Factors, and Instrumental Assessment
β Scribed by Florian Hinterleitner (auth.)
- Publisher
- Springer Singapore
- Year
- 2017
- Tongue
- English
- Leaves
- 170
- Series
- T-Labs Series in Telecommunication Services
- Edition
- 1
- Category
- Library
No coin nor oath required. For personal study only.
β¦ Synopsis
This book reviews research towards perceptual quality dimensions of synthetic speech, compares these findings with the state of the art, and derives a set of five universal perceptual quality dimensions for TTS signals. They are: (i) naturalness of voice, (ii) prosodic quality, (iii) fluency and intelligibility, (iv) absence of disturbances, and (v) calmness. Moreover, a test protocol for the efficient indentification of those dimensions in a listening test is introduced. Furthermore, several factors influencing these dimensions are examined. In addition, different techniques for the instrumental quality assessment of TTS signals are introduced, reviewed and tested. Finally, the requirements for the integration of an instrumental quality measure into a concatenative TTS system are examined.
β¦ Table of Contents
Front Matter....Pages i-xvi
Introduction....Pages 1-4
Speech Synthesis....Pages 5-18
Auditory and Instrumental Quality Evaluation Metrics....Pages 19-36
Perceptual Quality Dimensions....Pages 37-67
Influencing Factors on Perceptual Quality....Pages 69-100
Instrumental Quality Assessment....Pages 101-124
Requirements for the Integration of an Instrumental Quality Measure into a Concatenative TTS System....Pages 125-138
Conclusions and Future Work....Pages 139-145
Back Matter....Pages 147-157
β¦ Subjects
Signal, Image and Speech Processing;User Interfaces and Human Computer Interaction
π SIMILAR VOLUMES
This book evaluates the impact of relevant factors affecting the results of speech quality assessment studies carried out in crowdsourcing. The author describes how these factors relate to the test structure, the effect of environmental background noise, and the influence of language differences. He
<p><STRONG><P>Foundations of Voice and Speech Quality Perception starts out with the fundamental question of: "How do listeners perceive voice and speech quality and how can these processes be modeled?" Any quantitative answers require measurements. This is natural for physical quantities but harder
Voice and Speech Quality Perception starts out with the fundamental question of: "How do listeners perceive voice and speech quality and how can these processes be modeled?" Any quantitative answers require measurements. This is natural for physical quantities but harder to imagine for perceptual me
Foundations of Voice and Speech Quality Perception starts out with the fundamental question of: "How do listeners perceive voice and speech quality and how can these processes be modeled?" Any quantitative answers require measurements. This is natural for physical quantities but harder to imagine fo
Finally a comprehensive overview of speech quality in VoIP from the userβs perspective!Speech Quality of VoIP is an essential guide to assessing the speech quality of VoIP networks, whilst addressing the implications for the design of VoIP networks and systems. This book bridges the gap between the