Computing Prosody: Computational Models for Processing Spontaneous Speech
Scribed by D. R. Ladd (auth.), Yoshinori Sagisaka, Nick Campbell, Norio Higuchi (eds.)
- Publisher
- Springer-Verlag New York
- Year
- 1997
- Tongue
- English
- Leaves
- 399
- Edition
- 1
- Category
- Library
Synopsis
This book presents a collection of papers from the Spring 1995 Workshop on Computational Approaches to Processing the Prosody of Spontaneous Speech, hosted by the ATR Interpreting Telecommunications Research Laboratories in Kyoto, Japan. The workshop brought together leading researchers in the fields of speech and signal processing, electrical engineering, psychology, and linguistics, to discuss aspects of spontaneous speech prosody and to suggest approaches to its computational analysis and modelling. The book is divided into four sections. Part I gives an overview and theoretical background to the nature of spontaneous speech, differentiating it from the lab-speech that has been the focus of so many earlier analyses. Part II focuses on the prosodic features of discourse and the structure of the spoken message, Part III on the generation and modelling of prosody for computer speech synthesis. Part IV discusses how prosodic information can be used in the context of automatic speech recognition. Each section of the book starts with an invited overview paper to situate the chapters in the context of current research. We feel that this collection of papers offers interesting insights into the scope and nature of the problems concerned with the computational analysis and modelling of real spontaneous speech, and expect that these works will not only form the basis of further developments in each field but also merge to form an integrated computational model of prosody for a better understanding of human processing of the complex interactions of the speech chain.
Table of Contents
Front Matter....Pages i-xvii
Front Matter....Pages 1-1
Introduction to Part I....Pages 3-6
A Typology of Spontaneous Speech....Pages 7-26
Prosody, Models, and Spontaneous Speech....Pages 27-42
On the Analysis of Prosody in Interaction....Pages 43-59
Front Matter....Pages 61-61
Introduction to Part II....Pages 63-66
Integrating Prosodic and Discourse Modelling....Pages 67-80
Prosodic Features of Utterances in Task-Oriented Dialogues....Pages 81-93
Variation of Accent Prominence within the Phrase: Models and Spontaneous Speech Data....Pages 95-111
Predicting the Intonation of Discourse Segments from Examples in Dialogue Speech....Pages 117-128
Effects of Focus on Duration and Vowel Formant Frequency in Japanese....Pages 129-153
Front Matter....Pages 155-155
Introduction to Part III....Pages 157-164
Synthesizing Spontaneous Speech....Pages 165-186
Modelling Prosody in Spontaneous Speech....Pages 187-210
Comparison of F0 Control Rules Derived from Multiple Speech Databases....Pages 211-223
Segmental Duration and Speech Timing....Pages 225-248
Measuring temporal compensation effect in speech perception....Pages 251-270
Prediction of Major Phrase Boundary Location and Pause Insertion Using a Stochastic Context-free Grammar....Pages 271-283
Front Matter....Pages 285-285
Introduction to Part IV....Pages 287-290
A Multi-level Model for Recognition of Intonation Labels....Pages 291-308
Training Prosody-Syntax Recognition Models without Prosodic Labels....Pages 309-325
Disambiguating Recognition Results by Prosodic Features....Pages 327-342
Accent Phrase Segmentation by F0 Clustering Using Superpositional Modelling....Pages 343-359
Prosodic Modules for Speech Recognition and Understanding in VERBMOBIL....Pages 361-382
Back Matter....Pages 383-401
Subjects
Signal, Image and Speech Processing;Phonology;Acoustics;Visualization
SIMILAR VOLUMES
This volume presents an interdisciplinary approach to the study of second language prosody and computer modeling. It addresses the importance of prosody's role in communication, bridging the gap between applied linguistics and computer science. The book illustrates the growing importance o
Cognitive Models of Speech Processing presents extensive reviews of current thinking on psycholinguistic and computational topics in speech recognition and natural-language processing, along with a substantial body of new experimental data and computat
The volume addresses issues concerning prosody generation in speech synthesis, including prosody modeling, how we can convey para- and non-linguistic information in speech synthesis, and prosody control in speech synthesis (including prosody conversions). A high level of quality has already be
Human Computer Interaction is the study of relationships among people and computers. As the digital world is getting multi-modal, the information space is getting more and more complex. In order to navigate this information space and to capture and apply this information to appropriate use, an ef