Computing Prosody: Computational Models for Processing Spontaneous Speech

✍ Scribed by D. R. Ladd (auth.), Yoshinori Sagisaka, Nick Campbell, Norio Higuchi (eds.)

Publisher: Springer-Verlag New York
Year: 1997
Tongue: English
Leaves: 399
Edition: 1
Category: Library

No coin nor oath required. For personal study only.

✦ Synopsis

This book presents a collection of papers from the Spring 1995 Work shop on Computational Approaches to Processing the Prosody of Spon taneous Speech, hosted by the ATR Interpreting Telecommunications Re search Laboratories in Kyoto, Japan. The workshop brought together lead ing researchers in the fields of speech and signal processing, electrical en gineering, psychology, and linguistics, to discuss aspects of spontaneous speech prosody and to suggest approaches to its computational analysis and modelling. The book is divided into four sections. Part I gives an overview and theoretical background to the nature of spontaneous speech, differentiating it from the lab-speech that has been the focus of so many earlier analyses. Part II focuses on the prosodic features of discourse and the structure of the spoken message, Part ilIon the generation and modelling of prosody for computer speech synthesis. Part IV discusses how prosodic information can be used in the context of automatic speech recognition. Each section of the book starts with an invited overview paper to situate the chapters in the context of current research. We feel that this collection of papers offers interesting insights into the scope and nature of the problems concerned with the computational analysis and modelling of real spontaneous speech, and expect that these works will not only form the basis of further developments in each field but also merge to form an integrated computational model of prosody for a better understanding of human processing of the complex interactions of the speech chain.

✦ Table of Contents

Front Matter....Pages i-xvii
Front Matter....Pages 1-1
Introduction to Part I....Pages 3-6
A Typology of Spontaneous Speech....Pages 7-26
Prosody, Models, and Spontaneous Speech....Pages 27-42
On the Analysis of Prosody in Interaction....Pages 43-59
Front Matter....Pages 61-61
Introduction to Part II....Pages 63-66
Integrating Prosodie and Discourse Modelling....Pages 67-80
Prosodic Features of Utterances in Task-Oriented Dialogues....Pages 81-93
Variation of Accent Prominence within the Phrase: Models and Spontaneous Speech Data....Pages 95-111
Predicting the Intonation of Discourse Segments from Examples in Dialogue Speech....Pages 117-128
Effects of Focus on Duration and Vowel Formant Frequency in Japanese....Pages 129-153
Front Matter....Pages 155-155
Introduction to Part III....Pages 157-164
Synthesizing Spontaneous Speech....Pages 165-186
Modelling Prosody in Spontaneous Speech....Pages 187-210
Comparison of F 0 Control Rules Derived from Multiple Speech Databases....Pages 211-223
Segmental Duration and Speech Timing....Pages 225-248
Measuring temporal compensation effect in speech perception....Pages 251-270
Prediction of Major Phrase Boundary Location and Pause Insertion Using a Stochastic Context-free Grammar....Pages 271-283
Front Matter....Pages 285-285
Introduction to Part IV....Pages 287-290
A Multi-level Model for Recognition of Intonation Labels....Pages 291-308
Training Prosody-Syntax Recognition Models without Prosodic Labels....Pages 309-325
Front Matter....Pages 285-285
Disambiguating Recognition Results by Prosodic Features....Pages 327-342
Accent Phrase Segmentation by F 0 Clustering Using Superpositional Modelling....Pages 343-359
Prosodic Modules for Speech Recognition and Understanding in VERBMOBIL....Pages 361-382
Back Matter....Pages 383-401

✦ Subjects

Signal, Image and Speech Processing;Phonology;Acoustics;Visualization

📜 SIMILAR VOLUMES

Computational Models of Speech Pattern P

📁 Computational Models of Speech Pattern Processing

✍ Roger K. Moore (auth.), Keith Ponting (eds.) 📂 Library 📅 1999 🏛 Springer-Verlag Berlin Heidelberg 🌐 English

This high-level collection of invited tutorial papers and contributed papers is based on a NATO workshop held in 1997. It surveys and discusses the latest techniques in the field of speech science and technology with a view to working toward a unifying theory of speech pattern processing. The tutori

Second Language Prosody and Computer Mod

📁 Second Language Prosody and Computer Modeling

✍ Okim Kang, David O. Johnson, Alyssa Kermad 📂 Library 📅 2021 🏛 Routledge 🌐 English

This volume presents an interdisciplinary approach to the study of second language prosody and computer modeling. It addresses the importance of prosody’s role in communication, bridging the gap between applied linguistics and computer science. The book illustrates the growing importance o

Cognitive Models of Speech Processing: P

📁 Cognitive Models of Speech Processing: Psycholinguistic and Computational Perspectives

✍ Gerry T. M. Altmann 📂 Library 📅 1995 🏛 MIT Press 🌐 English

Cognitive Models of Speech Processing presents extensive reviews of current thinking on psycholinguistic and computational topics in speech recognition and natural-language processing, along with a substantial body of new experimental data and compu

Cognitive Models of Speech Processing: P

📁 Cognitive Models of Speech Processing: Psycholinguistic and Computational Perspectives

✍ Gerry T. M. Altmann 📂 Library 📅 1995 🏛 A Bradford Book 🌐 English

Speech Prosody in Speech Synthesis: Mode

📁 Speech Prosody in Speech Synthesis: Modeling and generation of prosody for high quality and flexible speech synthesis

✍ Keikichi Hirose, Jianhua Tao (eds.) 📂 Library 📅 2015 🏛 Springer-Verlag Berlin Heidelberg 🌐 English

The volume addresses issues concerning prosody generation in speech synthesis, including prosody modeling, how we can convey para- and non-linguistic information in speech synthesis, and prosody control in speech synthesis (including prosody conversions). A high level of quality has already be

Speech, Image and Language Processing fo

📁 Speech, Image and Language Processing for Human Computer Interaction: Multi-Modal Advancements

✍ Uma Shanker Tiwary, Uma Shanker Tiwary, Tanveer J. Siddiqui 📂 Library 📅 2012 🏛 IGI Global 🌐 English

Human Computer Interaction is the study of relationships among people and computers. As the digital world is getting multi-modal, the information space is getting more and more complex. In order to navigate this information space and to capture and apply this information to appropriate use, an ef