In order to formalize the information used in spectrogram reading, a knowledge-based system for identifying spoken stop consonants was developed. Speech spectrogram reading involves interpreting the acoustic patterns in the image to determine the spoken utterance. One must selectively attend to many
The mu + system for corpus based speech research
β Scribed by J. Harrington; S. Cassidy; J. Fletcher; A. Mc Veigh
- Publisher
- Elsevier Science
- Year
- 1993
- Tongue
- English
- Weight
- 817 KB
- Volume
- 7
- Category
- Article
- ISSN
- 0885-2308
No coin nor oath required. For personal study only.
β¦ Synopsis
mu + is a system for corpus based speech research that can be used to retrieve and analyse segments and their associated signal files from a large speech corpus. The segments can occur at many different levels (acoustic-phonetic, phonemic, intonational, prosodic), while the signal files can include the acoustic speech waveform, analysis parameters derived from the speech waveform (e.g. formant frequencies), and various articulatory measurements (e.g. kinematic parameters from lip and jaw movement). Most combinations of segment types, together with their boundary times and the speech signal files with which they are associated, can be retrieved hierarchically (all phonemes that occur in certain words), sequentially (all phonemes that occur in a particular triphone) or hierarchically and sequentially (e.g. all phonemes that occur in content words which are preceded by an intonational phrase of a particular type). The segments and their associated signal files that are retrieved from the speech database can be analysed subsequently using a wide range of statistical primitives and digital-signal-processing routines. The system has been developed to provide a common environment for experimentation in numerous facets of corpus based speech and language research including: articulatory and acoustic phonetics, prosodic analysis, speech technology research, and linguistic corpus development.
π SIMILAR VOLUMES
The role of the spouse in relation to the quality of esophageal speech of laryngectornized patients was investigated using psychodiagnostic instruments and speech ratings and through an assessment of the verbal interactions between the patient and spouse. Analysis of the results indicates that those
This paper proposes an improved maximum model distance (IMMD) approach for HMM-based speech recognition systems based on our previous work [S. Kwong, Q.H. He, K.F. Man, K.S. Tang. A maximum model distance approach for HMM-based speech recognition, Pattern Recognition 31 (3) (1998) 219}229]. It de"ne
The last decade bears witness to an exponential growth in the use of the World Wide Web. As a result, a huge number of documents are accessible online through search engines, whose patternmatching capabilities have turned out to be useful for mining the Web space as a particular kind of linguistic c