𝔖 Scriptorium
✦   LIBER   ✦

πŸ“

Canonical correlation analysis in speech enhancement

✍ Scribed by Benesty, Jacob; Cohen, Israel


Publisher
Springer
Year
2018
Tongue
English
Leaves
124
Series
SpringerBriefs in electrical and computer engineering
Category
Library

⬇  Acquire This Volume

No coin nor oath required. For personal study only.

✦ Synopsis


This book focuses on the application of canonical correlation analysis (CCA) to speech enhancement using the filtering approach. The authors explain how to derive different classes of time-domain and time-frequency-domain noise reduction filters, which are optimal from the CCA perspective for both single-channel and multichannel speech enhancement. Enhancement of noisy speech has been a challenging problem for many researchers over the past few decades and remains an active research area. Typically, speech enhancement algorithms operate in the short-time Fourier transform (STFT) domain, where the clean speech spectral coefficients are estimated using a multiplicative gain function. A filtering approach, which can be performed in the time domain or in the subband domain, obtains an estimate of the clean speech sample at every time instant or time-frequency bin by applying a filtering vector to the noisy speech vector.

Compared to the multiplicative gain approach, the filtering approach more naturally takes into account the correlation of the speech signal in adjacent time frames. In this study, the authors pursue the filtering approach and show how to apply CCA to the speech enhancement problem. They also address the problem of adaptive beamforming from the CCA perspective, and show that the well-known Wiener and minimum variance distortionless response (MVDR) beamformers are particular cases of a general class of CCA-based adaptive beamformers.

✦ Table of Contents


Front Matter ....Pages i-ix
Introduction (Jacob Benesty, Israel Cohen)....Pages 1-3
Canonical Correlation Analysis (Jacob Benesty, Israel Cohen)....Pages 5-14
Single-Channel Speech Enhancement in the Time Domain (Jacob Benesty, Israel Cohen)....Pages 15-35
Single-Channel Speech Enhancement in the STFT Domain (Jacob Benesty, Israel Cohen)....Pages 37-57
Multichannel Speech Enhancement in the Time Domain (Jacob Benesty, Israel Cohen)....Pages 59-77
Multichannel Speech Enhancement in the STFT Domain (Jacob Benesty, Israel Cohen)....Pages 79-101
Adaptive Beamforming (Jacob Benesty, Israel Cohen)....Pages 103-117
Back Matter ....Pages 119-121

✦ Subjects


Speech processing systems;COMPUTERS / General


πŸ“œ SIMILAR VOLUMES


Canonical Correlation Analysis: Uses and
✍ Thompson B. πŸ“‚ Library πŸ“… 1984 πŸ› Sage 🌐 English

Recent advances in statistical methodology and computer automation are making canonical correlation analysis available to more and more researchers. This volume explains the basic features of this sophisticated technique in an essentially non-mathematical introduction that presents numerous examples

Canonical correlation analysis: uses and
✍ Bruce Thompson πŸ“‚ Library πŸ“… 1984 πŸ› SAGE 🌐 English

Recent advances in statistical methodology and computer automation are making canonical correlation analysis available to more and more researchers. This volume explains the basic features of this sophisticated technique in an essentially non-mathematical introduction that presents numerous examples

Speech Enhancement
✍ Prof. Dr. Jacob Benesty, Shoji Makino, Jingdong Chen (auth.) πŸ“‚ Library πŸ“… 2005 πŸ› Springer-Verlag Berlin Heidelberg 🌐 English

<p><P>We live in a noisy world! In all applications (telecommunications, hands-free communications, recording, human-machine interfaces, etc) that require at least one microphone, the signal of interest is usually contaminated by noise and reverberation. As a result, the microphone signal has to be

Preliminaries to Speech Analysis: The Di
✍ Roman Jakobson, C. Gunnar M. Fant, Morris Halle πŸ“‚ Library πŸ“… 1961 🌐 English

This work attempts to describes the ultimate discrete components of language, their specific structure, and their articulatory, acoustic, and perceptual correlates, and surveys their utilization in the language of the world. First published in 1951,

Speech Enhancement in the STFT Domain
✍ Jacob Benesty, Jingdong Chen, EmanuΓ«l A.P. Habets (auth.) πŸ“‚ Library πŸ“… 2012 πŸ› Springer-Verlag Berlin Heidelberg 🌐 English

<p>This work addresses this problem in the short-time Fourier transform (STFT) domain. We divide the general problem into five basic categories depending on the number of microphones being used and whether the interframe or interband correlation is considered. The first category deals with the singl