A frequency-domain-based delay estimator is described, designed specifically for speech signals in a microphone-array environment. It is shown to be capable of obtaining precision delay estimates over a wide range of signal-to-noise ratio conditions and is computationally simple enough to make it pr
A practical methodology for speech source localization with microphone arrays
โ Scribed by Michael S. Brandstein; Harvey F. Silverman
- Publisher
- Elsevier Science
- Year
- 1997
- Tongue
- English
- Weight
- 545 KB
- Volume
- 11
- Category
- Article
- ISSN
- 0885-2308
No coin nor oath required. For personal study only.
โฆ Synopsis
Electronically steerable arrays of microphones have a variety of uses in speech data acquisition systems. Applications include teleconferencing, speech recognition and speaker identification, sound capture in adverse environments, and biomedical devices for the hearing impaired. An array of microphones has a number of advantages over a single-microphone system. It may be electronically aimed to provide a high-quality signal from a desired source location while simultaneously attenuating interfering talkers and ambient noise, does not necessitate local placement of transducers or encumber the talker with a hand-held or head-mounted microphone, and does not require physical movement to alter its direction of reception. Additionally, it has capabilities that a single microphone does not; namely automatic detection, localization and tracking of active talkers in its receptive area. This paper addresses the specific application of source localization algorithms for estimating the position of speech sources in a real-room environment given limited computational resources. The theoretical foundations of a speech source localization system are presented. This includes the development of a source-sensor geometry for talkers and sensors in the near-field environment as well as the evaluation of several error criteria available to the problem. Several practical algorithms necessary for real-time implementation are developed, specifically the derivation and evaluation of an appropriate time-delay estimator and a novel closedform locator. Finally, results obtained from a real system are presented to illustrate the effectiveness of the proposed source localization techniques as well as to confirm the practicality of the theoretical models.
๐ SIMILAR VOLUMES