✦ LIBER ✦

Improving videophone subjective quality using audio information

✍ Scribed by A. Vahedian; J. Arnold; M. Frater; M. Cavenor; L. Godara

Publisher: John Wiley and Sons
Year: 1999
Tongue: English
Weight: 244 KB
Volume: 10
Category: Article
ISSN: 0899-9457
DOI: 10.1002/(sici)1098-1098(1999)10:1<86::aid-ima10>3.0.co;2-b

✦ Synopsis

This article presents a new technique that uses audio information to achieve more efficient video coding for videophone and videoconferencing applications. The direction of arrival of the audio signal at an array of microphones is used to estimate the position of the speaker's lips, so that the quality of the video reconstruction can be enhanced in this crucial area. Once this estimate is available, a two- or three-stage quantization strategy is applied to the video information, so that the subjectively more important parts, i.e., the lips and the face of the speaker, are compressed with lower distortion. Algorithms for audio source location using the speech signals received at an array of microphones form an important part of our approach. The proposed technique is compatible with all existing video compression standards and is much easier to implement than previously proposed techniques.
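
The pipeline outlined in the synopsis can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the synopsis gives no implementation details, so everything below is an assumption made for illustration. The sketch assumes a two-microphone, far-field model with a brute-force cross-correlation delay estimate, a pinhole camera centred on the array (so the audio angle maps only to a horizontal macroblock column; the lip row must come from elsewhere), and hypothetical quantizer steps of 4, 8 and 16 for the lips, face and background. All function names and parameters are invented here.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s; assumed room-temperature value


def estimate_delay(x, y, fs, max_lag):
    """Delay of y relative to x, in seconds, from the cross-correlation peak."""
    n = len(x)
    best_lag, best_val = 0, float("-inf")
    for lag in range(-max_lag, max_lag + 1):
        val = sum(x[i] * y[i + lag] for i in range(n) if 0 <= i + lag < n)
        if val > best_val:
            best_lag, best_val = lag, val
    return best_lag / fs


def doa_from_delay(delay_s, mic_spacing_m):
    """Far-field arrival angle in radians, measured from the array broadside."""
    s = SPEED_OF_SOUND * delay_s / mic_spacing_m
    s = max(-1.0, min(1.0, s))  # clamp against noisy delay estimates
    return math.asin(s)


def angle_to_column(theta, cols, horizontal_fov):
    """Map an arrival angle to a macroblock column (assumed pinhole camera)."""
    u = 0.5 * (1.0 + math.tan(theta) / math.tan(horizontal_fov / 2.0))
    u = max(0.0, min(1.0, u))
    return min(cols - 1, int(u * cols))


def quantizer_map(cols, rows, lips_mb, lips_radius=1, face_radius=3,
                  steps=(4, 8, 16)):
    """Three-stage map: fine step at the lips, medium on the face, coarse elsewhere."""
    lx, ly = lips_mb
    fine, medium, coarse = steps
    qmap = []
    for r in range(rows):
        row = []
        for c in range(cols):
            dist = max(abs(c - lx), abs(r - ly))  # distance in macroblocks
            if dist <= lips_radius:
                row.append(fine)
            elif dist <= face_radius:
                row.append(medium)
            else:
                row.append(coarse)
        qmap.append(row)
    return qmap


if __name__ == "__main__":
    fs = 16000
    x = [0.0] * 20
    y = [0.0] * 20
    x[5], x[6] = 1.0, 0.5
    y[8], y[9] = 1.0, 0.5  # same pulse, three samples later
    theta = doa_from_delay(estimate_delay(x, y, fs, max_lag=5), mic_spacing_m=0.2)
    col = angle_to_column(theta, cols=11, horizontal_fov=math.radians(60))
    for row in quantizer_map(11, 9, (col, 4)):  # row 4 is an assumed lip row
        print(row)
```

Dropping the middle value of `steps` (or setting `face_radius` equal to `lips_radius`) reduces this to a two-stage variant. Because only the quantizer step per macroblock changes, a scheme of this kind leaves the bitstream syntax untouched, which is consistent with the synopsis's claim of compatibility with existing standards.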