Speech-assisted lip synchronization in audio-visual communications

Tsuhan Chen, Hans Peter Graf, Barry Haskell, Eric Petajan, Yao Wang, Homer Chen, Wu Chou

Research output: Contribution to conferencePaper

Abstract

We utilize speech information to improve the quality of audio-visual communications such as video telephony and videoconferencing. We show that the marriage of speech analysis and image processing can solve problems related to lip synchronization. We present a technique called speech-assisted frame-rate conversion, and apply it to coding of talking head video. Demonstration sequences are presented. Extensions and other applications are outlined.

Original languageEnglish (US)
Pages579-582
Number of pages4
StatePublished - 1996
EventProceedings of the 1995 IEEE International Conference on Image Processing. Part 3 (of 3) - Washington, DC, USA
Duration: Oct 23 1995Oct 26 1995

Other

OtherProceedings of the 1995 IEEE International Conference on Image Processing. Part 3 (of 3)
CityWashington, DC, USA
Period10/23/9510/26/95

ASJC Scopus subject areas

  • Hardware and Architecture
  • Computer Vision and Pattern Recognition
  • Electrical and Electronic Engineering

Fingerprint Dive into the research topics of 'Speech-assisted lip synchronization in audio-visual communications'. Together they form a unique fingerprint.

  • Cite this

    Chen, T., Graf, H. P., Haskell, B., Petajan, E., Wang, Y., Chen, H., & Chou, W. (1996). Speech-assisted lip synchronization in audio-visual communications. 579-582. Paper presented at Proceedings of the 1995 IEEE International Conference on Image Processing. Part 3 (of 3), Washington, DC, USA, .