Abstract
We utilize speech information to improve the quality of audio-visual communications such as video telephony and videoconferencing. We show that the marriage of speech analysis and image processing can solve problems related to lip synchronization. We present a technique called speech-assisted frame-rate conversion, and apply it to coding of talking head video. Demonstration sequences are presented. Extensions and other applications are outlined.
Original language | English (US) |
---|---|
Pages | 579-582 |
Number of pages | 4 |
State | Published - 1996 |
Event | Proceedings of the 1995 IEEE International Conference on Image Processing. Part 3 (of 3) - Washington, DC, USA Duration: Oct 23 1995 → Oct 26 1995 |
Other
Other | Proceedings of the 1995 IEEE International Conference on Image Processing. Part 3 (of 3) |
---|---|
City | Washington, DC, USA |
Period | 10/23/95 → 10/26/95 |
ASJC Scopus subject areas
- Hardware and Architecture
- Computer Vision and Pattern Recognition
- Electrical and Electronic Engineering