Image sequence coding at very low bit rates: a review

H Li, A Lundmark, R Forchheimer - IEEE Transactions on image …, 1994 - ieeexplore.ieee.org
This paper presents a review of promising techniques for very low bit-rate, below 64 kb/s,
image sequence coding. Image sequence coding at such low rates will be a crucial …

[图书][B] Computer facial animation

FI Parke, K Waters - 2008 - books.google.com
This comprehensive work provides the fundamentals of computer facial animation and
brings into sharper focus techniques that are becoming mainstream in the industry. Over the …

Audio-visual integration in multimodal communication

T Chen, RR Rao - Proceedings of the IEEE, 1998 - ieeexplore.ieee.org
We review recent research that examines audio-visual integration in multimodal
communication. The topics include bimodality in human speech, human and automated lip …

Audiovisual speech processing

T Chen - IEEE signal processing magazine, 2001 - ieeexplore.ieee.org
We have reported activities in audiovisual speech processing, with emphasis on lip reading
and lip synchronization. These research results have shown that, with lip reading, it is …

Neural networks for intelligent multimedia processing

SY Kung, JN Hwang - Proceedings of the IEEE, 1998 - ieeexplore.ieee.org
This paper reviews key attributes of neural processing essential to intelligent multimedia
processing (IMP). The objective is to show why neural networks (NNs) are a core technology …

Model-based image coding advanced video coding techniques for very low bit-rate applications

K Aizawa, TS Huang - Proceedings of the IEEE, 1995 - ieeexplore.ieee.org
The paper gives an overview of model-based approaches applied to image coding, by
looking at image source models. In these model-based schemes, which are different from …

A video compression scheme with optimal bit allocation among segmentation, motion, and residual error

GM Schuster, AK Katsaggelos - IEEE Transactions on Image …, 1997 - ieeexplore.ieee.org
We present a theory for the optimal bit allocation among quadtree (QT) segmentation,
displacement vector field (DVF), and displaced frame difference (DFD). The theory is …

Speechreading using probabilistic models

J Luettin, NA Thacker - Computer vision and image understanding, 1997 - Elsevier
We describe a robust method for locating and tracking lips in gray-level image sequences.
Our approach learns patterns of shape variability from a training set which constrains the …

Lip movement synthesis from speech based on Hidden Markov Models

E Yamamoto, S Nakamura, K Shikano - Speech Communication, 1998 - Elsevier
Speech intelligibility can be improved by adding lip images to the speech signal. Thus lip
movement synthesis plays an important role to realize a natural human-like face of computer …

Real-time speech-driven face animation with expressions using neural networks

P Hong, Z Wen, TS Huang - IEEE Transactions on neural …, 2002 - ieeexplore.ieee.org
A real-time speech-driven synthetic talking face provides an effective multimodal
communication interface in distributed collaboration environments. Nonverbal gestures such …