An intelligent facial image coding driven by speech and phoneme

H Li, A Lundmark, R Forchheimer - IEEE Transactions on image …, 1994 - ieeexplore.ieee.org

This paper presents a review of promising techniques for very low bit-rate, below 64 kb/s,
image sequence coding. Image sequence coding at such low rates will be a crucial …

被引用次数：302 相关文章所有 8 个版本

[图书][B] Computer facial animation

FI Parke, K Waters - 2008 - books.google.com

This comprehensive work provides the fundamentals of computer facial animation and
brings into sharper focus techniques that are becoming mainstream in the industry. Over the …

被引用次数：1254 相关文章所有 8 个版本

[PDF] psu.edu

Audio-visual integration in multimodal communication

T Chen, RR Rao - Proceedings of the IEEE, 1998 - ieeexplore.ieee.org

We review recent research that examines audio-visual integration in multimodal
communication. The topics include bimodality in human speech, human and automated lip …

被引用次数：446 相关文章所有 17 个版本

Audiovisual speech processing

T Chen - IEEE signal processing magazine, 2001 - ieeexplore.ieee.org

We have reported activities in audiovisual speech processing, with emphasis on lip reading
and lip synchronization. These research results have shown that, with lip reading, it is …

被引用次数：364 相关文章所有 5 个版本

Neural networks for intelligent multimedia processing

SY Kung, JN Hwang - Proceedings of the IEEE, 1998 - ieeexplore.ieee.org

This paper reviews key attributes of neural processing essential to intelligent multimedia
processing (IMP). The objective is to show why neural networks (NNs) are a core technology …

被引用次数：93 相关文章所有 4 个版本

Model-based image coding advanced video coding techniques for very low bit-rate applications

K Aizawa, TS Huang - Proceedings of the IEEE, 1995 - ieeexplore.ieee.org

The paper gives an overview of model-based approaches applied to image coding, by
looking at image source models. In these model-based schemes, which are different from …

被引用次数：368 相关文章所有 5 个版本

[PDF] researchgate.net

A video compression scheme with optimal bit allocation among segmentation, motion, and residual error

GM Schuster, AK Katsaggelos - IEEE Transactions on Image …, 1997 - ieeexplore.ieee.org

We present a theory for the optimal bit allocation among quadtree (QT) segmentation,
displacement vector field (DVF), and displaced frame difference (DFD). The theory is …

被引用次数：194 相关文章所有 17 个版本

[PDF] epfl.ch

Speechreading using probabilistic models

J Luettin, NA Thacker - Computer vision and image understanding, 1997 - Elsevier

We describe a robust method for locating and tracking lips in gray-level image sequences.
Our approach learns patterns of shape variability from a training set which constrains the …

被引用次数：191 相关文章所有 20 个版本

[PDF] academia.edu

Lip movement synthesis from speech based on Hidden Markov Models

E Yamamoto, S Nakamura, K Shikano - Speech Communication, 1998 - Elsevier

Speech intelligibility can be improved by adding lip images to the speech signal. Thus lip
movement synthesis plays an important role to realize a natural human-like face of computer …

被引用次数：169 相关文章所有 15 个版本

Real-time speech-driven face animation with expressions using neural networks

P Hong, Z Wen, TS Huang - IEEE Transactions on neural …, 2002 - ieeexplore.ieee.org

A real-time speech-driven synthetic talking face provides an effective multimodal
communication interface in distributed collaboration environments. Nonverbal gestures such …

被引用次数：137 相关文章所有 9 个版本