Generative Visual Compression: A Review

B Chen, S Yin, P Chen, S Wang, Y Ye - arXiv preprint arXiv:2402.02140, 2024 - arxiv.org
Artificial Intelligence Generated Content (AIGC) is leading a new technical revolution for the
acquisition of digital content and impelling the progress of visual compression towards …

Generative face video coding techniques and standardization efforts: A review

B Chen, J Chen, S Wang, Y Ye - 2024 Data Compression …, 2024 - ieeexplore.ieee.org
Generative Face Video Coding (GFVC) techniques can exploit the compact representation
of facial priors and the strong inference capability of deep generative models, achieving high …

Preprocessing enhanced image compression for machine vision

G Lu, X Ge, T Zhong, Q Hu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Recently, more and more images are compressed and sent to the back-end devices for
machine analysis tasks (eg, object detection) instead of being purely watched by humans …

Stochastic latent talking face generation towards emotional expressions and head poses

Z Sheng, L Nie, M Zhang, X Chang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Current talking face generation methods have achieved promising lip-synchronization
results, while still struggling to generate talking face video that exhibits emotional …

Enabling translatability of generative face video coding: A unified face feature transcoding framework

S Yin, B Chen, S Wang, Y Ye - 2024 Data Compression …, 2024 - ieeexplore.ieee.org
Generative face video coding (GFVC) can achieve high-quality visual face communication at
ultra-low bit-rate ranges via strong facial prior learning and realistic generation. However …

Peering into The Sketch: Ultra-Low Bitrate Face Compression for Joint Human and Machine Perception

Y Mao, P Chen, S Wang, S Wang, D Wu - Proceedings of the 31st ACM …, 2023 - dl.acm.org
We propose a novel face compression framework that leverages the external priors for joint
human and machine perception under ultra-low bitrate scenarios. The proposed framework …

Conditional residual coding: A remedy for bottleneck problems in conditional inter frame coding

F Brand, J Seiler, A Kaup - … on Circuits and Systems for Video …, 2024 - ieeexplore.ieee.org
Conditional coding is a new video coding paradigm enabled by neural-network-based
compression. It can be shown that conditional coding is in theory better than the traditional …

Ultra-Low Bitrate Face Video Compression Based on Conversions from 3D Keypoints to 2D Motion Map

Z Wang, B Chen, S Wang, S Wang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
How to compress face video is a crucial problem for a series of online applications, such as
video chat/conference, live broadcasting and remote education. Compared to other natural …

Audio-Semantic Enhanced Pose-Driven Talking Head Generation

M Liu, D Li, Y Li, X Song, L Nie - IEEE Transactions on Circuits …, 2024 - ieeexplore.ieee.org
Talking head generation, aiming to create photo-realistic videos from a single reference
image and audio input, has emerged as a vibrant area of interest within the computer vision …

A Semantic-Aware Detail Adaptive Network for Image Enhancement

L Fan, X Wei, M Zhou, J Yan, H Pu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Low-light images often suffer from varying degrees of visual degradation. Current methods
for recovering image texture details fail to rely on the self-adaptive correlation texture …