High efficiency image compression for large visual-language models

B Li, S Wang, S Wang, Y Ye - … on Circuits and Systems for Video …, 2024 - ieeexplore.ieee.org
In recent years, large visual language models (LVLMs) have shown impressive performance
and promising generalization capability in multi-modal tasks, thus replacing humans as …

USTC-TD: A Test Dataset and Benchmark for Image and Video Coding in 2020s

Z Li, J Liao, C Tang, H Zhang, Y Li, Y Bian… - arXiv preprint arXiv …, 2024 - arxiv.org
Image/video coding has been a remarkable research area for both academia and industry
for many years. Testing datasets, especially high-quality image/video datasets are desirable …

IBVC: Interpolation-driven B-frame video compression

C Xu, M Liu, C Yao, W Lin, Y Zhao - Pattern Recognition, 2024 - Elsevier
Learned B-frame video compression aims to adopt bi-directional motion estimation and
motion compensation (MEMC) coding for middle frame reconstruction. However, previous …

NVC-1B: A Large Neural Video Coding Model

X Sheng, C Tang, L Li, D Liu, F Wu - arXiv preprint arXiv:2407.19402, 2024 - arxiv.org
The emerging large models have achieved notable progress in the fields of natural
language processing and computer vision. However, large models for neural video coding …

Rate-aware Compression for NeRF-based Volumetric Video

Z Zhang, G Lu, H Liang, Z Cheng, A Tang… - Proceedings of the 32nd …, 2024 - dl.acm.org
The neural radiance fields (NeRF) have advanced the development of 3D volumetric video
technology, but the large data volumes they involve pose significant challenges for storage …

SMC++: Masked Learning of Unsupervised Video Semantic Compression

Y Tian, G Lu, G Zhai - arXiv preprint arXiv:2406.04765, 2024 - arxiv.org
Most video compression methods focus on human visual perception, neglecting semantic
preservation. This leads to severe semantic loss during the compression, hampering …

[PDF][PDF] Human-Machine Collaborative Image and Video Compression: A Survey

H Li, X Zhang, S Wang, S Wang… - APSIPA Transactions on …, 2024 - nowpublishers.com
Traditional image and video compression methods are designed to maintain the quality of
human visual perception, which makes it necessary to reconstruct the image or video before …

Bi-Directional Deep Contextual Video Compression

X Sheng, L Li, D Liu, S Wang - arXiv preprint arXiv:2408.08604, 2024 - arxiv.org
Deep video compression has made remarkable process in recent years, with the majority of
advancements concentrated on P-frame coding. Although efforts to enhance B-frame coding …

High-Efficiency Neural Video Compression via Hierarchical Predictive Learning

M Lu, Z Duan, W Cong, D Ding, F Zhu, Z Ma - arXiv preprint arXiv …, 2024 - arxiv.org
The enhanced Deep Hierarchical Video Compression-DHVC 2.0-has been introduced. This
single-model neural video codec operates across a broad range of bitrates, delivering not …

Prediction and Reference Quality Adaptation for Learned Video Compression

X Sheng, L Li, D Liu, H Li - arXiv preprint arXiv:2406.14118, 2024 - arxiv.org
Temporal prediction is one of the most important technologies for video compression.
Various prediction coding modes are designed in traditional video codecs. Traditional video …