Non-semantics suppressed mask learning for unsupervised video semantic compression

Y Tian, G Lu, G Zhai, Z Gao - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Most video compression methods aim to improve the decoded video visual quality, instead
of particularly guaranteeing the semantic-completeness, which deteriorates downstream …

A coding framework and benchmark towards low-bitrate video understanding

Y Tian, G Lu, Y Yan, G Zhai, L Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Video compression is indispensable to most video analysis systems. Despite saving the
transportation bandwidth, it also deteriorates downstream video understanding tasks …

Semantically structured image compression via irregular group-based decoupling

R Feng, Y Gao, X Jin, R Feng… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Image compression techniques typically focus on compressing rectangular images for
human consumption, however, resulting in transmitting redundant content for downstream …

Preprocessing enhanced image compression for machine vision

G Lu, X Ge, T Zhong, J Geng, Q Hu - arXiv preprint arXiv:2206.05650, 2022 - arxiv.org
Recently, more and more images are compressed and sent to the back-end devices for the
machine analysis tasks~(\textit {eg,} object detection) instead of being purely watched by …

Prompt-icm: A unified framework towards image coding for machines with task-driven prompts

R Feng, J Liu, X Jin, X Pan, H Sun, Z Chen - arXiv preprint arXiv …, 2023 - arxiv.org
Image coding for machines (ICM) aims to compress images to support downstream AI
analysis instead of human perception. For ICM, developing a unified codec to reduce …

Task-Aware Encoder Control for Deep Video Compression

X Ge, J Luo, X Zhang, T Xu, G Lu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Prior research on deep video compression (DVC) for machine tasks typically necessitates
training a unique codec for each specific task mandating a dedicated decoder per task. In …

Meta clustering learning for large-scale unsupervised person re-identification

X Jin, T He, X Shen, T Liu, X Wang, J Huang… - Proceedings of the 30th …, 2022 - dl.acm.org
Unsupervised Person Re-identification (U-ReID) with pseudo labeling recently reaches a
competitive performance compared to fully-supervised ReID methods based on modern …

Scalable Image Coding for Humans and Machines Using Feature Fusion Network

T Shindo, T Watanabe, Y Tatsumi… - arXiv preprint arXiv …, 2024 - arxiv.org
As image recognition models become more prevalent, scalable coding methods for
machines and humans gain more importance. Applications of image recognition models …

Composable Image Coding for Machine via Task-oriented Internal Adaptor and External Prior

J Liu, X Jin, R Feng, Z Chen… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Traditional image coding standards are typically optimized with a focus on human
perception, which conflicts with the fact that most of the images are now analyzed by …

Image Coding for Machines with Object Region Learning

T Shindo, T Watanabe, K Yamada… - 2024 IEEE 21st …, 2024 - ieeexplore.ieee.org
Compression technology is essential for efficient image transmission and storage. With the
rapid advances in deep learning, images are beginning to be used for image recognition as …