SemFlow: Binding Semantic Segmentation and Image Synthesis via Rectified Flow

C Wang, X Li, L Qi, H Ding, Y Tong… - arXiv preprint arXiv …, 2024 - arxiv.org
Semantic segmentation and semantic image synthesis are two representative tasks in visual
perception and generation. While existing methods consider them as two distinct tasks, we …

A Unified Framework for 3D Scene Understanding

W Xu, C Shi, S Tu, X Zhou, D Liang, X Bai - arXiv preprint arXiv …, 2024 - arxiv.org
We propose UniSeg3D, a unified 3D segmentation framework that achieves panoptic,
semantic, instance, interactive, referring, and open-vocabulary semantic segmentation tasks …

ControlVAR: Exploring Controllable Visual Autoregressive Modeling

X Li, K Qiu, H Chen, J Kuen, Z Lin, R Singh… - arXiv preprint arXiv …, 2024 - arxiv.org
Conditional visual generation has witnessed remarkable progress with the advent of
diffusion models (DMs), especially in tasks like control-to-image generation. However …

Image Segmentation in Foundation Model Era: A Survey

T Zhou, F Zhang, B Chang, W Wang, Y Yuan… - arXiv preprint arXiv …, 2024 - arxiv.org
Image segmentation is a long-standing challenge in computer vision, studied continuously
over several decades, as evidenced by seminal algorithms such as N-Cut, FCN, and …

No Re-Train, More Gain: Upgrading Backbones with Diffusion Model for Few-Shot Segmentation

S Chen, F Meng, C Wu, H Wei, R Zhang, Q Wu… - arXiv preprint arXiv …, 2024 - arxiv.org
Few-Shot Segmentation (FSS) aims to segment novel classes using only a few annotated
images. Despite considerable progress under pixel-wise support annotation, current FSS …