Tubeformer-deeplab: Video mask transformer

D Kim, J Xie, H Wang, S Qiao, Q Yu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Abstract We present TubeFormer-DeepLab, the first attempt to tackle multiple core video
segmentation tasks in a unified manner. Different video segmentation tasks (eg, video …

Video-kmax: A simple unified approach for online and near-online video panoptic segmentation

I Shin, D Kim, Q Yu, J Xie, HS Kim… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Video Panoptic Segmentation (VPS) aims to achieve comprehensive pixel-level
scene understanding by segmenting all pixels and associating objects in a video. Current …

SLVP: Self-Supervised Language-Video Pre-Training for Referring Video Object Segmentation

J Mei, AJ Piergiovanni… - Proceedings of the …, 2024 - openaccess.thecvf.com
The referring video object segmentation (R-VOS) task requires a model to understand both
referring expression and video input. Most recent works are mainly based on an encoder …

[图书][B] Inferring the 3D Information from the Outside World Using Monocular Cameras

H Zhang - 2022 - search.proquest.com
Technological advances have made autonomous driving more and more feasible in
common driving scenarios. Many large companies such as Waymo, Tesla, GM, and Uber …