KITTI-360: A novel dataset and benchmarks for urban scene understanding in 2D and 3D

Y Liao, J Xie, A Geiger - IEEE Transactions on Pattern Analysis …, 2022 - ieeexplore.ieee.org
For the last few decades, several major subfields of artificial intelligence including computer
vision, graphics, and robotics have progressed largely independently from each other …

MonoScene: Monocular 3D semantic scene completion

AQ Cao, R De Charette - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
MonoScene proposes a 3D Semantic Scene Completion (SSC) framework, where the dense
geometry and semantics of a scene are inferred from a single monocular RGB image …

ResNeSt: Split-attention networks

H Zhang, C Wu, Z Zhang, Y Zhu, H Lin… - Proceedings of the …, 2022 - openaccess.thecvf.com
The ability to learn richer network representations generally boosts the performance of deep
learning models. To improve representation-learning in convolutional neural networks, we …

RobustNet: Improving domain generalization in urban-scene segmentation via instance selective whitening

S Choi, S Jung, H Yun, JT Kim… - Proceedings of the …, 2021 - openaccess.thecvf.com
Enhancing the generalization capability of deep neural networks to unseen domains is
crucial for safety-critical applications in the real world such as autonomous driving. To …

Axial-DeepLab: Stand-alone axial-attention for panoptic segmentation

H Wang, Y Zhu, B Green, H Adam, A Yuille… - European conference on …, 2020 - Springer
Convolution exploits locality for efficiency at a cost of missing long range context. Self-
attention has been adopted to augment CNNs with non-local interactions. Recent works …

Object-contextual representations for semantic segmentation

Y Yuan, X Chen, J Wang - Computer Vision–ECCV 2020: 16th European …, 2020 - Springer
In this paper, we study the context aggregation problem in semantic segmentation.
Motivated by that the label of a pixel is the category of the object that the pixel belongs to, we …

PointPainting: Sequential fusion for 3D object detection

S Vora, AH Lang, B Helou… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Camera and lidar are important sensor modalities for robotics in general and self-driving
cars in particular. The sensors provide complementary information offering an opportunity for …

A dynamic multi-scale voxel flow network for video prediction

X Hu, Z Huang, A Huang, J Xu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
The performance of video prediction has been greatly boosted by advanced deep neural
networks. However, most of the current methods suffer from large model sizes and require …

Panoptic-DeepLab: A simple, strong, and fast baseline for bottom-up panoptic segmentation

B Cheng, MD Collins, Y Zhu, T Liu… - Proceedings of the …, 2020 - openaccess.thecvf.com
In this work, we introduce Panoptic-DeepLab, a simple, strong, and fast system for panoptic
segmentation, aiming to establish a solid baseline for bottom-up methods that can achieve …

DACS: Domain adaptation via cross-domain mixed sampling

W Tranheden, V Olsson, J Pinto… - Proceedings of the …, 2021 - openaccess.thecvf.com
Semantic segmentation models based on convolutional neural networks have recently
displayed remarkable performance for a multitude of applications. However, these models …