A brief survey on semantic segmentation with deep learning

S Hao, Y Zhou, Y Guo - Neurocomputing, 2020 - Elsevier
Semantic segmentation is a challenging task in computer vision. In recent years, the
performance of semantic segmentation has been greatly improved by using deep learning …

A survey on deep learning techniques for image and video semantic segmentation

A Garcia-Garcia, S Orts-Escolano, S Oprea… - Applied Soft …, 2018 - Elsevier
Image semantic segmentation is more and more being of interest for computer vision and
machine learning researchers. Many applications on the rise need accurate and efficient …

Google scanned objects: A high-quality dataset of 3d scanned household items

L Downs, A Francis, N Koenig, B Kinman… - … on Robotics and …, 2022 - ieeexplore.ieee.org
Interactive 3D simulations have enabled break-throughs in robotics and computer vision, but
simulating the broad diversity of environments needed for deep learning requires large …

Adabins: Depth estimation using adaptive bins

SF Bhat, I Alhashim, P Wonka - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We address the problem of estimating a high quality dense depth map from a single RGB
input image. We start out with a baseline encoder-decoder convolutional neural network …

Depthformer: Exploiting long-range correlation and local information for accurate monocular depth estimation

Z Li, Z Chen, X Liu, J Jiang - Machine Intelligence Research, 2023 - Springer
This paper aims to address the problem of supervised monocular depth estimation. We start
with a meticulous pilot study to demonstrate that the long-range correlation is essential for …

Pointcontrast: Unsupervised pre-training for 3d point cloud understanding

S Xie, J Gu, D Guo, CR Qi, L Guibas… - Computer Vision–ECCV …, 2020 - Springer
Arguably one of the top success stories of deep learning is transfer learning. The finding that
pre-training a network on a rich source set (eg, ImageNet) can help boost performance once …

Self-supervised pretraining of 3d features on any point-cloud

Z Zhang, R Girdhar, A Joulin… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Pretraining on large labeled datasets is a prerequisite to achieve good performance in many
computer vision tasks like image recognition, video understanding etc. However, pretraining …

Exploring data-efficient 3d scene understanding with contrastive scene contexts

J Hou, B Graham, M Nießner… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
The rapid progress in 3D scene understanding has come with growing demand for data;
however, collecting and annotating 3D scenes (eg point clouds) are notoriously hard. For …

UrbanLF: A comprehensive light field dataset for semantic segmentation of urban scenes

H Sheng, R Cong, D Yang, R Chen… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
As one of the fundamental technologies for scene understanding, semantic segmentation
has been widely explored in the last few years. Light field cameras encode the geometric …

Imvotenet: Boosting 3d object detection in point clouds with image votes

CR Qi, X Chen, O Litany… - Proceedings of the IEEE …, 2020 - openaccess.thecvf.com
Abstract 3D object detection has seen quick progress thanks to advances in deep learning
on point clouds. A few recent works have even shown state-of-the-art performance with just …