Depth anything: Unleashing the power of large-scale unlabeled data

L Yang, B Kang, Z Huang, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract This work presents Depth Anything a highly practical solution for robust monocular
depth estimation. Without pursuing novel technical modules we aim to build a simple yet …

Unleashing text-to-image diffusion models for visual perception

W Zhao, Y Rao, Z Liu, B Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Diffusion models (DMs) have become the new trend of generative models and have
demonstrated a powerful ability of conditional synthesis. Among those, text-to-image …

Repurposing diffusion-based image generators for monocular depth estimation

B Ke, A Obukhov, S Huang, N Metzger… - Proceedings of the …, 2024 - openaccess.thecvf.com
Monocular depth estimation is a fundamental computer vision task. Recovering 3D depth
from a single image is geometrically ill-posed and requires scene understanding so it is not …

idisc: Internal discretization for monocular depth estimation

L Piccinelli, C Sakaridis, F Yu - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Monocular depth estimation is fundamental for 3D scene understanding and downstream
applications. However, even under the supervised setup, it is still challenging and ill-posed …

Attention attention everywhere: Monocular depth prediction with skip attention

A Agarwal, C Arora - Proceedings of the IEEE/CVF Winter …, 2023 - openaccess.thecvf.com
Abstract Monocular Depth Estimation (MDE) aims to predict pixel-wise depth given a single
RGB image. For both, the convolutional as well as the recent attention-based models …

Robust monocular depth estimation under challenging conditions

S Gasperini, N Morbitzer, HJ Jung… - Proceedings of the …, 2023 - openaccess.thecvf.com
While state-of-the-art monocular depth estimation approaches achieve impressive results in
ideal settings, they are highly unreliable under challenging illumination and weather …

All in tokens: Unifying output space of visual tasks via soft token

J Ning, C Li, Z Zhang, C Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce AiT, a unified output representation for various vision tasks, which is a crucial
step towards general-purpose vision task solvers. Despite the challenges posed by the high …

Nddepth: Normal-distance assisted monocular depth estimation

S Shao, Z Pei, W Chen, X Wu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Monocular depth estimation has drawn widespread attention from the vision community due
to its broad applications. In this paper, we propose a novel physics (geometry)-driven deep …

Iebins: Iterative elastic bins for monocular depth estimation

S Shao, Z Pei, X Wu, Z Liu… - Advances in Neural …, 2024 - proceedings.neurips.cc
Monocular depth estimation (MDE) is a fundamental topic of geometric computer vision and
a core technique for many downstream applications. Recently, several methods reframe the …

Ra-depth: Resolution adaptive self-supervised monocular depth estimation

M He, L Hui, Y Bian, J Ren, J Xie, J Yang - European Conference on …, 2022 - Springer
Existing self-supervised monocular depth estimation methods can get rid of expensive
annotations and achieve promising results. However, these methods suffer from severe …