Depth anything: Unleashing the power of large-scale unlabeled data
Abstract This work presents Depth Anything a highly practical solution for robust monocular
depth estimation. Without pursuing novel technical modules we aim to build a simple yet …
depth estimation. Without pursuing novel technical modules we aim to build a simple yet …
Unleashing text-to-image diffusion models for visual perception
Diffusion models (DMs) have become the new trend of generative models and have
demonstrated a powerful ability of conditional synthesis. Among those, text-to-image …
demonstrated a powerful ability of conditional synthesis. Among those, text-to-image …
Repurposing diffusion-based image generators for monocular depth estimation
Monocular depth estimation is a fundamental computer vision task. Recovering 3D depth
from a single image is geometrically ill-posed and requires scene understanding so it is not …
from a single image is geometrically ill-posed and requires scene understanding so it is not …
idisc: Internal discretization for monocular depth estimation
Monocular depth estimation is fundamental for 3D scene understanding and downstream
applications. However, even under the supervised setup, it is still challenging and ill-posed …
applications. However, even under the supervised setup, it is still challenging and ill-posed …
Attention attention everywhere: Monocular depth prediction with skip attention
Abstract Monocular Depth Estimation (MDE) aims to predict pixel-wise depth given a single
RGB image. For both, the convolutional as well as the recent attention-based models …
RGB image. For both, the convolutional as well as the recent attention-based models …
Robust monocular depth estimation under challenging conditions
While state-of-the-art monocular depth estimation approaches achieve impressive results in
ideal settings, they are highly unreliable under challenging illumination and weather …
ideal settings, they are highly unreliable under challenging illumination and weather …
All in tokens: Unifying output space of visual tasks via soft token
We introduce AiT, a unified output representation for various vision tasks, which is a crucial
step towards general-purpose vision task solvers. Despite the challenges posed by the high …
step towards general-purpose vision task solvers. Despite the challenges posed by the high …
Nddepth: Normal-distance assisted monocular depth estimation
Monocular depth estimation has drawn widespread attention from the vision community due
to its broad applications. In this paper, we propose a novel physics (geometry)-driven deep …
to its broad applications. In this paper, we propose a novel physics (geometry)-driven deep …
Iebins: Iterative elastic bins for monocular depth estimation
S Shao, Z Pei, X Wu, Z Liu… - Advances in Neural …, 2024 - proceedings.neurips.cc
Monocular depth estimation (MDE) is a fundamental topic of geometric computer vision and
a core technique for many downstream applications. Recently, several methods reframe the …
a core technique for many downstream applications. Recently, several methods reframe the …
Ra-depth: Resolution adaptive self-supervised monocular depth estimation
Existing self-supervised monocular depth estimation methods can get rid of expensive
annotations and achieve promising results. However, these methods suffer from severe …
annotations and achieve promising results. However, these methods suffer from severe …