Robo3d: Towards robust and reliable 3d perception against corruptions

L Kong, Y Liu, X Li, R Chen, W Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
The robustness of 3D perception systems under natural corruptions from environments and
sensors is pivotal for safety-critical applications. Existing large-scale 3D perception datasets …

Surgical fine-tuning improves adaptation to distribution shifts

Y Lee, AS Chen, F Tajwar, A Kumar, H Yao… - arXiv preprint arXiv …, 2022 - arxiv.org
A common approach to transfer learning under distribution shift is to fine-tune the last few
layers of a pre-trained model, preserving learned features while also adapting to the new …

Robodepth: Robust out-of-distribution depth estimation under corruptions

L Kong, S Xie, H Hu, LX Ng… - Advances in Neural …, 2024 - proceedings.neurips.cc
Depth estimation from monocular images is pivotal for real-world visual perception systems.
While current learning-based depth estimation models train and test on meticulously curated …

4m: Massively multimodal masked modeling

D Mizrahi, R Bachmann, O Kar, T Yeo… - Advances in …, 2024 - proceedings.neurips.cc
Current machine learning models for vision are often highly specialized and limited to a
single modality and task. In contrast, recent large language models exhibit a wide range of …

Stylegan knows normal, depth, albedo, and more

A Bhattad, D McKee, D Hoiem… - Advances in Neural …, 2024 - proceedings.neurips.cc
Intrinsic images, in the original sense, are image-like maps of scene properties like depth,
normal, albedo, or shading. This paper demonstrates that StyleGAN can easily be induced …

A probabilistic framework for lifelong test-time adaptation

D Brahma, P Rai - … of the IEEE/CVF Conference on …, 2023 - openaccess.thecvf.com
Test-time adaptation (TTA) is the problem of updating a pre-trained source model at
inference time given test input (s) from a different target domain. Most existing TTA …

Consistent123: One image to highly consistent 3d asset using case-aware diffusion priors

Y Lin, H Han, C Gong, Z Xu, Y Zhang, X Li - arXiv preprint arXiv …, 2023 - arxiv.org
Reconstructing 3D objects from a single image guided by pretrained diffusion models has
demonstrated promising outcomes. However, due to utilizing the case-agnostic rigid …

The robodepth challenge: Methods and advancements towards robust depth estimation

L Kong, Y Niu, S Xie, H Hu, LX Ng… - arXiv preprint arXiv …, 2023 - arxiv.org
Accurate depth estimation under out-of-distribution (OoD) scenarios, such as adverse
weather conditions, sensor failure, and noise contamination, is desirable for safety-critical …

Exploiting Diffusion Prior for Generalizable Dense Prediction

HY Lee, HY Tseng, MH Yang - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Contents generated by recent advanced Text-to-Image (T2I) diffusion models are sometimes
too imaginative for existing off-the-shelf dense predictors to estimate due to the immitigable …

Rethinking inductive biases for surface normal estimation

G Bae, AJ Davison - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Despite the growing demand for accurate surface normal estimation models existing
methods use general-purpose dense prediction models adopting the same inductive biases …