DUSt3R: Geometric 3D vision made easy

S Wang, V Leroy, Y Cabon… - Proceedings of the …, 2024 - openaccess.thecvf.com
Multi-view stereo reconstruction (MVS) in the wild requires to first estimate the camera
intrinsic and extrinsic parameters. These are usually tedious and cumbersome to obtain yet …

DeDoDe: Detect, Don't Describe—Describe, Don't Detect for Local Feature Matching

J Edstedt, G Bökman, M Wadenbäck… - … Conference on 3D …, 2024 - ieeexplore.ieee.org
Keypoint detection is a pivotal step in 3D reconstruction, whereby sets of (up to) K points are
detected in each view of a scene. Crucially, the detected points need to be consistent …

Efficient LoFTR: Semi-dense local feature matching with sparse-like speed

Y Wang, X He, S Peng, D Tan… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We present a novel method for efficiently producing semi-dense matches across images.
Previous detector-free matcher LoFTR has shown remarkable matching capability in …

Affine-based Deformable Attention and Selective Fusion for Semi-dense Matching

H Chen, Z Luo, Y Tian, X Bai, Z Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Identifying robust and accurate correspondences across images is a fundamental problem
in computer vision that enables various downstream tasks. Recent semi-dense matching …

Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences

A Barroso-Laguna, S Munukutla… - Proceedings of the …, 2024 - openaccess.thecvf.com
Given two images we can estimate the relative camera pose between them by establishing
image-to-image correspondences. Usually correspondences are 2D-to-2D and the pose we …

The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement

G Trivigno, C Masone, B Caputo… - Proceedings of the …, 2024 - openaccess.thecvf.com
Pose refinement is an interesting and practically relevant research direction. Pose
refinement can be used to (1) obtain a more accurate pose estimate from an initial prior (e.g. …

Local feature matching using deep learning: A survey

S Xu, S Chen, R Xu, C Wang, P Lu, L Guo - Information Fusion, 2024 - Elsevier
Local feature matching enjoys wide-ranging applications in the realm of computer vision,
encompassing domains such as image retrieval, 3D reconstruction, and object recognition …

Crab: Cross-environment agent benchmark for multimodal language model agents

T Xu, L Chen, DJ Wu, Y Chen, Z Zhang, X Yao… - arXiv preprint arXiv …, 2024 - arxiv.org
The development of autonomous agents increasingly relies on Multimodal Language
Models (MLMs) to perform tasks described in natural language with GUI environments, such …

EarthMatch: Iterative Coregistration for Fine-grained Localization of Astronaut Photography

G Berton, G Goletto, G Trivigno… - Proceedings of the …, 2024 - openaccess.thecvf.com
Precise pixel-wise geolocalization of astronaut photography is critical to unlocking the
potential of this unique type of remotely sensed Earth data particularly for its use in disaster …

From Correspondences to Pose: Non-minimal Certifiably Optimal Relative Pose without Disambiguation

J Tirado-Garín, J Civera - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com
Estimating the relative camera pose from n ≥ 5 correspondences between two calibrated
views is a fundamental task in computer vision. This process typically involves two stages: 1) …