Visual slam: What are the current trends and what to expect?
In recent years, Simultaneous Localization and Mapping (SLAM) systems have shown
significant performance, accuracy, and efficiency gain. In this regard, Visual Simultaneous …
significant performance, accuracy, and efficiency gain. In this regard, Visual Simultaneous …
Ransac for robotic applications: A survey
Random Sample Consensus, most commonly abbreviated as RANSAC, is a robust
estimation method for the parameters of a model contaminated by a sizable percentage of …
estimation method for the parameters of a model contaminated by a sizable percentage of …
Emergent correspondence from image diffusion
Finding correspondences between images is a fundamental problem in computer vision. In
this paper, we show that correspondence emerges in image diffusion models without any …
this paper, we show that correspondence emerges in image diffusion models without any …
Lightglue: Local feature matching at light speed
P Lindenberger, PE Sarlin… - Proceedings of the …, 2023 - openaccess.thecvf.com
We introduce LightGlue, a deep neural network that learns to match local features across
images. We revisit multiple design decisions of SuperGlue, the state of the art in sparse …
images. We revisit multiple design decisions of SuperGlue, the state of the art in sparse …
Blink: Multimodal large language models can see but not perceive
We introduce Blink, a new benchmark for multimodal language models (LLMs) that focuses
on core visual perception abilities not found in other evaluations. Most of the Blink tasks can …
on core visual perception abilities not found in other evaluations. Most of the Blink tasks can …
LoFTR: Detector-free local feature matching with transformers
We present a novel method for local image feature matching. Instead of performing image
feature detection, description, and matching sequentially, we propose to first establish pixel …
feature detection, description, and matching sequentially, we propose to first establish pixel …
Image matching from handcrafted to deep features: A survey
As a fundamental and critical task in various visual applications, image matching can identify
then correspond the same or similar structure/content from two or more images. Over the …
then correspond the same or similar structure/content from two or more images. Over the …
Hypercorrelation squeeze for few-shot segmentation
Few-shot semantic segmentation aims at learning to segment a target object from a query
image using only a few annotated support images of the target class. This challenging task …
image using only a few annotated support images of the target class. This challenging task …
Tapir: Tracking any point with per-frame initialization and temporal refinement
We present a novel model for Tracking Any Point (TAP) that effectively tracks any queried
point on any physical surface throughout a video sequence. Our approach employs two …
point on any physical surface throughout a video sequence. Our approach employs two …
Cotr: Correspondence transformer for matching across images
We propose a novel framework for finding correspondences in images based on a deep
neural network that, given two images and a query point in one of them, finds its …
neural network that, given two images and a query point in one of them, finds its …