Scenefun3d: Fine-grained functionality and affordance understanding in 3d scenes
Existing 3D scene understanding methods are heavily focused on 3D semantic and instance
segmentation. However identifying objects and their parts only constitutes an intermediate …
segmentation. However identifying objects and their parts only constitutes an intermediate …
Segment3d: Learning fine-grained class-agnostic 3d segmentation without manual labels
Current 3D scene segmentation methods are heavily dependent on manually annotated 3D
training datasets. Such manual annotations are labor-intensive, and often lack fine-grained …
training datasets. Such manual annotations are labor-intensive, and often lack fine-grained …
OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views
Large visual-language models (VLMs), like CLIP, enable open-set image segmentation to
segment arbitrary concepts from an image in a zero-shot manner. This goes beyond the …
segment arbitrary concepts from an image in a zero-shot manner. This goes beyond the …
OrbitGrasp: -Equivariant Grasp Learning
While grasp detection is an important part of any robotic manipulation pipeline, reliable and
accurate grasp detection in $ SE (3) $ remains a research challenge. Many robotics …
accurate grasp detection in $ SE (3) $ remains a research challenge. Many robotics …
SceneGraphLoc: Cross-Modal Coarse Visual Localization on 3D Scene Graphs
We introduce the task of localizing an input image within a multi-modal reference map
represented by a collection of 3D scene graphs. These scene graphs comprise multiple …
represented by a collection of 3D scene graphs. These scene graphs comprise multiple …
P2P-Bridge: Diffusion Bridges for 3D Point Cloud Denoising
In this work, we address the task of point cloud denoising using a novel framework adapting
Diffusion Schrödinger bridges to unstructured data like point sets. Unlike previous works that …
Diffusion Schrödinger bridges to unstructured data like point sets. Unlike previous works that …
Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds
In recent years, modern techniques in deep learning and large-scale datasets have led to
impressive progress in 3D instance segmentation, grasp pose estimation, and robotics. This …
impressive progress in 3D instance segmentation, grasp pose estimation, and robotics. This …
TARGO: Benchmarking Target-driven Object Grasping under Occlusions
Recent advances in predicting 6D grasp poses from a single depth image have led to
promising performance in robotic grasping. However, previous grasping models face …
promising performance in robotic grasping. However, previous grasping models face …
OpenDAS: Domain Adaptation for Open-Vocabulary Segmentation
The advent of Vision Language Models (VLMs) transformed image understanding from
closed-set classifications to dynamic image-language interactions, enabling open …
closed-set classifications to dynamic image-language interactions, enabling open …