Hoi-diff: Text-driven synthesis of 3d human-object interactions using diffusion models
We address the problem of generating realistic 3D human-object interactions (HOIs) driven
by textual prompts. Instead of a single model, our key insight is to take a modular design and …
by textual prompts. Instead of a single model, our key insight is to take a modular design and …
One-shot open affordance learning with foundation models
Abstract We introduce One-shot Open Affordance Learning (OOAL) where a model is trained
with just one example per base object category but is expected to identify novel objects and …
with just one example per base object category but is expected to identify novel objects and …
Language-driven 6-dof grasp detection using negative prompt guidance
DoF grasp detection has been a fundamental and challenging problem in robotic vision.
While previous works have focused on ensuring grasp stability, they often do not consider …
While previous works have focused on ensuring grasp stability, they often do not consider …
Lan-grasp: Using large language models for semantic object grasping
In this paper, we propose LAN-grasp, a novel approach towards more appropriate semantic
grasping. We use foundation models to provide the robot with a deeper understanding of the …
grasping. We use foundation models to provide the robot with a deeper understanding of the …
Language-conditioned affordance-pose detection in 3d point clouds
Affordance detection and pose estimation are of great importance in many robotic
applications. Their combination helps the robot gain an enhanced manipulation capability …
applications. Their combination helps the robot gain an enhanced manipulation capability …
Grasp-anything: Large-scale grasp dataset from foundation models
Foundation models such as ChatGPT have made significant strides in robotic tasks due to
their universal representation of real-world domains. In this paper, we leverage foundation …
their universal representation of real-world domains. In this paper, we leverage foundation …
Learning 6-dof fine-grained grasp detection based on part affordance grounding
Robotic grasping is a fundamental ability for a robot to interact with the environment. Current
methods focus on how to obtain a stable and reliable grasping pose in object level, while …
methods focus on how to obtain a stable and reliable grasping pose in object level, while …
Open-vocabulary affordance detection using knowledge distillation and text-point correlation
Affordance detection presents intricate challenges and has a wide range of robotic
applications. Previous works have faced limitations such as the complexities of 3D object …
applications. Previous works have faced limitations such as the complexities of 3D object …
LLM-enhanced Scene Graph Learning for Household Rearrangement
The household rearrangement task involves spotting misplaced objects in a scene and
accommodate them with proper places. It depends both on common-sense knowledge on …
accommodate them with proper places. It depends both on common-sense knowledge on …
Learning precise affordances from egocentric videos for robotic manipulation
Affordance, defined as the potential actions that an object offers, is crucial for robotic
manipulation tasks. A deep understanding of affordance can lead to more intelligent AI …
manipulation tasks. A deep understanding of affordance can lead to more intelligent AI …