Shapellm: Universal 3d object understanding for embodied interaction

Z Qi, R Dong, S Zhang, H Geng, C Han, Z Ge… - … on Computer Vision, 2025 - Springer
This paper presents ShapeLLM, the first 3D Multimodal Large Language Model (LLM)
designed for embodied interaction, exploring a universal 3D object understanding with 3D …

Remos: 3d motion-conditioned reaction synthesis for two-person interactions

A Ghosh, R Dabral, V Golyanik, C Theobalt… - … on Computer Vision, 2025 - Springer
Current approaches for 3D human motion synthesis generate high-quality animations of
digital humans performing a wide variety of actions and gestures. However, a notable …

Diffh2o: Diffusion-based synthesis of hand-object interactions from textual descriptions

S Christen, S Hampali, F Sener, E Remelli… - SIGGRAPH Asia 2024 …, 2024 - dl.acm.org
We introduce DiffH2O, a new diffusion-based framework for synthesizing realistic, dexterous
hand-object interactions from natural language. Our model employs a temporal two-stage …

Grasping diverse objects with simulated humanoids

Z Luo, J Cao, S Christen, A Winkler, K Kitani… - arXiv preprint arXiv …, 2024 - arxiv.org
We present a method for controlling a simulated humanoid to grasp an object and move it to
follow an object trajectory. Due to the challenges in controlling a humanoid with dexterous …

3D Whole-body Grasp Synthesis with Directional Controllability

G Paschalidis, R Wilschut, D Antić, O Taheri… - arXiv preprint arXiv …, 2024 - arxiv.org
Synthesizing 3D whole-bodies that realistically grasp objects is useful for animation, mixed
reality, and robotics. This is challenging, because the hands and body need to look natural …

ManiDext: Hand-Object Manipulation Synthesis via Continuous Correspondence Embeddings and Residual-Guided Diffusion

J Zhang, Y Zhang, L An, M Li, H Zhang, Z Hu… - arXiv preprint arXiv …, 2024 - arxiv.org
Dynamic and dexterous manipulation of objects presents a complex challenge, requiring the
synchronization of hand motions with the trajectories of objects to achieve seamless and …

Human-object interaction from human-level instructions

Z Wu, J Li, CK Liu - arXiv preprint arXiv:2406.17840, 2024 - arxiv.org
Intelligent agents need to autonomously navigate and interact within contextual
environments to perform a wide range of daily tasks based on human-level instructions …

Learning Context with Priors for 3D Interacting Hand-Object Pose Estimation

Z Kuang, C Ding, H Yao - Proceedings of the 32nd ACM International …, 2024 - dl.acm.org
Achieving 3D hand-object pose estimation in interaction scenarios is challenging due to the
severe occlusion generated during the interaction. Existing methods address this issue by …

[HTML][HTML] Learning 3D human–object interaction graphs from transferable context knowledge for construction monitoring

L Xie, S Misra, N Suresh, J Soza-Soto, T Furuhata… - Computers in …, 2025 - Elsevier
We propose a novel framework for detecting 3D human–object interactions (HOI) in
construction sites and a toolkit for generating construction-related human–object interaction …

ManiVideo: Generating Hand-Object Manipulation Video with Dexterous and Generalizable Grasping

Y Pang, R Shao, J Zhang, H Tu, Y Liu, B Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we introduce ManiVideo, a novel method for generating consistent and
temporally coherent bimanual hand-object manipulation videos from given motion …