Recent trends in task and motion planning for robotics: A survey

H Guo, F Wu, Y Qin, R Li, K Li, K Li - ACM Computing Surveys, 2023 - dl.acm.org
Autonomous robots are increasingly served in real-world unstructured human environments
with complex long-horizon tasks, such as restaurant serving and office delivery. Task and …

Deep learning on monocular object pose detection and tracking: A comprehensive overview

Z Fan, Y Zhu, Y He, Q Sun, H Liu, J He - ACM Computing Surveys, 2022 - dl.acm.org
Object pose detection and tracking has recently attracted increasing attention due to its wide
applications in many areas, such as autonomous driving, robotics, and augmented reality …

Vector neurons: A general framework for so (3)-equivariant networks

C Deng, O Litany, Y Duan… - Proceedings of the …, 2021 - openaccess.thecvf.com
Invariance and equivariance to the rotation group have been widely discussed in the 3D
deep learning community for pointclouds. Yet most proposed methods either use complex …

Zebrapose: Coarse to fine surface encoding for 6dof object pose estimation

Y Su, M Saleh, T Fetzer, J Rambach… - Proceedings of the …, 2022 - openaccess.thecvf.com
Establishing correspondences from image to 3D has been a key task of 6DoF object pose
estimation for a long time. To predict pose more accurately, deeply learned dense maps …

Hoi4d: A 4d egocentric dataset for category-level human-object interaction

Y Liu, Y Liu, C Jiang, K Lyu, W Wan… - Proceedings of the …, 2022 - openaccess.thecvf.com
We present HOI4D, a large-scale 4D egocentric dataset with rich annotations, to catalyze the
research of category-level human-object interaction. HOI4D consists of 2.4 M RGB-D …

Shapellm: Universal 3d object understanding for embodied interaction

Z Qi, R Dong, S Zhang, H Geng, C Han, Z Ge… - … on Computer Vision, 2025 - Springer
This paper presents ShapeLLM, the first 3D Multimodal Large Language Model (LLM)
designed for embodied interaction, exploring a universal 3D object understanding with 3D …

ARCTIC: A dataset for dexterous bimanual hand-object manipulation

Z Fan, O Taheri, D Tzionas… - Proceedings of the …, 2023 - openaccess.thecvf.com
Humans intuitively understand that inanimate objects do not move by themselves, but that
state changes are typically caused by human manipulation (eg, the opening of a book). This …

Gpv-pose: Category-level object pose estimation via geometry-guided point-wise voting

Y Di, R Zhang, Z Lou, F Manhardt, X Ji… - Proceedings of the …, 2022 - openaccess.thecvf.com
While 6D object pose estimation has recently made a huge leap forward, most methods can
still only handle a single or a handful of different objects, which limits their applications. To …

Gapartnet: Cross-category domain-generalizable object perception and manipulation via generalizable and actionable parts

H Geng, H Xu, C Zhao, C Xu, L Yi… - Proceedings of the …, 2023 - openaccess.thecvf.com
For years, researchers have been devoted to generalizable object perception and
manipulation, where cross-category generalizability is highly desired yet underexplored. In …

Sgpa: Structure-guided prior adaptation for category-level 6d object pose estimation

K Chen, Q Dou - Proceedings of the IEEE/CVF International …, 2021 - openaccess.thecvf.com
Category-level 6D object pose estimation aims to predict the position and orientation for
unseen objects, which plays a pillar role in many scenarios such as robotics and augmented …