Category-level articulated object pose estimation

H Guo, F Wu, Y Qin, R Li, K Li, K Li - ACM Computing Surveys, 2023 - dl.acm.org

Autonomous robots are increasingly served in real-world unstructured human environments
with complex long-horizon tasks, such as restaurant serving and office delivery. Task and …

Cited by 44 Related articles

[PDF] arxiv.org

Deep learning on monocular object pose detection and tracking: A comprehensive overview

Z Fan, Y Zhu, Y He, Q Sun, H Liu, J He - ACM Computing Surveys, 2022 - dl.acm.org

Object pose detection and tracking has recently attracted increasing attention due to its wide
applications in many areas, such as autonomous driving, robotics, and augmented reality …

Cited by 98 Related articles All 3 versions

[PDF] thecvf.com

Vector neurons: A general framework for SO(3)-equivariant networks

C Deng, O Litany, Y Duan… - Proceedings of the …, 2021 - openaccess.thecvf.com

Invariance and equivariance to the rotation group have been widely discussed in the 3D
deep learning community for pointclouds. Yet most proposed methods either use complex …

Cited by 304 Related articles All 12 versions

[PDF] thecvf.com

ZebraPose: Coarse to fine surface encoding for 6DoF object pose estimation

Y Su, M Saleh, T Fetzer, J Rambach… - Proceedings of the …, 2022 - openaccess.thecvf.com

Establishing correspondences from image to 3D has been a key task of 6DoF object pose
estimation for a long time. To predict pose more accurately, deeply learned dense maps …

Cited by 141 Related articles All 9 versions

[PDF] thecvf.com

HOI4D: A 4D egocentric dataset for category-level human-object interaction

Y Liu, Y Liu, C Jiang, K Lyu, W Wan… - Proceedings of the …, 2022 - openaccess.thecvf.com

We present HOI4D, a large-scale 4D egocentric dataset with rich annotations, to catalyze the
research of category-level human-object interaction. HOI4D consists of 2.4 M RGB-D …

Cited by 134 Related articles All 5 versions

[PDF] arxiv.org

ShapeLLM: Universal 3D object understanding for embodied interaction

Z Qi, R Dong, S Zhang, H Geng, C Han, Z Ge… - … on Computer Vision, 2025 - Springer

This paper presents ShapeLLM, the first 3D Multimodal Large Language Model (LLM)
designed for embodied interaction, exploring a universal 3D object understanding with 3D …

Cited by 32 Related articles All 2 versions

[PDF] thecvf.com

ARCTIC: A dataset for dexterous bimanual hand-object manipulation

Z Fan, O Taheri, D Tzionas… - Proceedings of the …, 2023 - openaccess.thecvf.com

Humans intuitively understand that inanimate objects do not move by themselves, but that
state changes are typically caused by human manipulation (e.g., the opening of a book). This …

Cited by 135 Related articles All 8 versions

[PDF] thecvf.com

GPV-Pose: Category-level object pose estimation via geometry-guided point-wise voting

Y Di, R Zhang, Z Lou, F Manhardt, X Ji… - Proceedings of the …, 2022 - openaccess.thecvf.com

While 6D object pose estimation has recently made a huge leap forward, most methods can
still only handle a single or a handful of different objects, which limits their applications. To …

Cited by 118 Related articles All 5 versions

[PDF] thecvf.com

GAPartNet: Cross-category domain-generalizable object perception and manipulation via generalizable and actionable parts

H Geng, H Xu, C Zhao, C Xu, L Yi… - Proceedings of the …, 2023 - openaccess.thecvf.com

For years, researchers have been devoted to generalizable object perception and
manipulation, where cross-category generalizability is highly desired yet underexplored. In …

Cited by 77 Related articles All 6 versions

[PDF] thecvf.com

SGPA: Structure-guided prior adaptation for category-level 6D object pose estimation

K Chen, Q Dou - Proceedings of the IEEE/CVF International …, 2021 - openaccess.thecvf.com

Category-level 6D object pose estimation aims to predict the position and orientation for
unseen objects, which plays a pillar role in many scenarios such as robotics and augmented …

Cited by 131 Related articles All 4 versions