Point primitive transformer for long-term 4D point cloud video understanding

C Plizzari, G Goletto, A Furnari, S Bansal… - International Journal of …, 2024 - Springer

What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …

Cited by 15 · Related articles · All 7 versions

Complete-to-partial 4D distillation for self-supervised point cloud sequence representation learning

Z Zhang, Y Dong, Y Liu, L Yi - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

Recent work on 4D point cloud sequences has attracted a lot of attention. However,
obtaining exhaustively labeled 4D datasets is often very expensive and laborious, so it is …

Cited by 15 · Related articles · All 5 versions

ShapeLLM: Universal 3D object understanding for embodied interaction

Z Qi, R Dong, S Zhang, H Geng, C Han, Z Ge… - arXiv preprint arXiv …, 2024 - arxiv.org

This paper presents ShapeLLM, the first 3D Multimodal Large Language Model (LLM)
designed for embodied interaction, exploring a universal 3D object understanding with 3D …

Cited by 12 · Related articles · All 2 versions

LeaF: Learning Frames for 4D Point Cloud Sequence Understanding

Y Liu, J Chen, Z Zhang, J Huang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

We focus on learning descriptive geometry and motion features from 4D point cloud
sequences in this work. Existing works usually develop generic 4D learning tools without …

Cited by 4 · Related articles · All 3 versions

PointCMP: Contrastive mask prediction for self-supervised learning on point cloud videos

Z Shen, X Sheng, L Wang, Y Guo… - Proceedings of the …, 2023 - openaccess.thecvf.com

Self-supervised learning can extract representations of good quality from solely unlabeled
data, which is appealing for point cloud videos due to their high labelling cost. In this paper …

Cited by 9 · Related articles · All 5 versions

Masked spatio-temporal structure prediction for self-supervised learning on point cloud videos

Z Shen, X Sheng, H Fan, L Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recently, the community has made tremendous progress in developing effective methods
for point cloud video understanding that learn from massive amounts of labeled data …

Cited by 4 · Related articles · All 5 versions

Point contrastive prediction with semantic clustering for self-supervised learning on point cloud videos

X Sheng, Z Shen, G Xiao, L Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

We propose a unified point cloud video self-supervised learning framework for object-centric
and scene-centric data. Previous methods commonly conduct representation learning at the …

Cited by 4 · Related articles · All 5 versions

Semantic Complete Scene Forecasting from a 4D Dynamic Point Cloud Sequence

Z Wang, Z Ye, H Wu, J Chen, L Yi - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org

We study a new problem of semantic complete scene forecasting (SCSF) in this work. Given
a 4D dynamic point cloud sequence, our goal is to forecast the complete scene …

Cited by 2 · Related articles · All 3 versions

Contrastive predictive autoencoders for dynamic point cloud self-supervised learning

X Sheng, Z Shen, G Xiao - Proceedings of the AAAI Conference on …, 2023 - ojs.aaai.org

We present a new self-supervised paradigm on point cloud sequence understanding.
Inspired by the discriminative and generative self-supervised methods, we design two tasks …

Cited by 7 · Related articles · All 4 versions

Interactive humanoid: Online full-body motion reaction synthesis with social affordance canonicalization and forecasting

Y Liu, C Chen, L Yi - arXiv preprint arXiv:2312.08983, 2023 - arxiv.org

We focus on the human-humanoid interaction task optionally with an object. We propose a
new task named online full-body motion reaction synthesis, which generates humanoid …

Cited by 4 · Related articles · All 2 versions