An outlook into the future of egocentric vision

C Plizzari, G Goletto, A Furnari, S Bansal… - International Journal of …, 2024 - Springer
What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …

Complete-to-partial 4D distillation for self-supervised point cloud sequence representation learning

Z Zhang, Y Dong, Y Liu, L Yi - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Recent work on 4D point cloud sequences has attracted a lot of attention. However,
obtaining exhaustively labeled 4D datasets is often very expensive and laborious, so it is …

Shapellm: Universal 3d object understanding for embodied interaction

Z Qi, R Dong, S Zhang, H Geng, C Han, Z Ge… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper presents ShapeLLM, the first 3D Multimodal Large Language Model (LLM)
designed for embodied interaction, exploring a universal 3D object understanding with 3D …

LeaF: Learning Frames for 4D Point Cloud Sequence Understanding

Y Liu, J Chen, Z Zhang, J Huang… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
We focus on learning descriptive geometry and motion features from 4D point cloud
sequences in this work. Existing works usually develop generic 4D learning tools without …

Pointcmp: Contrastive mask prediction for self-supervised learning on point cloud videos

Z Shen, X Sheng, L Wang, Y Guo… - Proceedings of the …, 2023 - openaccess.thecvf.com
Self-supervised learning can extract representations of good quality from solely unlabeled
data, which is appealing for point cloud videos due to their high labelling cost. In this paper …

Masked spatio-temporal structure prediction for self-supervised learning on point cloud videos

Z Shen, X Sheng, H Fan, L Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recently, the community has made tremendous progress in developing effective methods
for point cloud video understanding that learn from massive amounts of labeled data …

Point contrastive prediction with semantic clustering for self-supervised learning on point cloud videos

X Sheng, Z Shen, G Xiao, L Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com
We propose a unified point cloud video self-supervised learning framework for object-centric
and scene-centric data. Previous methods commonly conduct representation learning at the …

Semantic Complete Scene Forecasting from a 4D Dynamic Point Cloud Sequence

Z Wang, Z Ye, H Wu, J Chen, L Yi - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org
We study a new problem of semantic complete scene forecasting (SCSF) in this work. Given
a 4D dynamic point cloud sequence, our goal is to forecast the complete scene …

Contrastive predictive autoencoders for dynamic point cloud self-supervised learning

X Sheng, Z Shen, G Xiao - Proceedings of the AAAI Conference on …, 2023 - ojs.aaai.org
We present a new self-supervised paradigm on point cloud sequence understanding.
Inspired by the discriminative and generative self-supervised methods, we design two tasks …

Interactive humanoid: Online full-body motion reaction synthesis with social affordance canonicalization and forecasting

Y Liu, C Chen, L Yi - arXiv preprint arXiv:2312.08983, 2023 - arxiv.org
We focus on the human-humanoid interaction task optionally with an object. We propose a
new task named online full-body motion reaction synthesis, which generates humanoid …