Autoencoders as Cross-Modal Teachers: Can Pretrained 2D Image Transformers Help 3D Representation Learning? R Dong, Z Qi, L Zhang, J Zhang, J Sun, Z Ge, L Yi, K Ma International Conference on Learning Representations (ICLR), 2023, 2022 | 63 | 2022 |
Rethinking the augmentation module in contrastive learning: Learning hierarchical augmentation invariance with expanded views J Zhang, K Ma IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, 2022 | 35 | 2022 |
CLIP-FO3D: Learning free open-world 3D scene representations from 2D dense CLIP J Zhang, R Dong, K Ma arXiv preprint arXiv:2303.04748, 2023 | 34 | 2023 |
Contrastive deep supervision L Zhang, X Chen, J Zhang, R Dong, K Ma European Conference on Computer Vision (ECCV), 2022, 2022 | 31 | 2022 |
Language-Assisted 3D Feature Learning for Semantic Scene Understanding J Zhang, G Fan, G Wang, Z Su, K Ma, L Yi AAAI Conference on Artificial Intelligence (AAAI), 2023, 2023 | 6 | 2023 |
Multi-task Manipulation Policy Modeling with Visuomotor Latent Diffusion W Tan, B Liu, J Zhang, R Song, J Fu arXiv preprint arXiv:2403.07312, 2024 | | 2024 |