Gridmm: Grid memory map for vision-and-language navigation Z Wang, X Li, J Yang, Y Liu, S Jiang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 28 | 2023 |
Kerm: Knowledge enhanced reasoning for vision-and-language navigation X Li, Z Wang, J Yang, Y Wang, S Jiang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 25 | 2023 |
Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation Z Wang, X Li, J Yang, Y Liu, J Hu, M Jiang, S Jiang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 5 | 2024 |
Membridge: Video-language pre-training with memory-augmented inter-modality bridge J Yang, X Li, M Zheng, Z Wang, Y Zhu, X Guo, Y Yuan, Z Chai, S Jiang IEEE Transactions on Image Processing, 2023 | 5 | 2023 |
Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation Z Wang, X Li, J Yang, S Jiang Conference on Robot Learning (CoRL), 2024 | 1 | 2024 |
Focus and Align: Learning Tube Tokens for Video-Language Pre-Training Y Zhu, X Li, M Zheng, J Yang, Z Wang, X Guo, Z Chai, Y Yuan, S Jiang IEEE Transactions on Multimedia 25, 8036-8050, 2022 | 1 | 2022 |