EnvEdit: Environment Editing for Vision-and-Language Navigation J Li, H Tan, M Bansal CVPR 2022, 2022 | 71 | 2022 |
Exploring the role of argument structure in online debate persuasion J Li, E Durmus, C Cardie EMNLP 2020, 2020 | 46 | 2020 |
Improving cross-modal alignment in vision language navigation via syntactic information J Li, H Tan, M Bansal NAACL 2021, 2021 | 35 | 2021 |
Scaling Data Generation in Vision-and-Language Navigation Z Wang, J Li, Y Hong, Y Wang, Q Wu, M Bansal, S Gould, H Tan, Y Qiao ICCV 2023, 2023 | 30 | 2023 |
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation J Li, M Bansal NeurIPS 2023, 2023 | 22 | 2023 |
Improving vision-and-language navigation by generating future-view image semantics J Li, M Bansal Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 19 | 2023 |
Ndh-full: Learning and evaluating navigational agents on full-length dialogue H Kim, J Li, M Bansal Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 14 | 2021 |
CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations J Li, H Tan, M Bansal Findings of NAACL 2022, 2022 | 12 | 2022 |
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation J Li, A Padmakumar, G Sukhatme, M Bansal AAAI 2024, 2024 | 5 | 2024 |
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data J Li, J Cho, YL Sung, J Yoon, M Bansal arXiv preprint arXiv:2403.06952, 2024 | 2 | 2024 |
Multimodal large language model for visual navigation YHH Tsai, V Dhar, J Li, B Zhang, J Zhang arXiv preprint arXiv:2310.08669, 2023 | 2 | 2023 |
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation Models Y Zhang, Z Ma, J Li, Y Qiao, Z Wang, J Chai, Q Wu, M Bansal, ... arXiv preprint arXiv:2407.07035, 2024 | 1 | 2024 |