On the evaluation of vision-and-language navigation instructions

L Ouyang, J Wu, X Jiang, D Almeida… - Advances in neural …, 2022 - proceedings.neurips.cc

Making language models bigger does not inherently make them better at following a user's
intent. For example, large language models can generate outputs that are untruthful, toxic, or …

被引用次数：7474 相关文章所有 18 个版本

[PDF] arxiv.org

Vision-and-language navigation: A survey of tasks, methods, and future directions

J Gu, E Stefani, Q Wu, J Thomason… - arXiv preprint arXiv …, 2022 - arxiv.org

A long-term goal of AI research is to build intelligent agents that can communicate with
humans in natural language, perceive the environment, and perform real-world tasks. Vision …

被引用次数：96 相关文章所有 6 个版本

[PDF] jair.org Full View

Core challenges in embodied vision-language planning

J Francis, N Kitamura, F Labelle, X Lu, I Navarro… - Journal of Artificial …, 2022 - jair.org

Recent advances in the areas of multimodal machine learning and artificial intelligence (AI)
have led to the development of challenging tasks at the intersection of Computer Vision …

被引用次数：32 相关文章所有 14 个版本

[PDF] thecvf.com

Envedit: Environment editing for vision-and-language navigation

J Li, H Tan, M Bansal - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com

Abstract In Vision-and-Language Navigation (VLN), an agent needs to navigate through the
environment based on natural language instructions. Due to limited available data for agent …

被引用次数：64 相关文章所有 5 个版本

[PDF] thecvf.com

Counterfactual cycle-consistent learning for instruction following and generation in vision-language navigation

H Wang, W Liang, J Shen… - Proceedings of the …, 2022 - openaccess.thecvf.com

Since the rise of vision-language navigation (VLN), great progress has been made in
instruction following--building a follower to navigate environments under the guidance of …

被引用次数：43 相关文章所有 9 个版本

[PDF] thecvf.com

Pathdreamer: A world model for indoor navigation

JY Koh, H Lee, Y Yang, J Baldridge… - Proceedings of the …, 2021 - openaccess.thecvf.com

People navigating in unfamiliar buildings take advantage of myriad visual, spatial and
semantic cues to efficiently achieve their navigation goals. Towards equipping …

被引用次数：63 相关文章所有 10 个版本

[PDF] thecvf.com

Less is more: Generating grounded navigation instructions from landmarks

S Wang, C Montgomery, J Orbay… - Proceedings of the …, 2022 - openaccess.thecvf.com

We study the automatic generation of navigation instructions from 360-degree images
captured on indoor routes. Existing generators suffer from poor visual grounding, causing …

被引用次数：41 相关文章所有 8 个版本

[PDF] arxiv.org

Etpnav: Evolving topological planning for vision-language navigation in continuous environments

D An, H Wang, W Wang, Z Wang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Vision-language navigation is a task that requires an agent to follow instructions to navigate
in environments. It becomes increasingly crucial in the field of embodied AI, with potential …

被引用次数：19 相关文章所有 6 个版本

[PDF] arxiv.org

Vision-language navigation: a survey and taxonomy

W Wu, T Chang, X Li, Q Yin, Y Hu - Neural Computing and Applications, 2024 - Springer

Vision-language navigation (VLN) tasks require an agent to follow language instructions
from a human guide to navigate in previously unseen environments using visual …

被引用次数：12 相关文章所有 4 个版本

[PDF] thecvf.com

A new path: Scaling vision-and-language navigation with synthetic instructions and imitation learning

A Kamath, P Anderson, S Wang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recent studies in Vision-and-Language Navigation (VLN) train RL agents to execute natural-
language navigation instructions in photorealistic environments, as a step towards robots …

被引用次数：14 相关文章所有 9 个版本