Stubborn: A strong baseline for indoor object navigation

A Khandelwal, L Weihs, R Mottaghi… - Proceedings of the …, 2022 - openaccess.thecvf.com

Contrastive language image pretraining (CLIP) encoders have been shown to be beneficial
for a range of visual tasks from classification and detection to captioning and image …

被引用次数：204 相关文章所有 5 个版本

[PDF] neurips.cc

Zson: Zero-shot object-goal navigation using multimodal goal embeddings

A Majumdar, G Aggarwal, B Devnani… - Advances in …, 2022 - proceedings.neurips.cc

We present a scalable approach for learning open-world object-goal navigation (ObjectNav)–
the task of asking a virtual robot (agent) to find any instance of an object in an unexplored …

被引用次数：86 相关文章所有 6 个版本

[PDF] openreview.net

Vlfm: Vision-language frontier maps for zero-shot semantic navigation

N Yokoyama, S Ha, D Batra, J Wang… - … on Robotics and …, 2024 - ieeexplore.ieee.org

Understanding how humans leverage semantic knowledge to navigate unfamiliar
environments and decide where to explore next is pivotal for developing robots capable of …

被引用次数：17 相关文章所有 5 个版本

[PDF] thecvf.com

Learning navigational visual representations with semantic map supervision

Y Hong, Y Zhou, R Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Being able to perceive the semantics and the spatial structure of the environment is
essential for visual navigation of a household robot. However, most existing works only …

被引用次数：15 相关文章所有 6 个版本

[PDF] thecvf.com

3d-aware object goal navigation via simultaneous exploration and identification

J Zhang, L Dai, F Meng, Q Fan… - Proceedings of the …, 2023 - openaccess.thecvf.com

Object goal navigation (ObjectNav) in unseen environments is a fundamental task for
Embodied AI. Agents in existing works learn ObjectNav policies based on 2D maps, scene …

被引用次数：20 相关文章所有 6 个版本

[PDF] arxiv.org

Etpnav: Evolving topological planning for vision-language navigation in continuous environments

D An, H Wang, W Wang, Z Wang… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Vision-language navigation is a task that requires an agent to follow instructions to navigate
in environments. It becomes increasingly crucial in the field of embodied AI, with potential …

被引用次数：28 相关文章所有 6 个版本

[PDF] arxiv.org

Object goal navigation with recursive implicit maps

S Chen, T Chabal, I Laptev… - 2023 IEEE/RSJ …, 2023 - ieeexplore.ieee.org

Object goal navigation aims to navigate an agent to locations of a given object category in
unseen environments. Classical methods explicitly build maps of environments and require …

被引用次数：10 相关文章所有 9 个版本

[PDF] thecvf.com

Peanut: Predicting and navigating to unseen targets

AJ Zhai, S Wang - … of the IEEE/CVF International Conference …, 2023 - openaccess.thecvf.com

Abstract Efficient ObjectGoal navigation (ObjectNav) in novel environments requires an
understanding of the spatial and semantic regularities in environment layouts. In this work …

被引用次数：14 相关文章所有 7 个版本

A survey of object goal navigation: Datasets, metrics and methods

D Wang, J Chen, J Cheng - 2023 IEEE International …, 2023 - ieeexplore.ieee.org

Object Goal Navigation (ObjectNav) aims at directing an agent to a specified target object
within an unseen scene. This task integrates advanced techniques, including visual …

被引用次数：2 相关文章

[PDF] arxiv.org

Learning generalizable feature fields for mobile manipulation

RZ Qiu, Y Hu, G Yang, Y Song, Y Fu, J Ye, J Mu… - arXiv preprint arXiv …, 2024 - arxiv.org

An open problem in mobile manipulation is how to represent objects and scenes in a unified
manner, so that robots can use it both for navigating in the environment and manipulating …

被引用次数：6 相关文章所有 2 个版本