Simultaneous mapping and target driven navigation

J Duan, S Yu, HL Tan, H Zhu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

There has been an emerging paradigm shift from the era of “internet AI” to “embodied AI,”
where AI algorithms and agents no longer learn from datasets of images, videos or text …

Cited by 256 · Related articles · All 8 versions

Cross-modal map learning for vision and language navigation

G Georgakis, K Schmeckpeper… - Proceedings of the …, 2022 - openaccess.thecvf.com

We consider the problem of Vision-and-Language Navigation (VLN). The majority of current
methods for VLN are trained end-to-end using either unstructured memory such as LSTM, or …

Cited by 56 · Related articles · All 9 versions

Visual navigation with spatial attention

B Mayo, T Hazan, A Tal - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com

This work focuses on object goal visual navigation, aiming at finding the location of an object
from a given class, where in each step the agent is provided with an egocentric RGB image …

Cited by 77 · Related articles · All 5 versions

3D-aware object goal navigation via simultaneous exploration and identification

J Zhang, L Dai, F Meng, Q Fan… - Proceedings of the …, 2023 - openaccess.thecvf.com

Object goal navigation (ObjectNav) in unseen environments is a fundamental task for
Embodied AI. Agents in existing works learn ObjectNav policies based on 2D maps, scene …

Cited by 21 · Related articles · All 6 versions

Semantic MapNet: Building allocentric semantic maps and representations from egocentric views

V Cartillier, Z Ren, N Jain, S Lee, I Essa… - Proceedings of the AAAI …, 2021 - ojs.aaai.org

We study the task of semantic mapping–specifically, an embodied agent (a robot or an
egocentric AI assistant) is given a tour of a new environment and asked to build an …

Cited by 69 · Related articles · All 6 versions

Uncertainty-driven planner for exploration and navigation

G Georgakis, B Bucher, A Arapin… - … on Robotics and …, 2022 - ieeexplore.ieee.org

We consider the problems of exploration and pointgoal navigation in previously unseen
environments, where the spatial complexity of indoor scenes and partial observability …

Cited by 46 · Related articles · All 6 versions

Robust-EQA: robust learning for embodied question answering with noisy labels

H Luo, G Lin, F Shen, X Huang, Y Yao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Embodied question answering (EQA) is a recently emerged research field in which an agent
is asked to answer the user's questions by exploring the environment and collecting visual …

Cited by 11 · Related articles · All 4 versions

A survey of visual navigation: From geometry to embodied AI

T Zhang, X Hu, J Xiao, G Zhang - Engineering Applications of Artificial …, 2022 - Elsevier

The capacity to extract information and comprehend an unseen environment is critical for
mobile robots to navigate. Few surveys have mentioned the combinatorial-non-optimality …

Cited by 23 · Related articles · All 2 versions

Transformer-based vision-language alignment for robot navigation and question answering

H Luo, Z Guo, Z Wu, F Teng, T Li - Information Fusion, 2024 - Elsevier

The task of robot navigation and question answering, which is also known as Embodied
Question Answering (EQA), places its emphasis on empowering agents to actively explore …

Cited by 3 · Related articles

Depth and video segmentation based visual attention for embodied question answering

H Luo, G Lin, Y Yao, F Liu, Z Liu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

Embodied Question Answering (EQA) is a newly defined research area where an agent is
required to answer the user's questions by exploring the real-world environment. It has …

Cited by 14 · Related articles · All 6 versions