On evaluation of embodied navigation agents

J Duan, S Yu, HL Tan, H Zhu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

There has been an emerging paradigm shift from the era of “internet AI” to “embodied AI,”
where AI algorithms and agents no longer learn from datasets of images, videos or text …

Cited by 250 · Related articles · All 8 versions

Evaluation of socially-aware robot navigation

Y Gao, CM Huang - Frontiers in Robotics and AI, 2022 - frontiersin.org

As mobile robots are increasingly introduced into our daily lives, it grows ever more
imperative that these robots navigate with and among people in a safe and socially …

Cited by 92 · Related articles · All 8 versions

Visual language maps for robot navigation

C Huang, O Mees, A Zeng… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org

Grounding language to the visual observations of a navigating agent can be performed
using off-the-shelf visual-language models pretrained on Internet-scale data (e.g., image …

Cited by 270 · Related articles · All 4 versions

Q-Transformer: Scalable offline reinforcement learning via autoregressive Q-functions

Y Chebotar, Q Vuong, K Hausman… - … on Robot Learning, 2023 - proceedings.mlr.press

In this work, we present a scalable reinforcement learning method for training multi-task
policies from large offline datasets that can leverage both human demonstrations and …

Cited by 57 · Related articles · All 6 versions

Interactive language: Talking to robots in real time

C Lynch, A Wahid, J Tompson, T Ding… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org

We present a framework for building interactive, real-time, natural language-instructable
robots in the real world, and we open source related assets (dataset, environment …

Cited by 165 · Related articles · All 3 versions

Habitat 2.0: Training home assistants to rearrange their habitat

A Szot, A Clegg, E Undersander… - Advances in neural …, 2021 - proceedings.neurips.cc

We introduce Habitat 2.0 (H2.0), a simulation platform for training virtual robots in
interactive 3D environments and complex physics-enabled scenarios. We make …

Cited by 465 · Related articles · All 6 versions

RVT: Robotic view transformer for 3D object manipulation

A Goyal, J Xu, Y Guo, V Blukis… - Conference on Robot …, 2023 - proceedings.mlr.press

For 3D object manipulation, methods that build an explicit 3D representation perform better
than those relying only on camera images. But using explicit 3D representations like voxels …

Cited by 71 · Related articles · All 4 versions

Navigating to objects in the real world

T Gervet, S Chintala, D Batra, J Malik, DS Chaplot - Science Robotics, 2023 - science.org

Semantic navigation is necessary to deploy mobile robots in uncontrolled environments
such as homes or hospitals. Many learning-based approaches have been proposed in …

Cited by 86 · Related articles · All 8 versions

Habitat-Matterport 3D dataset (HM3D): 1000 large-scale 3D environments for embodied AI

SK Ramakrishnan, A Gokaslan, E Wijmans… - arXiv preprint arXiv …, 2021 - arxiv.org

We present the Habitat-Matterport 3D (HM3D) dataset. HM3D is a large-scale dataset of
1,000 building-scale 3D reconstructions from a diverse set of real-world locations. Each …

Cited by 297 · Related articles · All 5 versions

History aware multimodal transformer for vision-and-language navigation

S Chen, PL Guhur, C Schmid… - Advances in neural …, 2021 - proceedings.neurips.cc

Vision-and-language navigation (VLN) aims to build autonomous visual agents that follow
instructions and navigate in real scenes. To remember previously visited locations and …

Cited by 200 · Related articles · All 8 versions