Sim2real predictivity: Does evaluation in simulation predict real-world performance?

C Tang, B Abbatematteo, J Hu… - Annual Review of …, 2024 - annualreviews.org

Reinforcement learning (RL), particularly its combination with deep neural networks,
referred to as deep RL (DRL), has shown tremendous promise across a wide range of …

被引用次数：16 相关文章所有 3 个版本

[PDF] arxiv.org

A survey of embodied ai: From simulators to research tasks

J Duan, S Yu, HL Tan, H Zhu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

There has been an emerging paradigm shift from the era of “internet AI” to “embodied AI,”
where AI algorithms and agents no longer learn from datasets of images, videos or text …

被引用次数：291 相关文章所有 8 个版本

[PDF] neurips.cc

Habitat 2.0: Training home assistants to rearrange their habitat

A Szot, A Clegg, E Undersander… - Advances in neural …, 2021 - proceedings.neurips.cc

Abstract We introduce Habitat 2.0 (H2. 0), a simulation platform for training virtual robots in
interactive 3D environments and complex physics-enabled scenarios. We make …

被引用次数：506 相关文章所有 6 个版本

[PDF] science.org

Navigating to objects in the real world

T Gervet, S Chintala, D Batra, J Malik, DS Chaplot - Science Robotics, 2023 - science.org

Semantic navigation is necessary to deploy mobile robots in uncontrolled environments
such as homes or hospitals. Many learning-based approaches have been proposed in …

被引用次数：102 相关文章所有 8 个版本

[PDF] thecvf.com

Simple but effective: Clip embeddings for embodied ai

A Khandelwal, L Weihs, R Mottaghi… - Proceedings of the …, 2022 - openaccess.thecvf.com

Contrastive language image pretraining (CLIP) encoders have been shown to be beneficial
for a range of visual tasks from classification and detection to captioning and image …

被引用次数：230 相关文章所有 5 个版本

[PDF] arxiv.org

Nomad: Goal masked diffusion policies for navigation and exploration

A Sridhar, D Shah, C Glossop… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org

Robotic learning for navigation in unfamiliar environments needs to provide policies for both
task-oriented navigation (ie, reaching a goal that the robot has located), and task-agnostic …

被引用次数：62 相关文章所有 4 个版本

[PDF] arxiv.org

International Workshop on Multimodal Learning-2023 Theme: Multimodal Learning with Foundation Models

Y Ling, F Wu, S Dong, Y Feng, G Karypis… - Proceedings of the 29th …, 2023 - dl.acm.org

The recent advancements in machine learning and artificial intelligence (particularly
foundation models such as BERT, GPT-3, T5, ResNet, etc.) have demonstrated remarkable …

被引用次数：241 相关文章所有 6 个版本

[HTML] ieee-jas.net

Parallel learning: Overview and perspective for computational learning across Syn2Real and Sim2Real

Q Miao, Y Lv, M Huang, X Wang… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org

The virtual-to-real paradigm, ie, training models on virtual data and then applying them to
solve real-world problems, has attracted more and more attention from various domains by …

被引用次数：87 相关文章所有 3 个版本

[PDF] arxiv.org

ViNT: A foundation model for visual navigation

D Shah, A Sridhar, N Dashora, K Stachowicz… - arXiv preprint arXiv …, 2023 - arxiv.org

General-purpose pre-trained models (" foundation models") have enabled practitioners to
produce generalizable solutions for individual machine learning problems with datasets that …

被引用次数：101 相关文章所有 4 个版本

[PDF] neurips.cc

Soundspaces 2.0: A simulation platform for visual-acoustic learning

C Chen, C Schissler, S Garg… - Advances in …, 2022 - proceedings.neurips.cc

Abstract We introduce SoundSpaces 2.0, a platform for on-the-fly geometry-based audio
rendering for 3D environments. Given a 3D mesh of a real-world environment …

被引用次数：83 相关文章所有 8 个版本