A Review of Safe Reinforcement Learning: Methods, Theories and Applications

S Gu, L Yang, Y Du, G Chen, F Walter… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Knowledge-integrated machine learning for materials: lessons from gameplaying and robotics

K Hippalgaonkar, Q Li, X Wang, JW Fisher III… - Nature Reviews …, 2023 - nature.com
As materials researchers increasingly embrace machine-learning (ML) methods, it is natural
to wonder what lessons can be learned from other fields undergoing similar developments …

Generative agents: Interactive simulacra of human behavior

JS Park, J O'Brien, CJ Cai, MR Morris, P Liang… - Proceedings of the 36th …, 2023 - dl.acm.org
Believable proxies of human behavior can empower interactive applications ranging from
immersive environments to rehearsal spaces for interpersonal communication to prototyping …

Champion-level drone racing using deep reinforcement learning

E Kaufmann, L Bauersfeld, A Loquercio, M Müller… - Nature, 2023 - nature.com
First-person view (FPV) drone racing is a televised sport in which professional competitors
pilot high-speed aircraft through a 3D circuit. Each pilot sees the environment from the …

Human-level play in the game of Diplomacy by combining language models with strategic reasoning

Meta Fundamental AI Research Diplomacy Team … - Science, 2022 - science.org
Despite much progress in training artificial intelligence (AI) systems to imitate human
language, building agents that use language to communicate intentionally with humans in …

Video pretraining (vpt): Learning to act by watching unlabeled online videos

B Baker, I Akkaya, P Zhokov… - Advances in …, 2022 - proceedings.neurips.cc
Pretraining on noisy, internet-scale datasets has been heavily studied as a technique for
training models with broad, general capabilities for text, images, and other modalities …

Fusing blockchain and AI with metaverse: A survey

Q Yang, Y Zhao, H Huang, Z Xiong… - IEEE Open Journal …, 2022 - ieeexplore.ieee.org
Metaverse as the latest buzzword has attracted great attention from both industry and
academia. Metaverse seamlessly integrates the real world with the virtual world and allows …

Outracing champion Gran Turismo drivers with deep reinforcement learning

PR Wurman, S Barrett, K Kawamoto, J MacGlashan… - Nature, 2022 - nature.com
Many potential applications of artificial intelligence involve making real-time decisions in
physical systems while interacting with humans. Automobile racing represents an extreme …

Deep reinforcement learning at the edge of the statistical precipice

R Agarwal, M Schwarzer, PS Castro… - Advances in neural …, 2021 - proceedings.neurips.cc
Deep reinforcement learning (RL) algorithms are predominantly evaluated by comparing
their relative performance on a large suite of tasks. Most published results on deep RL …

Mastering the game of Stratego with model-free multiagent reinforcement learning

J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub… - Science, 2022 - science.org
We introduce DeepNash, an autonomous agent that plays the imperfect information game
Stratego at a human expert level. Stratego is one of the few iconic board games that artificial …