Knowledge-integrated machine learning for materials: lessons from gameplaying and robotics
As materials researchers increasingly embrace machine-learning (ML) methods, it is natural
to wonder what lessons can be learned from other fields undergoing similar developments …
to wonder what lessons can be learned from other fields undergoing similar developments …
Offline reinforcement learning: Tutorial, review, and perspectives on open problems
In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …
started on research on offline reinforcement learning algorithms: reinforcement learning …
Mastering diverse domains through world models
Developing a general algorithm that learns to solve tasks across a wide range of
applications has been a fundamental challenge in artificial intelligence. Although current …
applications has been a fundamental challenge in artificial intelligence. Although current …
Planning with diffusion for flexible behavior synthesis
Model-based reinforcement learning methods often use learning only for the purpose of
estimating an approximate dynamics model, offloading the rest of the decision-making work …
estimating an approximate dynamics model, offloading the rest of the decision-making work …
Flexible diffusion modeling of long videos
We present a framework for video modeling based on denoising diffusion probabilistic
models that produces long-duration video completions in a variety of realistic environments …
models that produces long-duration video completions in a variety of realistic environments …
Deep reinforcement learning at the edge of the statistical precipice
Deep reinforcement learning (RL) algorithms are predominantly evaluated by comparing
their relative performance on a large suite of tasks. Most published results on deep RL …
their relative performance on a large suite of tasks. Most published results on deep RL …
The primacy bias in deep reinforcement learning
This work identifies a common flaw of deep reinforcement learning (RL) algorithms: a
tendency to rely on early interactions and ignore useful evidence encountered later …
tendency to rely on early interactions and ignore useful evidence encountered later …
Mastering atari with discrete world models
Intelligent agents need to generalize from past experience to achieve goals in complex
environments. World models facilitate such generalization and allow learning behaviors …
environments. World models facilitate such generalization and allow learning behaviors …
Foundation models for decision making: Problems, methods, and opportunities
Foundation models pretrained on diverse data at scale have demonstrated extraordinary
capabilities in a wide range of vision and language tasks. When such models are deployed …
capabilities in a wide range of vision and language tasks. When such models are deployed …
Bigger, better, faster: Human-level atari with human-level efficiency
We introduce a value-based RL agent, which we call BBF, that achieves super-human
performance in the Atari 100K benchmark. BBF relies on scaling the neural networks used …
performance in the Atari 100K benchmark. BBF relies on scaling the neural networks used …