A survey on model-based reinforcement learning
Reinforcement learning (RL) interacts with the environment to solve sequential decision-
making problems via a trial-and-error approach. Errors are always undesirable in real-world …
making problems via a trial-and-error approach. Errors are always undesirable in real-world …
Mastering atari games with limited data
Reinforcement learning has achieved great success in many applications. However, sample
efficiency remains a key challenge, with prominent methods requiring millions (or even …
efficiency remains a key challenge, with prominent methods requiring millions (or even …
Exploring chemical reaction space with machine learning models: Representation and feature perspective
Chemical reactions serve as foundational building blocks for organic chemistry and drug
design. In the era of large AI models, data-driven approaches have emerged to innovate the …
design. In the era of large AI models, data-driven approaches have emerged to innovate the …
Bidirectional model-based policy optimization
Abstract Model-based reinforcement learning approaches leverage a forward dynamics
model to support planning and decision making, which, however, may fail catastrophically if …
model to support planning and decision making, which, however, may fail catastrophically if …
Mismatched no more: Joint model-policy optimization for model-based rl
B Eysenbach, A Khazatsky, S Levine… - Advances in …, 2022 - proceedings.neurips.cc
Many model-based reinforcement learning (RL) methods follow a similar template: fit a
model to previously observed data, and then use data from that model for RL or planning …
model to previously observed data, and then use data from that model for RL or planning …
Learning continuous models for continuous physics
Dynamical systems that evolve continuously over time are ubiquitous throughout science
and engineering. Machine learning (ML) provides data-driven approaches to model and …
and engineering. Machine learning (ML) provides data-driven approaches to model and …
How to fine-tune the model: Unified model shift and model bias policy optimization
Designing and deriving effective model-based reinforcement learning (MBRL) algorithms
with a performance improvement guarantee is challenging, mainly attributed to the high …
with a performance improvement guarantee is challenging, mainly attributed to the high …
Live in the moment: Learning dynamics model adapted to evolving policy
X Wang, W Wongkamjan, R Jia… - … on Machine Learning, 2023 - proceedings.mlr.press
Abstract Model-based reinforcement learning (RL) often achieves higher sample efficiency
in practice than model-free RL by learning a dynamics model to generate samples for policy …
in practice than model-free RL by learning a dynamics model to generate samples for policy …
gamma-models: Generative temporal difference learning for infinite-horizon prediction
We introduce the gamma-model, a predictive model of environment dynamics with an
infinite, probabilistic horizon. Replacing standard single-step models with gamma-models …
infinite, probabilistic horizon. Replacing standard single-step models with gamma-models …
Investigating compounding prediction errors in learned dynamics models
Accurately predicting the consequences of agents' actions is a key prerequisite for planning
in robotic control. Model-based reinforcement learning (MBRL) is one paradigm which relies …
in robotic control. Model-based reinforcement learning (MBRL) is one paradigm which relies …