Revisiting design choices in offline model-based reinforcement learning
Offline reinforcement learning enables agents to leverage large pre-collected datasets of
environment transitions to learn control policies, circumventing the need for potentially …
environment transitions to learn control policies, circumventing the need for potentially …
When to update your model: Constrained model-based reinforcement learning
Designing and analyzing model-based RL (MBRL) algorithms with guaranteed monotonic
improvement has been challenging, mainly due to the interdependence between policy …
improvement has been challenging, mainly due to the interdependence between policy …
The virtues of laziness in model-based rl: A unified objective and algorithms
We propose a novel approach to addressing two fundamental challenges in Model-based
Reinforcement Learning (MBRL): the computational expense of repeatedly finding a good …
Reinforcement Learning (MBRL): the computational expense of repeatedly finding a good …
Model-based reparameterization policy gradient methods: Theory and practical algorithms
Abstract ReParameterization (RP) Policy Gradient Methods (PGMs) have been widely
adopted for continuous control tasks in robotics and computer graphics. However, recent …
adopted for continuous control tasks in robotics and computer graphics. However, recent …
Model-based uncertainty in value functions
We consider the problem of quantifying uncertainty over expected cumulative rewards in
model-based reinforcement learning. In particular, we focus on characterizing the variance …
model-based reinforcement learning. In particular, we focus on characterizing the variance …
An experimental design perspective on model-based reinforcement learning
In many practical applications of RL, it is expensive to observe state transitions from the
environment. For example, in the problem of plasma control for nuclear fusion, computing …
environment. For example, in the problem of plasma control for nuclear fusion, computing …
Value gradient weighted model-based reinforcement learning
Model-based reinforcement learning (MBRL) is a sample efficient technique to obtain control
policies, yet unavoidable modeling errors often lead performance deterioration. The model …
policies, yet unavoidable modeling errors often lead performance deterioration. The model …
Exploration via planning for information about the optimal trajectory
Many potential applications of reinforcement learning (RL) are stymied by the large numbers
of samples required to learn an effective policy. This is especially true when applying RL to …
of samples required to learn an effective policy. This is especially true when applying RL to …
Sim-to-real model-based and model-free deep reinforcement learning for tactile pushing
Object pushing presents a key non-prehensile manipulation problem that is illustrative of
more complex robotic manipulation tasks. While deep reinforcement learning (RL) methods …
more complex robotic manipulation tasks. While deep reinforcement learning (RL) methods …
Inferring smooth control: Monte carlo posterior policy iteration with gaussian processes
Monte Carlo methods have become increasingly relevant for control of non-differentiable
systems, approximate dynamics models, and learning from data. These methods scale to …
systems, approximate dynamics models, and learning from data. These methods scale to …