Revisiting design choices in offline model-based reinforcement learning

C Lu, PJ Ball, J Parker-Holder, MA Osborne… - arXiv preprint arXiv …, 2021 - arxiv.org
Offline reinforcement learning enables agents to leverage large pre-collected datasets of
environment transitions to learn control policies, circumventing the need for potentially …

When to update your model: Constrained model-based reinforcement learning

T Ji, Y Luo, F Sun, M Jing, F He… - Advances in Neural …, 2022 - proceedings.neurips.cc
Designing and analyzing model-based RL (MBRL) algorithms with guaranteed monotonic
improvement has been challenging, mainly due to the interdependence between policy …

The virtues of laziness in model-based rl: A unified objective and algorithms

A Vemula, Y Song, A Singh… - International …, 2023 - proceedings.mlr.press
We propose a novel approach to addressing two fundamental challenges in Model-based
Reinforcement Learning (MBRL): the computational expense of repeatedly finding a good …

Model-based reparameterization policy gradient methods: Theory and practical algorithms

S Zhang, B Liu, Z Wang, T Zhao - Advances in Neural …, 2024 - proceedings.neurips.cc
Abstract ReParameterization (RP) Policy Gradient Methods (PGMs) have been widely
adopted for continuous control tasks in robotics and computer graphics. However, recent …

Model-based uncertainty in value functions

CE Luis, AG Bottero, J Vinogradska… - International …, 2023 - proceedings.mlr.press
We consider the problem of quantifying uncertainty over expected cumulative rewards in
model-based reinforcement learning. In particular, we focus on characterizing the variance …

An experimental design perspective on model-based reinforcement learning

V Mehta, B Paria, J Schneider, S Ermon… - arXiv preprint arXiv …, 2021 - arxiv.org
In many practical applications of RL, it is expensive to observe state transitions from the
environment. For example, in the problem of plasma control for nuclear fusion, computing …

Value gradient weighted model-based reinforcement learning

C Voelcker, V Liao, A Garg, A Farahmand - arXiv preprint arXiv …, 2022 - arxiv.org
Model-based reinforcement learning (MBRL) is a sample efficient technique to obtain control
policies, yet unavoidable modeling errors often lead performance deterioration. The model …

Exploration via planning for information about the optimal trajectory

V Mehta, I Char, J Abbate, R Conlin… - Advances in …, 2022 - proceedings.neurips.cc
Many potential applications of reinforcement learning (RL) are stymied by the large numbers
of samples required to learn an effective policy. This is especially true when applying RL to …

Sim-to-real model-based and model-free deep reinforcement learning for tactile pushing

M Yang, Y Lin, A Church, J Lloyd… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org
Object pushing presents a key non-prehensile manipulation problem that is illustrative of
more complex robotic manipulation tasks. While deep reinforcement learning (RL) methods …

Inferring smooth control: Monte carlo posterior policy iteration with gaussian processes

J Watson, J Peters - Conference on Robot Learning, 2023 - proceedings.mlr.press
Monte Carlo methods have become increasingly relevant for control of non-differentiable
systems, approximate dynamics models, and learning from data. These methods scale to …