Mbrl-lib: A modular library for model-based reinforcement learning

C Lu, PJ Ball, J Parker-Holder, MA Osborne… - arXiv preprint arXiv …, 2021 - arxiv.org

Offline reinforcement learning enables agents to leverage large pre-collected datasets of
environment transitions to learn control policies, circumventing the need for potentially …

被引用次数：58 相关文章所有 5 个版本

[PDF] neurips.cc

When to update your model: Constrained model-based reinforcement learning

T Ji, Y Luo, F Sun, M Jing, F He… - Advances in Neural …, 2022 - proceedings.neurips.cc

Designing and analyzing model-based RL (MBRL) algorithms with guaranteed monotonic
improvement has been challenging, mainly due to the interdependence between policy …

被引用次数：14 相关文章所有 6 个版本

[PDF] mlr.press

The virtues of laziness in model-based rl: A unified objective and algorithms

A Vemula, Y Song, A Singh… - International …, 2023 - proceedings.mlr.press

We propose a novel approach to addressing two fundamental challenges in Model-based
Reinforcement Learning (MBRL): the computational expense of repeatedly finding a good …

被引用次数：11 相关文章所有 6 个版本

[PDF] neurips.cc

Model-based reparameterization policy gradient methods: Theory and practical algorithms

S Zhang, B Liu, Z Wang, T Zhao - Advances in Neural …, 2024 - proceedings.neurips.cc

Abstract ReParameterization (RP) Policy Gradient Methods (PGMs) have been widely
adopted for continuous control tasks in robotics and computer graphics. However, recent …

被引用次数：2 相关文章所有 5 个版本

[PDF] mlr.press

Model-based uncertainty in value functions

CE Luis, AG Bottero, J Vinogradska… - International …, 2023 - proceedings.mlr.press

We consider the problem of quantifying uncertainty over expected cumulative rewards in
model-based reinforcement learning. In particular, we focus on characterizing the variance …

被引用次数：10 相关文章所有 3 个版本

[PDF] arxiv.org

An experimental design perspective on model-based reinforcement learning

V Mehta, B Paria, J Schneider, S Ermon… - arXiv preprint arXiv …, 2021 - arxiv.org

In many practical applications of RL, it is expensive to observe state transitions from the
environment. For example, in the problem of plasma control for nuclear fusion, computing …

被引用次数：34 相关文章所有 3 个版本

[PDF] arxiv.org

Value gradient weighted model-based reinforcement learning

C Voelcker, V Liao, A Garg, A Farahmand - arXiv preprint arXiv …, 2022 - arxiv.org

Model-based reinforcement learning (MBRL) is a sample efficient technique to obtain control
policies, yet unavoidable modeling errors often lead performance deterioration. The model …

被引用次数：33 相关文章所有 5 个版本

[PDF] neurips.cc

Exploration via planning for information about the optimal trajectory

V Mehta, I Char, J Abbate, R Conlin… - Advances in …, 2022 - proceedings.neurips.cc

Many potential applications of reinforcement learning (RL) are stymied by the large numbers
of samples required to learn an effective policy. This is especially true when applying RL to …

被引用次数：11 相关文章所有 5 个版本

[PDF] arxiv.org

Sim-to-real model-based and model-free deep reinforcement learning for tactile pushing

M Yang, Y Lin, A Church, J Lloyd… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org

Object pushing presents a key non-prehensile manipulation problem that is illustrative of
more complex robotic manipulation tasks. While deep reinforcement learning (RL) methods …

被引用次数：10 相关文章所有 5 个版本

[PDF] mlr.press

Inferring smooth control: Monte carlo posterior policy iteration with gaussian processes

J Watson, J Peters - Conference on Robot Learning, 2023 - proceedings.mlr.press

Monte Carlo methods have become increasingly relevant for control of non-differentiable
systems, approximate dynamics models, and learning from data. These methods scale to …

被引用次数：10 相关文章所有 6 个版本