Scale-free adaptive planning for deterministic dynamics & discounted rewards

D Shah, Q Xie, Z Xu - Abstracts of the 2020 SIGMETRICS/Performance …, 2020 - dl.acm.org

In this work, we consider the popular tree-based search strategy within the framework of
reinforcement learning, the Monte Carlo Tree Search (MCTS), in the context of infinite …

被引用次数：52 相关文章所有 12 个版本

[PDF] neurips.cc

A provably efficient sample collection strategy for reinforcement learning

J Tarbouriech, M Pirotta, M Valko… - Advances in Neural …, 2021 - proceedings.neurips.cc

One of the challenges in online reinforcement learning (RL) is that the agent needs to trade
off the exploration of the environment and the exploitation of the samples to optimize its …

被引用次数：18 相关文章所有 12 个版本

[PDF] neurips.cc

Planning in entropy-regularized Markov decision processes and games

JB Grill, O Darwiche Domingues… - Advances in …, 2019 - proceedings.neurips.cc

We propose SmoothCruiser, a new planning algorithm for estimating the value function in
entropy-regularized Markov decision processes and two-player games, given a generative …

被引用次数：20 相关文章所有 10 个版本

[PDF] hal.science

Goal-oriented exploration for reinforcement learning

J Tarbouriech - 2022 - theses.hal.science

Learning to reach goals is a competence of high practical relevance to acquire for intelligent
agents. For instance, this encompasses many navigation tasks (" go to target X"), robotic …

被引用次数：3 相关文章所有 5 个版本

[PDF] hal.science

Exploration in Reinforcement Learning: Beyond Finite State-Spaces

OD Domingues - 2022 - theses.hal.science

Reinforcement learning (RL) is a powerful machine learning framework to design algorithms
that learn to make decisions and to interact with the world. Algorithms for RL can be …

sujet2023-recherche-optimisation Algorithmes de recherche pour l'optimisation

B Scherrer, EBSF Colas, C Bureau - team.inria.fr

Ce sujet concerne l'optimisation de l'utilisation des ressources numériques telles que la
mémoire et le temps CPU pour améliorer les performances liées à la résolution de …

[PDF] mit.edu

Data Efficient Reinforcement Learning

Z Xu - 2021 - dspace.mit.edu

Reinforcement learning (RL) has recently emerged as a generic yet powerful solution for
learning complex decision-making policies, providing the key foundational underpinnings of …