Exploration in deep reinforcement learning: A survey
This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …
techniques are of primary importance when solving sparse reward problems. In sparse …
[HTML][HTML] A review of uncertainty quantification in deep learning: Techniques, applications and challenges
Uncertainty quantification (UQ) methods play a pivotal role in reducing the impact of
uncertainties during both optimization and decision making processes. They have been …
uncertainties during both optimization and decision making processes. They have been …
Offline reinforcement learning: Tutorial, review, and perspectives on open problems
In this tutorial article, we aim to provide the reader with the conceptual tools needed to get
started on research on offline reinforcement learning algorithms: reinforcement learning …
started on research on offline reinforcement learning algorithms: reinforcement learning …
Off-policy deep reinforcement learning without exploration
Many practical applications of reinforcement learning constrain agents to learn from a fixed
batch of data which has already been gathered, without offering further possibility for data …
batch of data which has already been gathered, without offering further possibility for data …
Exploration by random network distillation
We introduce an exploration bonus for deep reinforcement learning methods that is easy to
implement and adds minimal overhead to the computation performed. The bonus is the error …
implement and adds minimal overhead to the computation performed. The bonus is the error …
[HTML][HTML] First return, then explore
Reinforcement learning promises to solve complex sequential-decision problems
autonomously by specifying a high-level reward function only. However, reinforcement …
autonomously by specifying a high-level reward function only. However, reinforcement …
Addressing function approximation error in actor-critic methods
In value-based reinforcement learning methods such as deep Q-learning, function
approximation errors are known to lead to overestimated value estimates and suboptimal …
approximation errors are known to lead to overestimated value estimates and suboptimal …
Robust reinforcement learning: A review of foundations and recent advances
Reinforcement learning (RL) has become a highly successful framework for learning in
Markov decision processes (MDP). Due to the adoption of RL in realistic and complex …
Markov decision processes (MDP). Due to the adoption of RL in realistic and complex …
Pessimistic bootstrapping for uncertainty-driven offline reinforcement learning
Offline Reinforcement Learning (RL) aims to learn policies from previously collected
datasets without exploring the environment. Directly applying off-policy algorithms to offline …
datasets without exploring the environment. Directly applying off-policy algorithms to offline …
Go-explore: a new approach for hard-exploration problems
A grand challenge in reinforcement learning is intelligent exploration, especially when
rewards are sparse or deceptive. Two Atari games serve as benchmarks for such hard …
rewards are sparse or deceptive. Two Atari games serve as benchmarks for such hard …