Exploration in deep reinforcement learning: A survey
This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …
techniques are of primary importance when solving sparse reward problems. In sparse …
How to train your robot with deep reinforcement learning: lessons we have learned
Deep reinforcement learning (RL) has emerged as a promising approach for autonomously
acquiring complex behaviors from low-level sensor observations. Although a large portion of …
acquiring complex behaviors from low-level sensor observations. Although a large portion of …
A minimalist approach to offline reinforcement learning
S Fujimoto, SS Gu - Advances in neural information …, 2021 - proceedings.neurips.cc
Offline reinforcement learning (RL) defines the task of learning from a fixed batch of data.
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …
Due to errors in value estimation from out-of-distribution actions, most offline RL algorithms …
Efficient online reinforcement learning with offline data
Sample efficiency and exploration remain major challenges in online reinforcement learning
(RL). A powerful approach that can be applied to address these issues is the inclusion of …
(RL). A powerful approach that can be applied to address these issues is the inclusion of …
Cal-ql: Calibrated offline rl pre-training for efficient online fine-tuning
A compelling use case of offline reinforcement learning (RL) is to obtain a policy initialization
from existing datasets followed by fast online fine-tuning with limited interaction. However …
from existing datasets followed by fast online fine-tuning with limited interaction. However …
Transfer learning in deep reinforcement learning: A survey
Reinforcement learning is a learning paradigm for solving sequential decision-making
problems. Recent years have witnessed remarkable progress in reinforcement learning …
problems. Recent years have witnessed remarkable progress in reinforcement learning …
Conservative q-learning for offline reinforcement learning
Effectively leveraging large, previously collected datasets in reinforcement learn-ing (RL) is
a key challenge for large-scale real-world applications. Offline RL algorithms promise to …
a key challenge for large-scale real-world applications. Offline RL algorithms promise to …
Awac: Accelerating online reinforcement learning with offline datasets
Reinforcement learning (RL) provides an appealing formalism for learning control policies
from experience. However, the classic active formulation of RL necessitates a lengthy active …
from experience. However, the classic active formulation of RL necessitates a lengthy active …
Recent advances in robot learning from demonstration
H Ravichandar, AS Polydoros… - Annual review of …, 2020 - annualreviews.org
In the context of robotics and automation, learning from demonstration (LfD) is the paradigm
in which robots acquire new skills by learning to imitate an expert. The choice of LfD over …
in which robots acquire new skills by learning to imitate an expert. The choice of LfD over …
Off-policy deep reinforcement learning without exploration
Many practical applications of reinforcement learning constrain agents to learn from a fixed
batch of data which has already been gathered, without offering further possibility for data …
batch of data which has already been gathered, without offering further possibility for data …