An information-theoretic perspective on intrinsic motivation in reinforcement learning: A survey

A Aubret, L Matignon, S Hassas - Entropy, 2023 - mdpi.com
The reinforcement learning (RL) research area is very active, with an important number of
new contributions, especially considering the emergent field of deep RL (DRL). However, a …

Evolutionary reinforcement learning: A survey

H Bai, R Cheng, Y Jin - Intelligent Computing, 2023 - spj.science.org
Reinforcement learning (RL) is a machine learning approach that trains agents to maximize
cumulative rewards through interactions with environments. The integration of RL with deep …

Behavior from the void: Unsupervised active pre-training

H Liu, P Abbeel - Advances in Neural Information …, 2021 - proceedings.neurips.cc
We introduce a new unsupervised pre-training method for reinforcement learning called
APT, which stands for Active Pre-Training. APT learns behaviors and representations by …

State entropy maximization with random encoders for efficient exploration

Y Seo, L Chen, J Shin, H Lee… - … on Machine Learning, 2021 - proceedings.mlr.press
Recent exploration methods have proven to be a recipe for improving sample-efficiency in
deep reinforcement learning (RL). However, efficient exploration in high-dimensional …

Exploration in deep reinforcement learning: From single-agent to multiagent domain

J Hao, T Yang, H Tang, C Bai, J Liu… - … on Neural Networks …, 2023 - ieeexplore.ieee.org
Deep reinforcement learning (DRL) and deep multiagent reinforcement learning (MARL)
have achieved significant success across a wide range of domains, including game artificial …

Deep reinforcement learning versus evolution strategies: A comparative survey

AY Majid, S Saaybi, V Francois-Lavet… - … on Neural Networks …, 2023 - ieeexplore.ieee.org
Deep reinforcement learning (DRL) and evolution strategies (ESs) have surpassed human-
level control in many sequential decision-making problems, yet many open challenges still …

Accelerating reinforcement learning with value-conditional state entropy exploration

D Kim, J Shin, P Abbeel, Y Seo - Advances in Neural …, 2024 - proceedings.neurips.cc
A promising technique for exploration is to maximize the entropy of visited state distribution,
ie, state entropy, by encouraging uniform coverage of visited state space. While it has been …

Cem: Constrained entropy maximization for task-agnostic safe exploration

Q Yang, MTJ Spaan - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
In the absence of assigned tasks, a learning agent typically seeks to explore its environment
efficiently. However, the pursuit of exploration will bring more safety risks. An under-explored …

Dynamic bottleneck for robust self-supervised exploration

C Bai, L Wang, L Han, A Garg, J Hao… - Advances in Neural …, 2021 - proceedings.neurips.cc
Exploration methods based on pseudo-count of transitions or curiosity of dynamics have
achieved promising results in solving reinforcement learning with sparse rewards. However …

Curiosity-driven exploration via latent bayesian surprise

P Mazzaglia, O Catal, T Verbelen… - Proceedings of the AAAI …, 2022 - ojs.aaai.org
The human intrinsic desire to pursue knowledge, also known as curiosity, is considered
essential in the process of skill acquisition. With the aid of artificial curiosity, we could equip …