Novelty search in representational space for sample efficient exploration

A Aubret, L Matignon, S Hassas - Entropy, 2023 - mdpi.com

The reinforcement learning (RL) research area is very active, with an important number of
new contributions, especially considering the emergent field of deep RL (DRL). However, a …

被引用次数：27 相关文章所有 10 个版本

[PDF] science.org

Evolutionary reinforcement learning: A survey

H Bai, R Cheng, Y Jin - Intelligent Computing, 2023 - spj.science.org

Reinforcement learning (RL) is a machine learning approach that trains agents to maximize
cumulative rewards through interactions with environments. The integration of RL with deep …

被引用次数：29 相关文章所有 4 个版本

[PDF] neurips.cc

Behavior from the void: Unsupervised active pre-training

H Liu, P Abbeel - Advances in Neural Information …, 2021 - proceedings.neurips.cc

We introduce a new unsupervised pre-training method for reinforcement learning called
APT, which stands for Active Pre-Training. APT learns behaviors and representations by …

被引用次数：184 相关文章所有 7 个版本

[PDF] mlr.press

State entropy maximization with random encoders for efficient exploration

Y Seo, L Chen, J Shin, H Lee… - … on Machine Learning, 2021 - proceedings.mlr.press

Recent exploration methods have proven to be a recipe for improving sample-efficiency in
deep reinforcement learning (RL). However, efficient exploration in high-dimensional …

被引用次数：119 相关文章所有 8 个版本

[PDF] arxiv.org

Exploration in deep reinforcement learning: From single-agent to multiagent domain

J Hao, T Yang, H Tang, C Bai, J Liu… - … on Neural Networks …, 2023 - ieeexplore.ieee.org

Deep reinforcement learning (DRL) and deep multiagent reinforcement learning (MARL)
have achieved significant success across a wide range of domains, including game artificial …

被引用次数：69 相关文章所有 7 个版本

[PDF] ieee.org

Deep reinforcement learning versus evolution strategies: A comparative survey

AY Majid, S Saaybi, V Francois-Lavet… - … on Neural Networks …, 2023 - ieeexplore.ieee.org

Deep reinforcement learning (DRL) and evolution strategies (ESs) have surpassed human-
level control in many sequential decision-making problems, yet many open challenges still …

被引用次数：46 相关文章所有 14 个版本

[PDF] neurips.cc

Accelerating reinforcement learning with value-conditional state entropy exploration

D Kim, J Shin, P Abbeel, Y Seo - Advances in Neural …, 2024 - proceedings.neurips.cc

A promising technique for exploration is to maximize the entropy of visited state distribution,
ie, state entropy, by encouraging uniform coverage of visited state space. While it has been …

被引用次数：5 相关文章所有 5 个版本

[PDF] aaai.org

Cem: Constrained entropy maximization for task-agnostic safe exploration

Q Yang, MTJ Spaan - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org

In the absence of assigned tasks, a learning agent typically seeks to explore its environment
efficiently. However, the pursuit of exploration will bring more safety risks. An under-explored …

被引用次数：12 相关文章所有 7 个版本

[PDF] neurips.cc

Dynamic bottleneck for robust self-supervised exploration

C Bai, L Wang, L Han, A Garg, J Hao… - Advances in Neural …, 2021 - proceedings.neurips.cc

Exploration methods based on pseudo-count of transitions or curiosity of dynamics have
achieved promising results in solving reinforcement learning with sparse rewards. However …

被引用次数：23 相关文章所有 8 个版本

[PDF] aaai.org

Curiosity-driven exploration via latent bayesian surprise

P Mazzaglia, O Catal, T Verbelen… - Proceedings of the AAAI …, 2022 - ojs.aaai.org

The human intrinsic desire to pursue knowledge, also known as curiosity, is considered
essential in the process of skill acquisition. With the aid of artificial curiosity, we could equip …

被引用次数：25 相关文章所有 7 个版本