Exploration in deep reinforcement learning: A survey

P Ladosz, L Weng, M Kim, H Oh - Information Fusion, 2022 - Elsevier
This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …

A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications

W Du, S Ding - Artificial Intelligence Review, 2021 - Springer
Deep reinforcement learning has proved to be a fruitful method in various tasks in the field of
artificial intelligence during the last several years. Recent works have focused on deep …

A survey and critique of multiagent deep reinforcement learning

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer
Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

[HTML][HTML] A survey on multi-agent reinforcement learning and its application

Z Ning, L Xie - Journal of Automation and Intelligence, 2024 - Elsevier
Multi-agent reinforcement learning (MARL) has been a rapidly evolving field. This paper
presents a comprehensive survey of MARL and its applications. We trace the historical …

Bombalytics: Visualization of competition and collaboration strategies of players in a bomb laying game

S Agarwal, G Wallner, F Beck - Computer Graphics Forum, 2020 - Wiley Online Library
Competition and collaboration form complex interaction patterns between the agents and
objects involved. Only by understanding these interaction patterns, we can reveal the …

Model-based reinforcement learning for time-optimal velocity control

G Hartmann, Z Shiller, A Azaria - IEEE Robotics and …, 2020 - ieeexplore.ieee.org
Autonomous navigation has recently gained great interest in the field of reinforcement
learning. However, little attention was given to the time-optimal velocity control problem, ie …

Multi-agent advisor Q-learning

SG Subramanian, ME Taylor, K Larson… - Journal of Artificial …, 2022 - jair.org
In the last decade, there have been significant advances in multi-agent reinforcement
learning (MARL) but there are still numerous challenges, such as high sample complexity …

Multi-Agent Training for Pommerman: Curriculum Learning and Population-based Self-Play Approach

NM Huynh, HG Cao, I Wu - arXiv preprint arXiv:2407.00662, 2024 - arxiv.org
Pommerman is a multi-agent environment that has received considerable attention from
researchers in recent years. This environment is an ideal benchmark for multi-agent training …

Search and Learning Algorithms for Two-Player Games with Application to the Game of Hex

C Gao - 2020 - era.library.ualberta.ca
Two-Player alternate-turn perfect-information zero-sum games have been suggested as a
testbed for Artificial Intelligence research since Shannon in 1950s. In this thesis, we …

Know your Enemy: Investigating Monte-Carlo Tree Search with Opponent Models in Pommerman

J Weil, J Czech, T Meuser, K Kersting - arXiv preprint arXiv:2305.13206, 2023 - arxiv.org
In combination with Reinforcement Learning, Monte-Carlo Tree Search has shown to
outperform human grandmasters in games such as Chess, Shogi and Go with little to no …