On hard exploration for reinforcement learning: A case study in pommerman

P Ladosz, L Weng, M Kim, H Oh - Information Fusion, 2022 - Elsevier

This paper reviews exploration techniques in deep reinforcement learning. Exploration
techniques are of primary importance when solving sparse reward problems. In sparse …

被引用次数：203 相关文章所有 5 个版本

A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications

W Du, S Ding - Artificial Intelligence Review, 2021 - Springer

Deep reinforcement learning has proved to be a fruitful method in various tasks in the field of
artificial intelligence during the last several years. Recent works have focused on deep …

被引用次数：171 相关文章所有 4 个版本

[PDF] arxiv.org

A survey and critique of multiagent deep reinforcement learning

P Hernandez-Leal, B Kartal, ME Taylor - Autonomous Agents and Multi …, 2019 - Springer

Deep reinforcement learning (RL) has achieved outstanding results in recent years. This has
led to a dramatic increase in the number of applications and methods. Recent works have …

被引用次数：618 相关文章所有 8 个版本

[HTML] sciencedirect.com

[HTML][HTML] A survey on multi-agent reinforcement learning and its application

Z Ning, L Xie - Journal of Automation and Intelligence, 2024 - Elsevier

Multi-agent reinforcement learning (MARL) has been a rapidly evolving field. This paper
presents a comprehensive survey of MARL and its applications. We trace the historical …

被引用次数：2 相关文章

[PDF] wiley.com

Bombalytics: Visualization of competition and collaboration strategies of players in a bomb laying game

S Agarwal, G Wallner, F Beck - Computer Graphics Forum, 2020 - Wiley Online Library

Competition and collaboration form complex interaction patterns between the agents and
objects involved. Only by understanding these interaction patterns, we can reveal the …

被引用次数：17 相关文章所有 9 个版本

[PDF] academia.edu

Model-based reinforcement learning for time-optimal velocity control

G Hartmann, Z Shiller, A Azaria - IEEE Robotics and …, 2020 - ieeexplore.ieee.org

Autonomous navigation has recently gained great interest in the field of reinforcement
learning. However, little attention was given to the time-optimal velocity control problem, ie …

被引用次数：15 相关文章所有 4 个版本

[PDF] jair.org Full View

Multi-agent advisor Q-learning

SG Subramanian, ME Taylor, K Larson… - Journal of Artificial …, 2022 - jair.org

In the last decade, there have been significant advances in multi-agent reinforcement
learning (MARL) but there are still numerous challenges, such as high sample complexity …

被引用次数：11 相关文章所有 12 个版本

[PDF] arxiv.org

Multi-Agent Training for Pommerman: Curriculum Learning and Population-based Self-Play Approach

NM Huynh, HG Cao, I Wu - arXiv preprint arXiv:2407.00662, 2024 - arxiv.org

Pommerman is a multi-agent environment that has received considerable attention from
researchers in recent years. This environment is an ideal benchmark for multi-agent training …

Search and Learning Algorithms for Two-Player Games with Application to the Game of Hex

C Gao - 2020 - era.library.ualberta.ca

Two-Player alternate-turn perfect-information zero-sum games have been suggested as a
testbed for Artificial Intelligence research since Shannon in 1950s. In this thesis, we …

被引用次数：4 相关文章

[PDF] arxiv.org

Know your Enemy: Investigating Monte-Carlo Tree Search with Opponent Models in Pommerman

J Weil, J Czech, T Meuser, K Kersting - arXiv preprint arXiv:2305.13206, 2023 - arxiv.org

In combination with Reinforcement Learning, Monte-Carlo Tree Search has shown to
outperform human grandmasters in games such as Chess, Shogi and Go with little to no …