Self-play Monte-Carlo tree search in computer poker

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer

Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

被引用次数：1332 相关文章所有 8 个版本

[PDF] zju.edu.cn

Self-play reinforcement learning with comprehensive critic in computer games

S Liu, J Cao, Y Wang, W Chen, Y Liu - Neurocomputing, 2021 - Elsevier

Self-play reinforcement learning, where agents learn by playing with themselves, has been
successfully applied in many game scenarios. However, the training procedure for self-play …

被引用次数：27 相关文章所有 3 个版本

[PDF] arxiv.org

Enhanced rolling horizon evolution algorithm with opponent model learning: Results for the fighting game AI competition

Z Tang, Y Zhu, D Zhao, SM Lucas - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

The Fighting Game AI Competition (FTGAIC) provides a challenging benchmark for two-
player video game artificial intelligence. The challenge arises from the large action space …

被引用次数：37 相关文章所有 4 个版本

[PDF] mlanctot.info

[PDF][PDF] Online Monte Carlo Counterfactual Regret Minimization for Search in Imperfect Information Games.

V Lisý, M Lanctot, MH Bowling - AAMAS, 2015 - mlanctot.info

Online search in games has been a core interest of artificial intelligence. Search in imperfect
information games (eg, Poker, Bridge, Skat) is particularly challenging due to the …

被引用次数：75 相关文章所有 9 个版本

Ensemble strategy learning for imperfect information games

W Yuan, S Chen, P Li, J Chen - Neurocomputing, 2023 - Elsevier

Algorithms with several paradigms (such as rule-based methods, game theory and
reinforcement learning) have achieved great success in solving imperfect information games …

被引用次数：3 相关文章所有 2 个版本

[PDF] davidsilver.uk

[PDF][PDF] Smooth UCT Search in Computer Poker.

J Heinrich, D Silver - IJCAI, 2015 - davidsilver.uk

Abstract Self-play Monte Carlo Tree Search (MCTS) has been successful in many perfect-
information twoplayer games. Although these methods have been extended to imperfect …

被引用次数：35 相关文章所有 5 个版本

[图书][B] The Governance cycle in parliamentary democracies

S De Marchi, M Laver - 2023 - books.google.com

Parliamentary democracy involves a never-ending cycle of elections, government
formations, and the need for governments to survive in potentially hostile environments …

被引用次数：4 相关文章所有 5 个版本

[PDF] aaai.org

Don't Predict Counterfactual Values, Predict Expected Values Instead

J Wołosiuk, M Świechowski, J Mańdziuk - Proceedings of the AAAI …, 2023 - ojs.aaai.org

Abstract Counterfactual Regret Minimization algorithms are the most popular way of
estimating the Nash Equilibrium in imperfect-information zero-sum games. In particular …

Monte Carlo tree search for games with hidden information and uncertainty

D Whitehouse - 2014 - etheses.whiterose.ac.uk

Monte Carlo Tree Search (MCTS) is an AI technique that has been successfully applied to
many deterministic games of perfect information, leading to large advances in a number of …

被引用次数：23 相关文章所有 3 个版本

[PDF] orangehelicopter.com

Emergent bluffing and inference with Monte Carlo tree search

PI Cowling, D Whitehouse… - 2015 IEEE conference on …, 2015 - ieeexplore.ieee.org

In many card and board games, players cannot see the whole game state, with different
players seeing different parts of the state. In such games, gathering of information …

被引用次数：15 相关文章所有 4 个版本