The UCT algorithm applied to games with imperfect information

CB Browne, E Powley, D Whitehouse… - … Intelligence and AI …, 2012 - ieeexplore.ieee.org

Monte Carlo tree search (MCTS) is a recently proposed search method that combines the
precision of tree search with the generality of random sampling. It has received considerable …

被引用次数：3839 相关文章所有 55 个版本

[PDF] sciencedirect.com

Monte-Carlo tree search and rapid action value estimation in computer Go

S Gelly, D Silver - Artificial Intelligence, 2011 - Elsevier

A new paradigm for search, based on Monte-Carlo simulation, has revolutionised the
performance of computer Go programs. In this article we describe two extensions to the …

被引用次数：505 相关文章所有 19 个版本

[PDF] whiterose.ac.uk

Information set monte carlo tree search

PI Cowling, EJ Powley… - IEEE Transactions on …, 2012 - ieeexplore.ieee.org

Monte Carlo tree search (MCTS) is an AI technique that has been successfully applied to
many deterministic games of perfect information. This paper investigates the application of …

被引用次数：238 相关文章所有 12 个版本

[PDF] jair.org

Learning to win by reading manuals in a monte-carlo framework

SRK Branavan, D Silver, R Barzilay - Journal of Artificial Intelligence …, 2012 - jair.org

Abstract Domain knowledge is crucial for effective performance in autonomous control
systems. Typically, human effort is required to encode this knowledge into a control …

被引用次数：200 相关文章所有 19 个版本

[PDF] springer.com

Temporal-difference search in computer Go

D Silver, RS Sutton, M Müller - Machine learning, 2012 - Springer

Temporal-difference learning is one of the most successful and broadly applied solutions to
the reinforcement learning problem; it has been used to achieve master-level play in chess …

被引用次数：144 相关文章所有 27 个版本

[PDF] ijcai.org

[PDF][PDF] Improving state evaluation, inference, and search in trick-based card games

M Buro, JR Long, T Furtak, N Sturtevant - Twenty-First International Joint …, 2009 - ijcai.org

Skat is Germany's national card game played by millions of players around the world. In this
paper, we present the world's first computer skat player that plays at the level of human …

被引用次数：102 相关文章所有 14 个版本

[PDF] ualberta.ca

Reinforcement learning and simulation-based search in computer Go

D Silver - 2009 - era.library.ualberta.ca

Learning and planning are two fundamental problems in artificial intelligence. The learning
problem can be tackled by reinforcement learning methods, such as temporal-difference …

被引用次数：104 相关文章所有 7 个版本

[PDF] arxiv.org

Watch the unobserved: A simple approach to parallelizing monte carlo tree search

A Liu, J Chen, M Yu, Y Zhai, X Zhou, J Liu - arXiv preprint arXiv …, 2018 - arxiv.org

Monte Carlo Tree Search (MCTS) algorithms have achieved great success on many
challenging benchmarks (eg, Computer Go). However, they generally require a large …

被引用次数：31 相关文章所有 5 个版本

[PDF] ualberta.ca

[图书][B] Monte Carlo sampling and regret minimization for equilibrium computation and decision-making in large extensive form games

M Lanctot - 2013 - search.proquest.com

In this thesis, we investigate the problem of decision-making in large two-player zero-sum
games using Monte Carlo sampling and regret minimization methods. We demonstrate four …

被引用次数：56 相关文章所有 11 个版本

[PDF] psu.edu

Recursive Monte Carlo search for imperfect information games

T Furtak, M Buro - … on Computational Inteligence in Games (CIG …, 2013 - ieeexplore.ieee.org

Perfect information Monte Carlo (PIMC) search is the method of choice for constructing
strong Al systems for trick-taking card games. PIMC search evaluates moves in imperfect …

被引用次数：50 相关文章所有 7 个版本