A survey of monte carlo tree search methods

CB Browne, E Powley, D Whitehouse… - … Intelligence and AI …, 2012 - ieeexplore.ieee.org
Monte Carlo tree search (MCTS) is a recently proposed search method that combines the
precision of tree search with the generality of random sampling. It has received considerable …

Monte-Carlo tree search and rapid action value estimation in computer Go

S Gelly, D Silver - Artificial Intelligence, 2011 - Elsevier
A new paradigm for search, based on Monte-Carlo simulation, has revolutionised the
performance of computer Go programs. In this article we describe two extensions to the …

Information set monte carlo tree search

PI Cowling, EJ Powley… - IEEE Transactions on …, 2012 - ieeexplore.ieee.org
Monte Carlo tree search (MCTS) is an AI technique that has been successfully applied to
many deterministic games of perfect information. This paper investigates the application of …

Learning to win by reading manuals in a monte-carlo framework

SRK Branavan, D Silver, R Barzilay - Journal of Artificial Intelligence …, 2012 - jair.org
Abstract Domain knowledge is crucial for effective performance in autonomous control
systems. Typically, human effort is required to encode this knowledge into a control …

Temporal-difference search in computer Go

D Silver, RS Sutton, M Müller - Machine learning, 2012 - Springer
Temporal-difference learning is one of the most successful and broadly applied solutions to
the reinforcement learning problem; it has been used to achieve master-level play in chess …

[PDF][PDF] Improving state evaluation, inference, and search in trick-based card games

M Buro, JR Long, T Furtak, N Sturtevant - Twenty-First International Joint …, 2009 - ijcai.org
Skat is Germany's national card game played by millions of players around the world. In this
paper, we present the world's first computer skat player that plays at the level of human …

Reinforcement learning and simulation-based search in computer Go

D Silver - 2009 - era.library.ualberta.ca
Learning and planning are two fundamental problems in artificial intelligence. The learning
problem can be tackled by reinforcement learning methods, such as temporal-difference …

Watch the unobserved: A simple approach to parallelizing monte carlo tree search

A Liu, J Chen, M Yu, Y Zhai, X Zhou, J Liu - arXiv preprint arXiv …, 2018 - arxiv.org
Monte Carlo Tree Search (MCTS) algorithms have achieved great success on many
challenging benchmarks (eg, Computer Go). However, they generally require a large …

[图书][B] Monte Carlo sampling and regret minimization for equilibrium computation and decision-making in large extensive form games

M Lanctot - 2013 - search.proquest.com
In this thesis, we investigate the problem of decision-making in large two-player zero-sum
games using Monte Carlo sampling and regret minimization methods. We demonstrate four …

Recursive Monte Carlo search for imperfect information games

T Furtak, M Buro - … on Computational Inteligence in Games (CIG …, 2013 - ieeexplore.ieee.org
Perfect information Monte Carlo (PIMC) search is the method of choice for constructing
strong Al systems for trick-taking card games. PIMC search evaluates moves in imperfect …