Value functions for depth-limited solving in imperfect-information games beyond poker

N Brown, A Bakhtin, A Lerer… - Advances in Neural …, 2020 - proceedings.neurips.cc

The combination of deep reinforcement learning and search at both training and test time is
a powerful paradigm that has led to a number of successes in single-agent settings and …

被引用次数：165 相关文章所有 10 个版本

[PDF] arxiv.org

Rethinking formal models of partially observable multiagent decision making

V Kovařík, M Schmid, N Burch, M Bowling, V Lisý - Artificial Intelligence, 2022 - Elsevier

Multiagent decision-making in partially observable environments is usually modelled as
either an extensive-form game (EFG) in game theory or a partially observable stochastic …

被引用次数：87 相关文章所有 7 个版本

[PDF] cmu.edu

[PDF][PDF] Equilibrium finding for large adversarial imperfect-information games

N Brown - PhD thesis, 2020 - reports-archive.adm.cs.cmu.edu

Imperfect-information games model strategic interactions involving multiple agents with
private information. A typical goal in this setting is to approximate an equilibrium in which all …

被引用次数：28 相关文章所有 2 个版本

[PDF] arxiv.org

Solving zero-sum one-sided partially observable stochastic games

K Horák, B Bošanský, V Kovařík, C Kiekintveld - Artificial Intelligence, 2023 - Elsevier

Many real-world situations are dynamic, with long-term interactions between multiple agents
with uncertainty and limited observations. The agents must reason about which actions to …

被引用次数：25 相关文章所有 5 个版本

[PDF] arxiv.org

Search in imperfect information games

M Schmid - arXiv preprint arXiv:2111.05884, 2021 - arxiv.org

From the very dawn of the field, search with value functions was a fundamental concept of
computer games research. Turing's chess algorithm from 1950 was able to think two moves …

被引用次数：14 相关文章所有 4 个版本

Scalable algorithms for solving stochastic games with limited partial observability

K Horák - 2020 - search.proquest.com

Partially observable stochastic games (POSGs) represent a very general class of models
that can be used to reason about sequential decision making in the presence of adversaries …

被引用次数：14 相关文章

[PDF] vub.ac.be

[PDF][PDF] Particle value functions in imperfect information games

M Šustr, V Kovarík, V Lisy - AAMAS Adaptive and Learning …, 2021 - ala2021.vub.ac.be

Limited look-ahead search with a heuristic value function is a key AI technique in
reinforcement learning and game playing. In perfectinformation problems, this has …

被引用次数：4 相关文章所有 2 个版本

[PDF] mlanctot.info

[PDF][PDF] Fast algorithms for poker require modelling it as a sequential Bayesian game

V Kovařík, D Milec, M Šustr, D Seitz… - arXiv preprint ArXiv …, 2021 - rlg.mlanctot.info

Many recent results in imperfect information games were only formulated for, or evaluated
on, poker-like games. We argue that sequential Bayesian games constitute a natural class of …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Sound algorithms in imperfect information games

M Šustr, M Schmid, M Moravčík, N Burch… - arXiv preprint arXiv …, 2020 - arxiv.org

Search has played a fundamental role in computer game research since the very beginning.
And while online search has been commonly used in perfect information games such as …

被引用次数：4 相关文章所有 8 个版本

[PDF] arxiv.org

Revisiting Game Representations: The Hidden Costs of Efficiency in Sequential Decision-making Algorithms

V Kovařík, D Milec, M Šustr, D Seitz, V Lisý - arXiv preprint arXiv …, 2021 - arxiv.org

Recent advancements in algorithms for sequential decision-making under imperfect
information have shown remarkable success in large games such as limit-and no-limit …