Combining deep reinforcement learning and search for imperfect-information games

N Brown, A Bakhtin, A Lerer… - Advances in Neural …, 2020 - proceedings.neurips.cc
The combination of deep reinforcement learning and search at both training and test time is
a powerful paradigm that has led to a number of successes in single-agent settings and …

Rethinking formal models of partially observable multiagent decision making

V Kovařík, M Schmid, N Burch, M Bowling, V Lisý - Artificial Intelligence, 2022 - Elsevier
Multiagent decision-making in partially observable environments is usually modelled as
either an extensive-form game (EFG) in game theory or a partially observable stochastic …

[PDF][PDF] Equilibrium finding for large adversarial imperfect-information games

N Brown - PhD thesis, 2020 - reports-archive.adm.cs.cmu.edu
Imperfect-information games model strategic interactions involving multiple agents with
private information. A typical goal in this setting is to approximate an equilibrium in which all …

Solving zero-sum one-sided partially observable stochastic games

K Horák, B Bošanský, V Kovařík, C Kiekintveld - Artificial Intelligence, 2023 - Elsevier
Many real-world situations are dynamic, with long-term interactions between multiple agents
with uncertainty and limited observations. The agents must reason about which actions to …

Search in imperfect information games

M Schmid - arXiv preprint arXiv:2111.05884, 2021 - arxiv.org
From the very dawn of the field, search with value functions was a fundamental concept of
computer games research. Turing's chess algorithm from 1950 was able to think two moves …

Scalable algorithms for solving stochastic games with limited partial observability

K Horák - 2020 - search.proquest.com
Partially observable stochastic games (POSGs) represent a very general class of models
that can be used to reason about sequential decision making in the presence of adversaries …

[PDF][PDF] Particle value functions in imperfect information games

M Šustr, V Kovarík, V Lisy - AAMAS Adaptive and Learning …, 2021 - ala2021.vub.ac.be
Limited look-ahead search with a heuristic value function is a key AI technique in
reinforcement learning and game playing. In perfectinformation problems, this has …

[PDF][PDF] Fast algorithms for poker require modelling it as a sequential Bayesian game

V Kovařík, D Milec, M Šustr, D Seitz… - arXiv preprint ArXiv …, 2021 - rlg.mlanctot.info
Many recent results in imperfect information games were only formulated for, or evaluated
on, poker-like games. We argue that sequential Bayesian games constitute a natural class of …

Sound algorithms in imperfect information games

M Šustr, M Schmid, M Moravčík, N Burch… - arXiv preprint arXiv …, 2020 - arxiv.org
Search has played a fundamental role in computer game research since the very beginning.
And while online search has been commonly used in perfect information games such as …

Revisiting Game Representations: The Hidden Costs of Efficiency in Sequential Decision-making Algorithms

V Kovařík, D Milec, M Šustr, D Seitz, V Lisý - arXiv preprint arXiv …, 2021 - arxiv.org
Recent advancements in algorithms for sequential decision-making under imperfect
information have shown remarkable success in large games such as limit-and no-limit …