Combining deep reinforcement learning and search for imperfect-information games
The combination of deep reinforcement learning and search at both training and test time is
a powerful paradigm that has led to a number of successes in single-agent settings and …
a powerful paradigm that has led to a number of successes in single-agent settings and …
Rethinking formal models of partially observable multiagent decision making
Multiagent decision-making in partially observable environments is usually modelled as
either an extensive-form game (EFG) in game theory or a partially observable stochastic …
either an extensive-form game (EFG) in game theory or a partially observable stochastic …
[PDF][PDF] Equilibrium finding for large adversarial imperfect-information games
N Brown - PhD thesis, 2020 - reports-archive.adm.cs.cmu.edu
Imperfect-information games model strategic interactions involving multiple agents with
private information. A typical goal in this setting is to approximate an equilibrium in which all …
private information. A typical goal in this setting is to approximate an equilibrium in which all …
Solving zero-sum one-sided partially observable stochastic games
Many real-world situations are dynamic, with long-term interactions between multiple agents
with uncertainty and limited observations. The agents must reason about which actions to …
with uncertainty and limited observations. The agents must reason about which actions to …
Search in imperfect information games
M Schmid - arXiv preprint arXiv:2111.05884, 2021 - arxiv.org
From the very dawn of the field, search with value functions was a fundamental concept of
computer games research. Turing's chess algorithm from 1950 was able to think two moves …
computer games research. Turing's chess algorithm from 1950 was able to think two moves …
Scalable algorithms for solving stochastic games with limited partial observability
K Horák - 2020 - search.proquest.com
Partially observable stochastic games (POSGs) represent a very general class of models
that can be used to reason about sequential decision making in the presence of adversaries …
that can be used to reason about sequential decision making in the presence of adversaries …
[PDF][PDF] Particle value functions in imperfect information games
Limited look-ahead search with a heuristic value function is a key AI technique in
reinforcement learning and game playing. In perfectinformation problems, this has …
reinforcement learning and game playing. In perfectinformation problems, this has …
[PDF][PDF] Fast algorithms for poker require modelling it as a sequential Bayesian game
Many recent results in imperfect information games were only formulated for, or evaluated
on, poker-like games. We argue that sequential Bayesian games constitute a natural class of …
on, poker-like games. We argue that sequential Bayesian games constitute a natural class of …
Sound algorithms in imperfect information games
Search has played a fundamental role in computer game research since the very beginning.
And while online search has been commonly used in perfect information games such as …
And while online search has been commonly used in perfect information games such as …
Revisiting Game Representations: The Hidden Costs of Efficiency in Sequential Decision-making Algorithms
Recent advancements in algorithms for sequential decision-making under imperfect
information have shown remarkable success in large games such as limit-and no-limit …
information have shown remarkable success in large games such as limit-and no-limit …