Approximating nash equilibria in normal-form games via stochastic optimization
We propose the first, to our knowledge, loss function for approximate Nash equilibria of
normal-form games that is amenable to unbiased Monte Carlo estimation. This construction …
normal-form games that is amenable to unbiased Monte Carlo estimation. This construction …
Exploitability minimization in games and beyond
D Goktas, A Greenwald - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Pseudo-games are a natural and well-known generalization of normal-form games, in which
the actions taken by each player affect not only the other players' payoffs, as in games, but …
the actions taken by each player affect not only the other players' payoffs, as in games, but …
Beyond Strict Competition: Approximate Convergence of Multi Agent Q-Learning Dynamics
The behaviour of multi-agent learning in competitive settings is often considered under the
restrictive assumption of a zero-sum game. Only under this strict requirement is the …
restrictive assumption of a zero-sum game. Only under this strict requirement is the …
Game Theoretic Rating in N-player general-sum games with Equilibria
Rating strategies in a game is an important area of research in game theory and artificial
intelligence, and can be applied to any real-world competitive or cooperative setting …
intelligence, and can be applied to any real-world competitive or cooperative setting …
The impact of exploration on convergence and performance of multi-agent Q-learning dynamics
A Hussain, F Belardinelli… - … Conference on Machine …, 2023 - proceedings.mlr.press
Understanding the impact of exploration on the behaviour of multi-agent learning has, so far,
benefited from the restriction to potential, or network zero-sum games in which convergence …
benefited from the restriction to potential, or network zero-sum games in which convergence …
Policy Space Response Oracles: A Survey
In game theory, a game refers to a model of interaction among rational decision-makers or
players, making choices with the goal of achieving their individual objectives. Understanding …
players, making choices with the goal of achieving their individual objectives. Understanding …
Data structures for deviation payoffs
B Wiedenbeck, E Brinkman - arXiv preprint arXiv:2302.13232, 2023 - arxiv.org
We present new data structures for representing symmetric normal-form games. These data
structures are optimized for efficiently computing the expected utility of each unilateral pure …
structures are optimized for efficiently computing the expected utility of each unilateral pure …
Reinforcement Nash Equilibrium Solver
Nash Equilibrium (NE) is the canonical solution concept of game theory, which provides an
elegant tool to understand the rationalities. Though mixed strategy NE exists in any game …
elegant tool to understand the rationalities. Though mixed strategy NE exists in any game …
On the Stability of Learning in Network Games with Many Players
Multi-agent learning algorithms have been shown to display complex, unstable behaviours
in a wide array of games. In fact, previous works indicate that convergent behaviours are …
in a wide array of games. In fact, previous works indicate that convergent behaviours are …
Developing, evaluating and scaling learning agents in multi-agent environments
Abstract The Game Theory & Multi-Agent team at DeepMind studies several aspects of multi-
agent learning ranging from computing approximations to fundamental concepts in game …
agent learning ranging from computing approximations to fundamental concepts in game …