Approximating nash equilibria in normal-form games via stochastic optimization

I Gemp, L Marris, G Piliouras - arXiv preprint arXiv:2310.06689, 2023 - arxiv.org
We propose the first, to our knowledge, loss function for approximate Nash equilibria of
normal-form games that is amenable to unbiased Monte Carlo estimation. This construction …

Exploitability minimization in games and beyond

D Goktas, A Greenwald - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Pseudo-games are a natural and well-known generalization of normal-form games, in which
the actions taken by each player affect not only the other players' payoffs, as in games, but …

Beyond Strict Competition: Approximate Convergence of Multi Agent Q-Learning Dynamics

A Hussain, F Belardinelli, G Piliouras - arXiv preprint arXiv:2307.13928, 2023 - arxiv.org
The behaviour of multi-agent learning in competitive settings is often considered under the
restrictive assumption of a zero-sum game. Only under this strict requirement is the …

Game Theoretic Rating in N-player general-sum games with Equilibria

L Marris, M Lanctot, I Gemp, S Omidshafiei… - arXiv preprint arXiv …, 2022 - arxiv.org
Rating strategies in a game is an important area of research in game theory and artificial
intelligence, and can be applied to any real-world competitive or cooperative setting …

The impact of exploration on convergence and performance of multi-agent Q-learning dynamics

A Hussain, F Belardinelli… - … Conference on Machine …, 2023 - proceedings.mlr.press
Understanding the impact of exploration on the behaviour of multi-agent learning has, so far,
benefited from the restriction to potential, or network zero-sum games in which convergence …

Policy Space Response Oracles: A Survey

A Bighashdel, Y Wang, S McAleer, R Savani… - arXiv preprint arXiv …, 2024 - arxiv.org
In game theory, a game refers to a model of interaction among rational decision-makers or
players, making choices with the goal of achieving their individual objectives. Understanding …

Data structures for deviation payoffs

B Wiedenbeck, E Brinkman - arXiv preprint arXiv:2302.13232, 2023 - arxiv.org
We present new data structures for representing symmetric normal-form games. These data
structures are optimized for efficiently computing the expected utility of each unilateral pure …

Reinforcement Nash Equilibrium Solver

X Wang, C Yang, S Li, P Li, X Huang, H Chan… - arXiv preprint arXiv …, 2024 - arxiv.org
Nash Equilibrium (NE) is the canonical solution concept of game theory, which provides an
elegant tool to understand the rationalities. Though mixed strategy NE exists in any game …

On the Stability of Learning in Network Games with Many Players

A Hussain, D Leonte, F Belardinelli… - arXiv preprint arXiv …, 2024 - arxiv.org
Multi-agent learning algorithms have been shown to display complex, unstable behaviours
in a wide array of games. In fact, previous works indicate that convergent behaviours are …

Developing, evaluating and scaling learning agents in multi-agent environments

I Gemp, T Anthony, Y Bachrach… - AI …, 2022 - content.iospress.com
Abstract The Game Theory & Multi-Agent team at DeepMind studies several aspects of multi-
agent learning ranging from computing approximations to fundamental concepts in game …