Learning in games with continuous action sets and unknown payoff functions
P Mertikopoulos, Z Zhou - Mathematical Programming, 2019 - Springer
This paper examines the convergence of no-regret learning in games with continuous action
sets. For concreteness, we focus on learning via “dual averaging”, a widely used class of no …
sets. For concreteness, we focus on learning via “dual averaging”, a widely used class of no …
No-regret learning and mixed nash equilibria: They do not mix
EV Vlatakis-Gkaragkounis, L Flokas… - Advances in …, 2020 - proceedings.neurips.cc
Understanding the behavior of no-regret dynamics in general N-player games is a
fundamental question in online learning and game theory. A folk result in the field states that …
fundamental question in online learning and game theory. A folk result in the field states that …
[PDF][PDF] No-regret learning and mixed Nash equilibria: They do not mix
L Flokas, EV Vlatakis-Gkaragkounis… - arXiv preprint arXiv …, 2020 - proceedings.neurips.cc
Understanding the behavior of no-regret dynamics in general 𝑁-player games is a
fundamental question in online learning and game theory. A folk result in the field states that …
fundamental question in online learning and game theory. A folk result in the field states that …
Semi Bandit Dynamics in Congestion Games: Convergence to Nash Equilibrium and No-Regret Guarantees.
In this work, we propose introduce a variant of online stochastic gradient descent and prove
it converges to Nash equilibria and simultaneously it has sublinear regret for the class of …
it converges to Nash equilibria and simultaneously it has sublinear regret for the class of …
Polynomial Convergence of Bandit No-Regret Dynamics in Congestion Games
We introduce an online learning algorithm in the bandit feedback model that, once adopted
by all agents of a congestion game, results in game-dynamics that converge to an $\epsilon …
by all agents of a congestion game, results in game-dynamics that converge to an $\epsilon …
Opinion dynamics with limited information
We study opinion formation games based on the Friedkin-Johnsen (FJ) model. We are
interested in simple and natural variants of the FJ model that use limited information …
interested in simple and natural variants of the FJ model that use limited information …
Continuous-time convergence rates in potential and monotone games
In this paper, we provide exponential rates of convergence to the interior Nash equilibrium
for continuous-time dual-space game dynamics such as mirror descent (MD) and actor-critic …
for continuous-time dual-space game dynamics such as mirror descent (MD) and actor-critic …
Convergence of for Gradient-Based Algorithms in Zero-Sum Games without the Condition Number: A Smoothed Analysis
I Anagnostides, T Sandholm - arXiv preprint arXiv:2410.21636, 2024 - arxiv.org
Gradient-based algorithms have shown great promise in solving large (two-player) zero-sum
games. However, their success has been mostly confined to the low-precision regime since …
games. However, their success has been mostly confined to the low-precision regime since …
Learning Rationalizable Equilibria in Multiplayer Games
A natural goal in multiagent learning besides finding equilibria is to learn rationalizable
behavior, where players learn to avoid iteratively dominated actions. However, even in the …
behavior, where players learn to avoid iteratively dominated actions. However, even in the …
[PDF][PDF] Multi-Agent Learning for Iterative Dominance Elimination: Formal Barriers and New Algorithms.
Dominated actions are natural (and perhaps the simplest possible) multi-agent
generalizations of sub-optimal actions as in standard single-agent decision making. Thus …
generalizations of sub-optimal actions as in standard single-agent decision making. Thus …