Exploration-exploitation in multi-agent learning: Catastrophe theory meets game theory
S Leonardos, G Piliouras - Artificial Intelligence, 2022 - Elsevier
Exploration-exploitation is a powerful and practical tool in multi-agent learning (MAL);
however, its effects are far from understood. To make progress in this direction, we study a …
however, its effects are far from understood. To make progress in this direction, we study a …
A geometric decomposition of finite games: Convergence vs. recurrence under exponential weights
In view of the complexity of the dynamics of learning in games, we seek to decompose a
game into simpler components where the dynamics' long-run behavior is well understood. A …
game into simpler components where the dynamics' long-run behavior is well understood. A …
Advertising patterns in a dynamic oligopolistic growing market with decay
A finite-horizon Lanchester model of a (continuous-time) differential game of oligopolistic
advertising is considered, and the analytical form of the unique closed-loop Nash …
advertising is considered, and the analytical form of the unique closed-loop Nash …
GameVLM: A Decision-making Framework for Robotic Task Planning Based on Visual Language Models and Zero-sum Games
A Mei, J Wang, GN Zhu, Z Gan - arXiv preprint arXiv:2405.13751, 2024 - arxiv.org
With their prominent scene understanding and reasoning capabilities, pre-trained visual-
language models (VLMs) such as GPT-4V have attracted increasing attention in robotic task …
language models (VLMs) such as GPT-4V have attracted increasing attention in robotic task …
Disequilibrium play in tennis
A Anderson, J Rosen, J Rust, KP Wong - 2021 - journals.uchicago.edu
Do the world's best tennis pros play Nash equilibrium mixed strategies? We answer this
question using data on serve direction choices (to the receiver's left, right or body) from the …
question using data on serve direction choices (to the receiver's left, right or body) from the …
[HTML][HTML] The lottery contest is a best-response potential game
C Ewerhart - Economics Letters, 2017 - Elsevier
It is shown that the n-player lottery contest admits a best-response potential (Voorneveld,
2000). This is true also when the contest technology reflects the possibility of a draw. The …
2000). This is true also when the contest technology reflects the possibility of a draw. The …
On the convergence of fictitious play: A decomposition approach
Fictitious play (FP) is one of the most fundamental game-theoretical learning frameworks for
computing Nash equilibrium in $ n $-player games, which builds the foundation for modern …
computing Nash equilibrium in $ n $-player games, which builds the foundation for modern …
A geometric decomposition of finite games: Convergence vs. recurrence under no-regret learning
In view of the complexity of the dynamics of no-regret learning in games, we seek to
decompose a finite game into simpler components where the day-to-day behavior of the …
decompose a finite game into simpler components where the day-to-day behavior of the …
Decomposition of games: some strategic considerations
J Abdou, N Pnevmatikos, M Scarsini… - Mathematics of …, 2022 - pubsonline.informs.org
Orthogonal direct-sum decompositions of finite games into potential, harmonic and
nonstrategic components exist in the literature. In this paper we study the issue of …
nonstrategic components exist in the literature. In this paper we study the issue of …