作者
Johannes Heinrich, Marc Lanctot, David Silver
发表日期
2015
研讨会论文
Proceedings of the 32nd International Conference on Machine Learning
简介
Fictitious play is a popular game-theoretic model of learning in games. However, it has received little attention in practical applications to large problems. This paper introduces two variants of fictitious play that are implemented in behavioural strategies of an extensive-form game. The first variant is a full-width process that is realization equivalent to its normal-form counterpart and therefore inherits its convergence guarantees. However, its computational requirements are linear in time and space rather than exponential. The second variant, Fictitious Self-Play, is a machine learning framework that implements fictitious play in a sample-based fashion. Experiments in imperfect-information poker games compare our approaches and demonstrate their convergence to approximate Nash equilibria.
引用总数
201420152016201720182019202020212022202320241171023393850648839
学术搜索中的文章
J Heinrich, M Lanctot, D Silver - International conference on machine learning, 2015