Generalised weakened fictitious play |
| |
Authors: | David S. Leslie E.J. Collins |
| |
Affiliation: | aDepartment of Statistics, University of Oxford, Oxford, OX1 3TG, UK;bDepartment of Mathematics, University of Bristol, Bristol, BS8 1TW, UK |
| |
Abstract: | ![]() A general class of adaptive processes in games is developed, which significantly generalises weakened fictitious play [Van der Genugten, B., 2000. A weakened form of fictitious play in two-person zero-sum games. Int. Game Theory Rev. 2, 307–328] and includes several interesting fictitious-play-like processes as special cases. The general model is rigorously analysed using the best response differential inclusion, and shown to converge in games with the fictitious play property. Furthermore, a new actor–critic process is introduced, in which the only information given to a player is the reward received as a result of selecting an action—a player need not even know they are playing a game. It is shown that this results in a generalised weakened fictitious play process, and can therefore be considered as a first step towards explaining how players might learn to play Nash equilibrium strategies without having any knowledge of the game, or even that they are playing a game. |
| |
Keywords: | Fictitious play Best response differential inclusion Stochastic approximation Actor– critic process |
本文献已被 ScienceDirect 等数据库收录! |
|