A Behavioral Learning Process in Games |
| |
Authors: | Jean-Fran ois Laslier, Richard Topol,Bernard Walliser |
| |
Affiliation: | a CNRS, Laboratoire d'Économétrie, École Polytechnique, Paris, France;b CNRS, CREA, École Polytechnique, Paris, France;c CERAS, École Nationale des Ponts et Chaussées, Paris, France;d CREA, École Polytechnique, Paris, France |
| |
Abstract: | This paper studies the cumulative proportional reinforcement (CPR) rule, according to which an agent plays, at each period, an action with a probability proportional to the cumulative utility that the agent has obtained with that action. The asymptotic properties of this learning process are examined for a decision-maker under risk, where it converges almost surely toward the expected utility maximizing action(s). The process is further considered in a two-player game; it converges with positive probability toward any strict pure Nash equilibrium and converges with zero probability toward some mixed equilibria (which are characterized). The CPR rule is compared in its principles with other reinforcement rules and with replicator dynamics. Journal of Economic Literature Classification Number: C72. |
| |
Keywords: | evolution learning Nash equilibrium Polya urn reinforcement |
本文献已被 ScienceDirect 等数据库收录! |
|