首页 | 本学科首页   官方微博 | 高级检索  
     检索      


Learning with perfect information
Authors:Pradeep Dubey  Ori Haimanko  
Institution:a Center for Game Theory, Department of Economics, SUNY at, Stony Brook, NY 11794, USA;b Department of Economics, Ben-Gurion University, Beer Sheva 84105, Israel
Abstract:For extensive form games with perfect information, consider a learning process in which, at any iteration, each player unilaterally deviates to a best response to his current conjectures of others' strategies; and then updates his conjectures in accordance with the induced play of the game. We show that, for generic payoffs, the outcome of the game becomes stationary, and is consistent with Nash equilibrium. In general, if payoffs have ties or if players observe more of each others' strategies than is revealed by plays of the game, the same result holds provided a rationality constraint is imposed on unilateral deviations: no player changes his moves in subgames that he deems unreachable, unless he stands to improve his payoff there. Moreover, with this constraint, the sequence of strategies and conjectures also becomes stationary, and yields a self-confirming equilibrium.
Keywords:Learning in extensive form games  Perfect information  Self-confirming and Nash equilibria  Objective updates  Convergence in finite time
本文献已被 ScienceDirect 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号