首页 | 本学科首页   官方微博 | 高级检索  
     


Der diskontierte Einarmige Bandit
Authors:Dipl. Math. J. Fischer
Affiliation:(1) Inst. für Statistik und Unternehmensforschung, TU München, Barerstr. 23, D-8000 München 2
Abstract:Summary A sequential stochastic process is observed in discrete time intervals; the process is generated by Bernoulli trials with unknown meanp. Given an a-priori distribution forp, regarded as a random variable, and discounting future payoffs with a factor ngr, O<ngr<1, optimal and suboptimal stopping rules (depending on ngr) are constructed. this leads to the connection of the process under consideration with another process of the same structure, but with known meanPo for the Bernoulli trials, thus finally resulting in an One-Armed-Bandit problem.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号