Der diskontierte Einarmige Bandit (The Discounted One-Armed Bandit)

Authors:	Dipl. Math. J. Fischer

Affiliation:	(1) Inst. für Statistik und Unternehmensforschung, TU München, Barerstr. 23, D-8000 München 2

Abstract:	A sequential stochastic process is observed at discrete time intervals; the process is generated by Bernoulli trials with unknown mean p. Given an a priori distribution for p, which is regarded as a random variable, and discounting future payoffs by a factor strictly between 0 and 1, optimal and suboptimal stopping rules (depending on the discount factor) are constructed. This connects the process under consideration with a second process of the same structure but with known mean P_0 for the Bernoulli trials, and thus finally results in a one-armed bandit problem.
