A. V. Kolnogorov, “Finding minimax strategy and minimax risk in a random environment (the two-armed bandit problem)”, Avtomat. i Telemekh., 2011, Issue 5,Pages <nobr>127

This article is cited in 10 papers

Stochastic Systems, Queuing Systems

Finding minimax strategy and minimax risk in a random environment (the two-armed bandit problem)

A. V. Kolnogorov

Yaroslav-the-Wise State University, Novgorod, Russia

Abstract: Minimax strategy and risk in a stationary random environment are found as Bayesian ones corresponding to the worst prior distribution. For environments with normally distributed incomes with unit variance and expectations that depend only on the alternative selected, this distribution can be chosen to be symmetric and asymptotically uniform. This lets one use numerical methods. The results can be used for systems with parallel data processing, in particular, for controlling environments with distributions other than normal.

Presented by the member of Editorial Board: A. V. Nazin

Received: 22.03.2010