Avtomat. i Telemekh., 2011 Issue 5, Pages 127–138 (Mi at1708)

This article is cited in 10 papers

Stochastic Systems, Queuing Systems

Finding minimax strategy and minimax risk in a random environment (the two-armed bandit problem)

A. V. Kolnogorov

Yaroslav-the-Wise State University, Novgorod, Russia

Abstract: The minimax strategy and minimax risk in a stationary random environment are found as the Bayesian strategy and risk corresponding to the worst-case prior distribution. For environments with normally distributed incomes that have unit variance and expectations depending only on the selected alternative, this prior can be chosen to be symmetric and asymptotically uniform, which makes numerical methods applicable. The results can be used in systems with parallel data processing, in particular for controlling environments whose distributions are not normal.
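
The relationship between Bayesian risk and worst-case (minimax) risk described in the abstract can be illustrated with a minimal sketch. It does not reproduce the paper's method: a simple explore-then-commit strategy is swapped in because its expected loss has a closed form for two arms with unit-variance normal incomes. The function names, the horizon, the grid of gaps between the two expectations, and the uniform weighting over that grid are all assumptions made for illustration.

```python
import math


def normal_cdf(x):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))


def etc_regret(gap, horizon, explore):
    """Expected loss of explore-then-commit (hypothetical strategy, not the paper's).

    Each arm is pulled `explore` times, then the arm with the larger sample
    mean is used for the remaining `horizon - 2 * explore` steps. Incomes are
    normal with unit variance; `gap` is the absolute difference of expectations.
    """
    # The difference of sample means is N(gap, 2 / explore), so the worse arm
    # is selected with probability Phi(-gap * sqrt(explore / 2)).
    p_wrong = normal_cdf(-gap * math.sqrt(explore / 2.0))
    return gap * (explore + (horizon - 2 * explore) * p_wrong)


def bayes_risk(gaps, horizon, explore):
    """Loss averaged over a uniform prior on the grid of gaps (assumed prior)."""
    return sum(etc_regret(g, horizon, explore) for g in gaps) / len(gaps)


def worst_case_risk(gaps, horizon, explore):
    """Largest loss over the grid of gaps."""
    return max(etc_regret(g, horizon, explore) for g in gaps)


horizon = 100                              # assumed control horizon
gaps = [0.05 * k for k in range(81)]       # assumed grid of gaps on [0, 4]

# Exploration length that minimizes the worst-case loss within this family.
best_explore = min(
    range(1, horizon // 2 + 1),
    key=lambda n: worst_case_risk(gaps, horizon, n),
)

print("best exploration length:", best_explore)
print("worst-case risk:", worst_case_risk(gaps, horizon, best_explore))
print("Bayes risk under uniform prior:", bayes_risk(gaps, horizon, best_explore))
```

For any fixed strategy the Bayesian risk under a prior cannot exceed the worst-case risk over the same parameter set, so the two quantities coincide only when the prior concentrates on the hardest parameters. The abstract's claim concerns the stronger statement that, over all strategies, the minimax risk is attained as a Bayesian risk under the worst prior.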

Presented by a member of the Editorial Board: A. V. Nazin

Received: 22.03.2010


 English version:
Automation and Remote Control, 2011, 72:5, 1017–1027
