RUS  ENG
Full version
JOURNALS // Vestnik Yuzhno-Ural'skogo Universiteta. Seriya Matematicheskoe Modelirovanie i Programmirovanie // Archive

Vestnik YuUrGU. Ser. Mat. Model. Progr., 2025 Volume 18, Issue 1, Pages 35–45 (Mi vyuru747)

This article is cited in 1 paper

Mathematical Modelling

A limiting description in a Gaussian one-armed bandit problem with both unknown parameters

A. V. Kolnogorov

Yaroslav-the-Wise Novgorod State University, Veliky Novgorod, Russian Federation, kolnogorov53@mail.ru

Abstract: We consider the limiting description of control in a Gaussian one-armed bandit problem, which is a mathematical model for optimizing batch processing of big data in the presence of two alternative methods with known efficiency of the first method. We establish that this description is given by a second-order partial differential equation in which the variance of one-step income is known. This means that in the case of big data, the variance can be arbitrarily accurate estimated at a short initial stage of processing, and then the obtained estimate is used by the control strategy.

Keywords: one-armed bandit, Bayesian and minimax approaches, invariant description, batch processing.

UDC: 519.244+519.83

MSC: 62C10, 62L05, 91A35

Received: 24.10.2024

Language: English

DOI: 10.14529/mmp250103



© Steklov Math. Inst. of RAS, 2026