Abstract:
This work introduces a method aimed at enhancing the reliability of the Bayesian classifier. The method involves augmenting the training dataset, which consists of a mixture of distributions from two original classes, with artificially generated observations from a third, ‘background’ class, uniformly distributed over a compact set that contains the unknown support of the original mixture.
This modification allows the value of the discriminant function outside the support of the training data distribution to approach a prescribed level (in this case, zero). Adding a decision option for ‘Refusal to Classify’, triggered when the discriminant function takes sufficiently small values, results in a localized increase in classifier reliability. Specifically, this approach addresses several issues: it enables the rejection of data that differs significantly from the training data; facilitates the detection of anomalies in input data; and avoids decision-making in ‘boundary’ regions when separating classes.
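To make the scheme concrete, below is a minimal sketch, not the paper's exact construction: it augments a two-class toy dataset with a uniform 'background' class and refuses to classify whenever the discriminant is close to zero. The 2-D Gaussian toy data, the bounding box used as the compact set, scikit-learn's GaussianNB as a stand-in for the Bayesian classifier, and the threshold tau are all assumptions made for this illustration.

```python
# Minimal sketch (assumptions: toy 2-D Gaussian classes, a bounding box
# as the compact set, GaussianNB as a stand-in Bayesian classifier, and
# an ad hoc rejection threshold tau).
import numpy as np
from sklearn.naive_bayes import GaussianNB

rng = np.random.default_rng(0)

# Two original classes: a toy mixture of Gaussians in the plane.
X1 = rng.normal(loc=[-2.0, 0.0], scale=1.0, size=(500, 2))
X2 = rng.normal(loc=[+2.0, 0.0], scale=1.0, size=(500, 2))

# 'Background' class: uniform over a compact box that contains the
# (unknown) support of the original mixture.
X0 = rng.uniform(-8.0, 8.0, size=(1000, 2))

X = np.vstack([X1, X2, X0])
y = np.hstack([np.full(500, 1), np.full(500, 2), np.full(1000, 0)])

clf = GaussianNB().fit(X, y)

def classify_with_rejection(x, tau=0.2):
    """Return 1, 2, or None ('refusal to classify').

    The discriminant is the difference of the posterior probabilities
    of the two original classes.  Far from the training support the
    background class dominates, so both posteriors (and hence the
    discriminant) are near zero; the same happens near the boundary
    between the classes, so both situations trigger rejection.
    """
    p = clf.predict_proba(np.atleast_2d(np.asarray(x, dtype=float)))[0]
    idx = {c: i for i, c in enumerate(clf.classes_)}
    g = p[idx[1]] - p[idx[2]]
    if abs(g) < tau:
        return None
    return 1 if g > 0 else 2

print(classify_with_rejection([-2.0, 0.0]))  # deep inside class 1 -> 1
print(classify_with_rejection([+2.0, 0.0]))  # deep inside class 2 -> 2
print(classify_with_rejection([0.0, 0.0]))   # boundary region -> None
print(classify_with_rejection([7.0, 7.0]))   # far from the data -> None
```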
The paper provides a theoretical justification for the optimality of the proposed classifier. The practical utility of the method is demonstrated through classification tasks involving images and time series.
Additionally, a methodology for identifying trusted regions is proposed. It can be used to detect anomalous data, shifts in the parameters of the class distributions, and regions where the distributions of the original classes overlap. Based on these trusted regions, quantitative metrics of classifier reliability and efficiency are introduced.
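The abstract does not state formal definitions of these metrics; the following sketch assumes one plausible reading (an illustration, not the paper's definitions): the trusted region is taken to be the set of inputs the classifier does not reject, 'efficiency' is the share of points falling into it, and 'reliability' is the accuracy restricted to it.

```python
def reliability_and_efficiency(y_true, y_pred):
    """y_pred holds predicted labels, with None meaning 'refusal to classify'.

    Illustrative definitions (assumptions, not the paper's):
      efficiency  = share of points falling into the trusted region,
                    i.e. points that were not rejected;
      reliability = accuracy computed over the trusted region only.
    """
    accepted = [(t, p) for t, p in zip(y_true, y_pred) if p is not None]
    efficiency = len(accepted) / len(y_pred)
    if not accepted:
        return float("nan"), efficiency
    reliability = sum(t == p for t, p in accepted) / len(accepted)
    return reliability, efficiency

# Toy usage: two rejected points, one error among the accepted ones.
y_true = [1, 2, 1, 2, 1, 2]
y_pred = [1, 2, None, 2, None, 1]
print(reliability_and_efficiency(y_true, y_pred))  # (0.75, 0.666...)
```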
Bibliography: 23 titles.
Keywords: machine learning, Bayesian classifier, trusted machine learning, interpretability, out-of-distribution (OOD), image classification, time series classification, rejection of classification, background class.