RUS  ENG
Full version
JOURNALS // Proceedings of the Institute for System Programming of the RAS // Archive

Proceedings of ISP RAS, 2025 Volume 37, Issue 6(2), Pages 93–106 (Mi tisp1076)

Method for training perceptron on tabular data with missing values

A. I. Perminov, A. P. Kovalenko, D. Yu. Turdakov

Ivannikov Institute for System Programming of the RAS

Abstract: Handling missing values in tabular data remains a critical challenge for building robust machine learning models. This paper presents a novel approach to imputation based on unary classification. The proposed method employs an ensemble of perceptrons trained independently for each class to estimate the likelihood of reconstructed values with respect to the empirical support of that class. A uniform distribution over a bounded region of the feature space is used as a background model, enabling the interpretation of the model’s output as an approximation of the posterior probability that an object belongs to a given class. This probabilistic interpretation is then leveraged within an iterative procedure for missing value imputation and classifier training. The theoretical validity of the proposed estimator is rigorously justified. Experiments on synthetic two-dimensional datasets with missing values generated under the MCAR (Missing Completely At Random) mechanism demonstrate the superiority of the proposed method over classical imputation techniques, particularly in scenarios with high missingness rates and complex class boundaries.

Keywords: missing data imputation; unary classification; perceptron; machine learning; Bayesian classifier; posterior probability estimation; MCAR; neural network regression

DOI: 10.15514/ISPRAS-2025-37(6)-22



© Steklov Math. Inst. of RAS, 2026