A. A. Sirota, A. O. Donskikh, A. V. Akimov, D. A. Minakov, “Multivariate mixed kernel density estimators and their application in machine learning for classification of biological objects based on spectral measurements”, Computer Optics, 2019, Volume 43, Issue 4, Pages 677

This article is cited in 5 papers

NUMERICAL METHODS AND DATA ANALYSIS

Multivariate mixed kernel density estimators and their application in machine learning for classification of biological objects based on spectral measurements

A. A. Sirota, A. O. Donskikh, A. V. Akimov, D. A. Minakov

Voronezh State University, Voronezh, Russia

Abstract: The problem of non-parametric multivariate density estimation for machine learning and data augmentation is considered. A new mixed density estimation method is proposed, based on calculating the convolution of independently obtained kernel density estimates for unknown distributions of informative features and a known (or independently estimated) density for non-informative interference occurring during measurements. Properties of the mixed density estimates obtained using this method are analyzed. The method is compared with the conventional Parzen-Rosenblatt window method applied directly to the training data. The equivalence of the mixed kernel density estimator and the data augmentation procedure based on the known (or estimated) statistical model of interference is proven theoretically and confirmed experimentally. The applicability of the mixed density estimators for training machine learning algorithms for the classification of biological objects (elements of grain mixtures) based on spectral measurements in the visible and near-infrared regions is evaluated.
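The equivalence stated in the abstract can be illustrated with a minimal one-dimensional sketch. It is not the authors' implementation: it assumes Gaussian kernels and zero-mean Gaussian interference, in which case the convolution of the kernel estimate with the interference density is again a Gaussian kernel estimate with bandwidth sqrt(h^2 + sigma^2). The function names, the synthetic data, and the values of `h`, `sigma`, and `copies` are all hypothetical choices made for illustration.

```python
import numpy as np


def gaussian_kde_1d(x, samples, h):
    """Parzen-Rosenblatt estimate on points x with a Gaussian kernel of bandwidth h."""
    z = (x[:, None] - samples[None, :]) / h
    return np.exp(-0.5 * z**2).sum(axis=1) / (len(samples) * h * np.sqrt(2.0 * np.pi))


def mixed_kde_1d(x, samples, h, sigma):
    """Feature KDE convolved with an assumed N(0, sigma^2) interference density.

    For Gaussian kernels the convolution has a closed form: the same
    estimator with bandwidth sqrt(h^2 + sigma^2).
    """
    return gaussian_kde_1d(x, samples, np.sqrt(h**2 + sigma**2))


rng = np.random.default_rng(0)
features = rng.normal(loc=1.0, scale=0.5, size=100)  # synthetic interference-free features
h, sigma = 0.2, 0.3                                  # assumed bandwidth and interference std
grid = np.linspace(-2.0, 4.0, 201)

# Mixed estimate computed analytically via the convolution.
mixed = mixed_kde_1d(grid, features, h, sigma)

# Data augmentation: replicate every sample, add simulated interference,
# then apply the ordinary window estimator with the original bandwidth h.
copies = 200
noise = rng.normal(0.0, sigma, size=features.size * copies)
augmented = np.repeat(features, copies) + noise
augmented_kde = gaussian_kde_1d(grid, augmented, h)

# Largest pointwise difference between the two estimates on the grid.
max_gap = float(np.max(np.abs(mixed - augmented_kde)))
print(f"max |mixed - augmented| = {max_gap:.4f}")
```

Under these assumptions the gap between the two curves is Monte Carlo error only, so it should shrink as `copies` grows. For non-Gaussian kernels or interference models the convolution generally has no such closed form and would have to be evaluated numerically.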

Keywords: machine learning, pattern classification, data augmentation, kernel density estimation, spectral measurements.

Received: 15.03.2019
Accepted: 10.04.2019

DOI: 10.18287/2412-6179-2019-43-4-677-691