RUS  ENG
Full version
JOURNALS // Vestnik of Saint Petersburg University. Mathematics. Mechanics. Astronomy // Archive

Vestnik of Saint Petersburg University. Mathematics. Mechanics. Astronomy, 2021 Volume 8, Issue 3, Pages 394–405 (Mi vspua90)

This article is cited in 1 paper

MATHEMATICS

The symptom-syndrome analysis of multivariate categorical data based on Zhegalkin polynomials

N. P. Alekseeva

St. Petersburg State University, 7-9, Universitetskaya nab., St. Petersburg, 199034, Russian Federation

Abstract: In this article, we study the distribution, entropy and other informational properties of finite projective subspaces (syndromes) parameterized by impulse sequences with basic elements in the form of symptoms - polynomials over the field F2 which are known as Zhegalkin polynomials. It has been proven that the super syndrome, which is a linear syndrome with basic elements in the form of a multiplicative syndrome, is closed. If in the multiplication of two symptoms one is neutral, then we are talking about its majorization. The ordered by majorization symptoms form a majorized syndrome. Is proved that the majorized syndrome is closed and coincides with the super syndrome. The statements formulated in the first part of the paper are used to justify the convergence of the iterative procedure (PI), in which the most informative symptoms selected from partial super syndromes are again used in the next step. The stationary state of PI is obtained if all elements of the input set belong to either the same partial super syndrome or to the majorized syndrome. Thanks IP it is possible to quickly find the optimal syndrome from a large set of variables. An example from phthisiology shows how the specificity of classification can be improved using symptom analysis.

Keywords: multivariate analysis of categorical data, finite geometries, algebraic normal forms, entropy, uncertainty coefficient, iterative procedure, symptom-syndromic method, dimension reduction, classification, sensitivity, specificity.

UDC: 519.22-24

MSC: 62-07, 62B10, 62H86

Received: 18.07.2020
Revised: 21.10.2020
Accepted: 19.03.2020

DOI: 10.21638/spbu01.2021.302



© Steklov Math. Inst. of RAS, 2026