RUS  ENG
Full version
JOURNALS // Informatics and Automation // Archive

Tr. SPIIRAN, 2017 Issue 50, Pages 190–208 (Mi trspy932)

This article is cited in 1 paper

Theoretical and Applied Mathematics

Approximation of distributions of text characters bigrams frequencies for alphabetic characters identification

Yu. A. Kotov

Novosibirsk State Technical University (NSTU)

Abstract: The article discusses the application features of methods of the frequencies ordering and approximation to solve the problem of text characters identification. The conditions for realization of Jacobsen’s method for receiving the least error of identification are defined. The method of approximation of one- and two-dimensional distributions of the frequencies of characters bigrams of the text and the language is offered. The experimental data about errors of Jacobsen’s method and the offered approximation method for Russian language texts are provided.
The error of the offered method is less than that of Jacobsen's method. This method can be used for identification of text characters for any language that has a reference distribution of the alphabetic characters bigrams frequencies.

Keywords: approximation; identification; character; bigram; one-to-one substitution; cypher.

UDC: 519.6

DOI: 10.15622/sp.50.8



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2026