V. V. Arlazarov, “Methods for combining multiple text recognition results”, Artificial Intelligence and Decision Making, 2022, Issue 3,Pages <nobr>106

This article is cited in 1 paper

Analysis of textual and graphical information

Methods for combining multiple text recognition results

V. V. Arlazarov^ab

^a Federal Research Center "Computer Science and Control" of Russian Academy of Sciences, Moscow, Russia
^b Smart Engines Service LLC, Moscow, Russia

Abstract: The task of per-frame combination of text recognition results from multiple images is an important component of video stream document recognition systems. Currently there is no unified approach to solving this problem which would yield a high precision of text recognition. In this paper a comparative study is presented of known approaches to the combination of recognition results for identity document fields. It was demonstrated that different approaches are advantageous on different parts of the data sets, while a sepection of the potential best single result can still significantly outperform all the analyzed methods.

Keywords: text recognition, document analysis, video stream recognition, combination methods, OCR, image processing.

DOI: 10.14357/20718594220309