RUS  ENG
Full version
JOURNALS // Informatsionnye Tekhnologii i Vychslitel'nye Sistemy // Archive

Informatsionnye Tekhnologii i Vychslitel'nye Sistemy, 2019 Issue 3, Pages 66–78 (Mi itvs354)

This article is cited in 4 papers

PATTERN RECOGNITION

Dynamic programming approach to textual structured objects segmentation in images

M. A. Povolotskiyabc, D. V. Tropinca, T. S. Chernovc, B. I. Savelyevc

a Moscow Institute of Physics and Technology (State University), Dolgoprudny, Russia
b The Institute for Information Transmission Problems of the Russian Academy of Sciences (Kharkevich Institute), Moscow, Russia
c Smart Engines Service, Moscow, Russia

Abstract: This paper deals with the problem of segmentation of images of text fragments with known constraints on the relative position of elements. The model in which the constraints form a path graph is considered. It is shown that the segmentation problem in this case can be solved precisely with use of a dynamic programming algorithm, and this algorithm has an optimal asymptotic complexity. This algorithm was built into two recognition systems. The first system was designed to recognize identity documents, such as passports and driver's licenses. The proposed algorithm was used in this system to extract information fields. To do this, a two-level field hierarchy was introduced, in which the fields were grouped in rows, within which they were ordered from left to right, and the lines themselves were ordered from top to bottom. The second system was designed to recognize license plates in which the proposed algorithm was used to segment plates into individual characters. In this case, the natural ordering of characters from left to right was introduced. Thus, the generality of the proposed approach is demonstrated. Experiments were conducted on a closed data set to measure the quality and performance of the solutions obtained on a mobile phone. Experimental results showed that the solutions obtained are superior in quality to algorithms that do not use constraints on the mutual arrangement of elements, and their complexity allows them to work on mobile devices in real time.

Keywords: text segmentation, dynamic programming, document recognition, image processing, OCR.

DOI: 10.14357/20718632190306



© Steklov Math. Inst. of RAS, 2026