I. V. Vinokurov, “Using a convolutional neural network to recognize text elements in poor quality scanned images”, Program Systems: Theory and Applications, 2022, Volume 13, Issue 3,Pages <nobr>29

This article is cited in 5 papers

Artificial Intelligence, Intelligent Systems, Neural Networks

Using a convolutional neural network to recognize text elements in poor quality scanned images

I. V. Vinokurov

Financial University under the Government of the Russian Federation

Abstract: The paper proposes a method for recognizing the content of scanned images of poor quality using convolutional neural networks (CNNs). The method involves the implementation of three main stages.
At the first stage, image preprocessing is implemented, which consists of identifying the contours of its alphabetic and numeric elements and basic punctuation marks.
At the second stage, the content of the image fragments inside the identified contours is sequentially fed to the input of the CNN, which implements a multiclass classification.
At the third and final stage, the post-processing of the set of SNA responses and the formation of a text document with recognition results are implemented.
An experimental study of all stages was carried out in Python using the Keras deep learning libraries and OpenCV computer vision and showed fairly good results for the main types of deterioration in the quality of a scanned image: geometric distortions, blurring of borders, the appearance of extra lines and spots during scanning, etc.

Key words and phrases: image processing, convolutional neural network, Python, Keras, OpenCV.

UDC: 004.932.75'1+004.89

MSC: Primary 68T20; Secondary 68T07, 68T45

Received: 28.07.2022
Accepted: 15.09.2022

DOI: 10.25209/2079-3316-2022-13-3-29-43