RUS  ENG
Full version
JOURNALS // Computer Optics // Archive

Computer Optics, 2022 Volume 46, Issue 5, Pages 790–800 (Mi co1072)

This article is cited in 1 paper

IMAGE PROCESSING, PATTERN RECOGNITION

Development of software for the segmentation of text areas in real-scene images

V. A. Lobanova, Yu. A. Ivanova

Tomsk Polytechnic University

Abstract: This article discusses the design and development of a neural network algorithm for the segmentation of text areas in real-scene images. After reviewing the available neural network models, the U-net model was chosen as a basis. Then an algorithm for detecting text areas in real-scene images was proposed and implemented. The experimental training of the network allows one to define the neural network parameters such as the size of input images and the number and types of the network layers. Bilateral and low-pass filters were considered as a preprocessing stage. The number of images in the KAIST Scene Text Database was increased by applying rotations, compression, and splitting of the images. The results obtained were found to surpass competing methods in terms of the F-measure value.

Keywords: deep learning, U-Net architecture, image processing, image segmentation, text areas, real scenes images

Received: 13.09.2021
Accepted: 22.04.2022

DOI: 10.18287/2412-6179-CO-1047



© Steklov Math. Inst. of RAS, 2026