
Proceedings of ISP RAS, 2022, Volume 34, Issue 2, Pages 89–110 (Mi tisp680)

Loss functions for training document image segmentation models

A. I. Perminov (a, b), D. Yu. Turdakov (a, b), O. V. Belyaeva (a)

a Ivannikov Institute for System Programming of the RAS
b Lomonosov Moscow State University

Abstract: This work aims to improve the quality of neural network segmentation of document images, specifically images of various scientific articles and legal acts, by training with modified loss functions that take into account the features of images in this subject area. Existing loss functions are analyzed, and new functions are developed that operate both on the coordinates of the bounding boxes and on information about the pixels of the input image. To assess quality, a neural network segmentation model is trained with the modified loss functions, and a theoretical assessment is carried out using a simulation experiment that shows the convergence rate and segmentation error. As a result of the study, rapidly converging loss functions were created that improve the quality of document image segmentation by using additional information about the input data.
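
As a rough illustration of a loss that operates on bounding-box coordinates, the sketch below computes the widely used IoU loss (1 minus intersection over union) for two axis-aligned boxes. The abstract does not say which existing losses were analyzed or how the new ones are defined, so this is an assumed, generic example and not the authors' method; the function name `iou_loss` and the (x1, y1, x2, y2) box format are likewise assumptions made for illustration.

```python
def iou_loss(pred, target):
    """Return 1 - IoU for two axis-aligned boxes given as (x1, y1, x2, y2).

    Illustrative only: a generic coordinate-based loss, not the loss
    functions proposed in the paper.
    """
    # Width and height of the overlap, clamped to zero for disjoint boxes.
    inter_w = max(0.0, min(pred[2], target[2]) - max(pred[0], target[0]))
    inter_h = max(0.0, min(pred[3], target[3]) - max(pred[1], target[1]))
    inter = inter_w * inter_h

    area_pred = (pred[2] - pred[0]) * (pred[3] - pred[1])
    area_target = (target[2] - target[0]) * (target[3] - target[1])
    union = area_pred + area_target - inter

    # Degenerate boxes with zero total area: treat as maximal loss.
    if union <= 0.0:
        return 1.0
    return 1.0 - inter / union
```

The loss is 0 when the predicted box matches the target exactly and 1 when the boxes do not overlap. For disjoint boxes it stays at 1 however far apart they are, so it gives no signal about distance there, which is one common reason for modifying coordinate-based losses.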

Keywords: document image segmentation, loss functions, loss function modifications

DOI: 10.15514/ISPRAS-2022-34(2)-8


