RUS  ENG
Full version
JOURNALS // Informatika i Ee Primeneniya [Informatics and its Applications] // Archive

Inform. Primen., 2025 Volume 19, Issue 3, Pages 73–81 (Mi ia956)

Automation of annotating implicit discourse relations: Challenges and opportunities

A. A. Goncharova, P. V. Iaroshenkoab

a Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, 44-2 Vavilov Str., Moscow 119333, Russian Federation
b Research Computing Center Lomonosov Moscow State University, 1, bld. 4 Leninskie Gory, GSP-1, Moscow 119991, Russian Federation

Abstract: The article outlines the principal challenges encountered in the automation of annotating implicit discourse relations, analyzes the underlying causes of these challenges, and suggests possible solutions. The article examines the main stages of the process: ($i$) the extraction of examples with implicit discourse relations; ($ii$) the delimitation of relational argument boundaries; and ($iii$) the selection of features for annotation of the extracted fragments. The results of applying the method of search with exclusion in parallel texts are presented along with a critical assessment of its limitations. Two factors significantly hindering the automation of argument identification in text spans with implicit discourse relations are analyzed: the considerable variability in argument length and the noncontiguous nature of arguments, which may be interrupted by intervening tokens. A comprehensive analysis of methods for automating feature selection for the linguistic data is provided. It has been demonstrated that even the processing of formal features may require the involvement of experts. Furthermore, while some semantic features are amenable to partial automation, others currently require manual annotation. The conclusions are illustrated by examples from the corpus.

Keywords: linguistic annotation, discourse relations, logical-semantic relations, implicitness, parallel texts.

Received: 15.06.2025
Accepted: 15.08.2025

DOI: 10.14357/19922264250309



© Steklov Math. Inst. of RAS, 2026