RUS  ENG
Full version
JOURNALS // Modelirovanie i Analiz Informatsionnykh Sistem // Archive

Model. Anal. Inform. Sist., 2022 Volume 29, Number 2, Pages 134–147 (Mi mais772)

This article is cited in 1 paper

Theory of data

Recursive sentiment detection algorithm for Russian sentences

A. Yu. Poletaev, I. V. Paramonov

P. G. Demidov Yaroslavl State University, 14 Sovetskaya str., Yaroslavl 150003, Russia

Abstract: The article is devoted to the task of sentiment detection of Russian sentences. The sentiment is conceived as the author's attitude to the topic of a sentence. This assay considers positive, neutral, and negative sentiment classes, i.e., the task of three-classes classification is solved.
The article introduces a rule-based sentiment detection algorithm for Russian sentences. The algorithm is based on the assumption that the sentiment of a phrase can be determined by the sentiments of its parts by the recursive application of appropriate semantic rules to the sentiments of its parts organized as a constituency parse tree. The utilized set of semantic rules was constructed based on a discussion with experts in linguistics. The experiments showed that the proposed recursive algorithm performs slightly worse on the hotel reviews corpus than the adapted rule-based approach: weighted $F_1$-measures are $0.75$ and $0.78$, respectively. To measure the algorithm efficiency on complex sentences, we created OpenSentimentCorpus based on OpenCorpora, an open corpus of sentences extracted from Russian news and periodicals. On OpenSentimentCorpus the recursive algorithm performs better than the adapted approach does: $F_1$-measures are $0.70$ and $0.63$, respectively. This indicates that the proposed algorithm has an advantage in case of more complex sentences with more subtle ways of expressing the sentiment.

Keywords: sentiment analysis, sentiment detection, semantic rules, sentiment corpus.

UDC: 004.912

MSC: 68T50

Received: 30.04.2022
Revised: 22.05.2022
Accepted: 25.05.2022

DOI: 10.18255/1818-1015-2022-2-134-147



© Steklov Math. Inst. of RAS, 2026