RUS  ENG
Full version
JOURNALS // Avtomatika i Telemekhanika // Archive

Avtomat. i Telemekh., 2022 Issue 12, Pages 44–62 (Mi at16096)

Topical issue

Optimizing modality weights in topic models of transactional data

K. Ya. Khrylchenko, K. V. Vorontsov

Federal Research Center “Computer Science and Control,” Russian Academy of Sciences, Moscow, 119333 Russia

Abstract: Modern natural language processing models such as transformers operate multimodal data. In the present paper, multimodal data is explored using multimodal topic modeling on transactional data of bank corporate clients. A definition of the importance of modality for the model is proposed on the basis of which improvements are considered for two modeling scenarios: preserving the maximum amount of information by balancing modalities and automatic selection of modality weights to optimize auxiliary criteria based on topic representations of documents.
A model is proposed for adding numerical data to topic models in the form of modalities: each topic is assigned a normal distribution with learning parameters. Significant improvements are demonstrated in comparison with standard topic models on the problem of modeling bank corporate clients. Based on the topic representations of the bank's customers, a 90-day delay on the loan is predicted.

Keywords: multimodal topic modeling, transactional data, classification, loan delinquency forecast.

Presented by the member of Editorial Board: A. A. Lazarev

Received: 31.01.2022
Revised: 18.05.2022
Accepted: 29.06.2022

DOI: 10.31857/S0005231022120054


 English version:
Automation and Remote Control, 2022, 83:12, 1908–1922


© Steklov Math. Inst. of RAS, 2026