RUS  ENG
Full version
JOURNALS // Doklady Rossijskoj Akademii Nauk. Mathematika, Informatika, Processy Upravlenia // Archive

Dokl. RAN. Math. Inf. Proc. Upr., 2025 Volume 527, Pages 182–191 (Mi danma677)

SPECIAL ISSUE: ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING TECHNOLOGIES

FoCAT: foundation model for estimating the conditional average treatment effect

S. R. Kirpichenko, A. V. Konstantinov, L. V. Utkin

Higher School of Artificial Intelligence Technologies Peter the Great St.Petersburg Polytechnic University, St.Petersburg, Russia

Abstract: The paper presents a novel foundation model, FoCAT (Foundation Causal Adaptive Transformer), developed for estimating the conditional treatment effect. The model addresses several key challenges inherent in causal inference tasks, including a limited sample size in the treatment group, the impossibility of simultaneously observing patient outcomes before and after intervention, and difficulties in testing models on real data. FoCAT employs a hypernetwork architecture. Unlike existing approaches that predict separate outcome functions for control and treatment groups, FoCAT directly estimates the conditional treatment effect. The model allows for control of the context informativeness through specialized classification tokens. Numerical experiments on synthetic and real-world datasets demonstrate superiority of FoCAT in estimation of the treatment effect. The code implementing FoCAT is publicly available.

Keywords: foundation model, treatment effect, hypernetwork, transformer.

UDC: 004.8

Received: 08.08.2025
Accepted: 22.09.2025

DOI: 10.7868/S268695432507015X



Bibliographic databases:


© Steklov Math. Inst. of RAS, 2026