RUS  ENG
Full version
JOURNALS // Proceedings of the Institute for System Programming of the RAS // Archive

Proceedings of ISP RAS, 2022 Volume 34, Issue 2, Pages 191–200 (Mi tisp687)

Modification of the Method for Calculating Polygenic Risks With Variation Graph

O. A. Kondratevaab, E. A. Karpulevitcha

a Ivannikov Institute for System Programming of the RAS
b Lomonosov Moscow State University

Abstract: Representation of the DNA sequence is possible in various ways. The variation graph is one of the most accurate methods that allows you to work with atypical areas and take into account all their diversity. Based on this data structure and the polygenic risk assessment method, a DNA interpretation system was built. As a result, a correlation coefficient was obtained between the path in the column responsible for a specific DNA sequence and the feature. We then compared it with a coefficient obtained by a similar method but using sequence representation using a reference genome. Such a comparison helped to evaluate the effectiveness of the representation in the form of a graph. After that, a modified method for calculating the polygenic score on the alignment data of the vg tool was built, which was also compared with existing methods. The modified method showed an improvement in the prediction of the trait.

Keywords: graph, genome representation, variation graph, HISAT2, vg, minimap2, GGP, genomic graph pipeline, PRS, polygenic score, polygenic risk score

DOI: 10.15514/ISPRAS-2022-34(2)-15



© Steklov Math. Inst. of RAS, 2026