Avtomat. i Telemekh., 2025 Issue 4, Pages 71–91 (Mi at16531)

Intellectual Control Systems, Data Analysis

Data quality management in problem-solving using research infrastructures over heterogeneous data sources

N. A. Skvortsov

Federal Research Center “Computer Science and Control”, Russian Academy of Sciences, Moscow, Russia

Abstract: Problem-solving based on available research data, especially in the context of open science and research infrastructures, should allow the data to be reused many times. Data quality metrics are important characteristics that affect not only the accuracy of research results but also the assessment of data suitability, the feasibility of solving specific research problems, the choice of methods for working with data, object matching, data compatibility, and other aspects of reuse. Data quality therefore has to be assessed along various dimensions and at different levels of aggregation, from entire datasets to individual values. This study presents an approach to integrated data quality management based on data specifications and on data and metadata quality requirements. Various data quality assessment dimensions, including accuracy, completeness, and provenance, are discussed. The developed approach is applied to problem-solving using multiple data sources in stellar astronomy.

Keywords: data quality, data reuse, formal specifications, non-functional requirements.

Presented by the member of Editorial Board: A. A. Galyaev

Received: 29.11.2024
Revised: 10.01.2025
Accepted: 14.01.2025

DOI: 10.31857/S0005231025040057


 English version:
Automation and Remote Control, 2025, 86:4, 343–357

