Abstract:
A three-step scheme for constructing algorithms for transforming metric information in data mining is proposed and investigated. The correction problem of a local perturbation of a semimetric on a finite set of objects is considered. In the framework of the proposed scheme, algorithms correcting the changes of the distance between a pair of objects by a given quantity that preserve the metric properties are examined. Sufficient conditions under which the correction of semimetrics using the proposed three-step scheme actually completes in two steps and in some special cases even after the first step are obtained. Semimetric similarity functionals are considered, and the correction algorithms are matched to those functionals.
Key words:data mining, preprocessing of metric information, metric recognition algorithms, semimetric on a finite set, similarity of semimetrics.