Data Quality CZ - portál věnující se tématu kvalitních dat

Výzkum v oblasti řízení datové kvality

SOMACODE – The Matching Strategy

Research Goals

The goal of this research it ot develope SOMACODE (Sorted Matching Code) strategy using advanced matching codes definition for data deduplication, matchnig and merging. Advantages of this approach are demonstrated on artificial data containing typical data errors from European national environment. Performance of this method is compared with similarity metrics included in SimMetrics library provided by UK Sheffield University funded by (AKT) an IRC sponsored by EPSRC, grant number GR/N15764/01

The Logic of SOMACODE

Structure of Knowledge Base

Document Tree