Magic mirror football leagues forum

broken image
broken image

Unfortunately, the cost of obtaining these examples is typically high, often requiring a researcher to manually review pairs of records in order to make an informed judgement about whether they are a match. Naturally, the accuracy of a candidate linking algorithm can be substantially improved when an algorithm has access to “ground-truth” examples-matches which can be validated using institutional knowledge or auxiliary data. To address this problem, researchers have developed probabilistic record linkage algorithms which use statistical patterns in identifying characteristics to perform linking tasks. While linking records across large administrative datasets has the potential to revolutionize empirical social science research, many administrative data files do not have common identifiers and are thus not designed to be linked to others.

broken image