Free academic record linkage software

Merge ToolBox (MTB)

The German Record Linkage Center (GRLC) maintains the Merge ToolBox (MTB), a record linkage and de-duplication program written in Java. MTB supports both probabilistic and distance-based linkage techniques. It is free for academic use only.

Download MTB here
Download the MTB manual here


Software from previous projects:

Safelink

Safelink implements a protocol we developed for determining string similarities in a privacy-preserving manner. The main idea is to split identifying strings into q-grams and store these q-grams in Bloom filters using keyed cryptographic hash functions (HMACs). Safelink is free for academic use only.

Download Safelink here
Download the Safelink manual here
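
The idea behind Safelink can be illustrated with a short Python sketch. This is not Safelink's actual code: the function names, the filter length (1000 bits), the number of hash functions (10), the padding character, the choice of HMAC-SHA256 and HMAC-SHA512, and the double-hashing scheme are all assumptions made for illustration. Each q-gram is hashed with a secret key into several bit positions, and two filters are then compared with the Dice coefficient, so similar strings yield similar filters without exposing the strings themselves.

```python
import hashlib
import hmac


def qgrams(text: str, q: int = 2) -> set[str]:
    """Split a string into padded q-grams, e.g. 'ab' -> {'_a', 'ab', 'b_'}."""
    padded = "_" * (q - 1) + text.lower() + "_" * (q - 1)
    return {padded[i:i + q] for i in range(len(padded) - q + 1)}


def bloom_filter(text: str, key: bytes, size: int = 1000,
                 num_hashes: int = 10, q: int = 2) -> set[int]:
    """Return the positions of the bits set to 1 for the q-grams of `text`.

    Assumption: positions are derived by double hashing from two HMACs
    (position_i = (h1 + i * h2) mod size). All parameters are illustrative.
    """
    bits: set[int] = set()
    for gram in qgrams(text, q):
        data = gram.encode("utf-8")
        h1 = int.from_bytes(hmac.new(key, data, hashlib.sha256).digest(), "big")
        h2 = int.from_bytes(hmac.new(key, data, hashlib.sha512).digest(), "big")
        for i in range(num_hashes):
            bits.add((h1 + i * h2) % size)
    return bits


def dice(a: set[int], b: set[int]) -> float:
    """Dice coefficient of two Bloom filters given as sets of set-bit positions."""
    if not a and not b:
        return 1.0
    return 2 * len(a & b) / (len(a) + len(b))


if __name__ == "__main__":
    secret = b"shared-secret-key"  # hypothetical key known only to the data owners
    smith = bloom_filter("Smith", secret)
    smyth = bloom_filter("Smyth", secret)
    jones = bloom_filter("Jones", secret)
    print("Smith vs Smyth:", round(dice(smith, smyth), 3))
    print("Smith vs Jones:", round(dice(smith, jones), 3))
```

In this sketch, both data owners must use the same key and parameters for their filters to be comparable. A party that sees only the filters can compute similarities but does not directly see the underlying names.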

Test Data Generator (TDGen)

TDGen is a pre-built workflow for KNIME that lets the user insert errors into test data in order to test record linkage procedures. An accompanying publication includes an installation guide for TDGen.

Download TDGen here

Contact
Dr. Manfred Antoni
Tel.: +49 203/379-5645
Fax: +49 203/379-1728
recordlinkag[at]iab.de