reclin2: Record Linkage Toolkit

Functions to assist in performing probabilistic record linkage and deduplication: generating pairs, comparing records, em-algorithm for estimating m- and u-probabilities (I. Fellegi & A. Sunter (1969) <doi:10.1080/01621459.1969.10501049>, T.N. Herzog, F.J. Scheuren, & W.E. Winkler (2007), "Data Quality and Record Linkage Techniques", ISBN:978-0-387-69502-0), forcing one-to-one matching. Can also be used for pre- and post-processing for machine learning methods for record linkage. Focus is on memory, CPU performance and flexibility.

Version: 0.1.1
Depends: data.table, R (≥ 3.6.0)
Imports: stringdist, stats, utils, lpSolve, Rcpp, parallel
LinkingTo: Rcpp
Suggests: simplermarkdown
Published: 2022-01-07
Author: Jan van der Laan ORCID iD [aut, cre]
Maintainer: Jan van der Laan <r at>
License: GPL-3
NeedsCompilation: yes
SystemRequirements: C++11
Materials: README
CRAN checks: reclin2 results


Reference manual: reclin2.pdf
Vignettes: Deduplication using reclin2
Introduction to reclin2
Record linkage using machine learning
Using a cluster for record linkage


Package source: reclin2_0.1.1.tar.gz
Windows binaries: r-devel:, r-release:, r-oldrel:
macOS binaries: r-release (arm64): reclin2_0.1.1.tgz, r-oldrel (arm64): reclin2_0.1.1.tgz, r-release (x86_64): reclin2_0.1.1.tgz, r-oldrel (x86_64): reclin2_0.1.1.tgz


Please use the canonical form to link to this page.