The homepage for the Splink documentation can be found here, including a tutorial and examples that can be run in the browser. The specification of the Fellegi-Sunter statistical model behind Splink ...
Splink is a Python package for probabilistic record linkage (entity resolution) that allows you to deduplicate and link records from datasets that lack unique identifiers. It is used widely within ...
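To make the idea of probabilistic linkage concrete, here is a minimal sketch of how a Fellegi-Sunter style score can be computed for one pair of records. It is plain Python and does not use Splink's API; the function name `match_probability`, the field names, and the `m`, `u`, and prior values are all hypothetical, chosen only for illustration. Here `m` is the assumed probability that a field agrees when two records truly match, and `u` is the assumed probability that it agrees when they do not.

```python
import math


def match_probability(agreements, m, u, prior):
    """Combine per-field agreement evidence into a match probability.

    agreements: dict mapping field name -> True if the two records agree.
    m, u:       dicts of assumed agreement probabilities among true matches
                and true non-matches (illustrative values, not estimated).
    prior:      assumed probability that a random pair is a match.
    """
    # Start from the prior odds, expressed as a log2 match weight.
    weight = math.log2(prior / (1 - prior))
    for field, agrees in agreements.items():
        if agrees:
            bayes_factor = m[field] / u[field]
        else:
            bayes_factor = (1 - m[field]) / (1 - u[field])
        # Each field adds its own partial match weight.
        weight += math.log2(bayes_factor)
    odds = 2 ** weight
    return odds / (1 + odds)


# Hypothetical parameters for two comparison fields.
m = {"surname": 0.9, "dob": 0.95}
u = {"surname": 0.01, "dob": 0.001}

both_agree = match_probability({"surname": True, "dob": True}, m, u, prior=0.001)
none_agree = match_probability({"surname": False, "dob": False}, m, u, prior=0.001)
print(f"both fields agree: {both_agree:.4f}")
print(f"no fields agree:   {none_agree:.8f}")
```

Agreement on a field that rarely agrees by chance (a small `u`) moves the score up sharply, while disagreement moves it down. In practice the `m` and `u` values are estimated from the data rather than set by hand, and the sketch treats fields as conditionally independent, which is the usual simplifying assumption of the model.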
The Ministry of Justice (MoJ) has urged other government bodies to make use of its Splink software for linking datasets. MoJ data scientist Robin Linacre said in a blog post that the software, in ...