This project is an interactive entity resolution plugin for Elasticsearch based on Duke. Basically, it uses [Bayesian probabilities] (http://en.wikipedia.org/wiki/Bayesian_probability) to compute probability. You can pretty much use it as an interactive deduplication engine.
To understand basics, go to Duke project documentation.
A list of [available comparators] (https://github.com/larsga/Duke/wiki/Comparator) is available here.
This project is licended under APL V2
Copyright (c) 2014 Yann Barraud