Detect duplicate or similar publications in a database. This project aims to reduce the size of the database by reporting pairs of suspected duplicates, which makes citing easier and cleaner.
Export the database as a CSV file without a header row, with these fields:
- ID
- Authors
- Title of the article
- Year
- Abstract
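
To illustrate the expected input, here is a minimal sketch that parses such a header-less file with Python's standard `csv` module. The `Publication` record, the `read_publications` helper, and the sample rows are hypothetical and not taken from `report.py`. The sketch also assumes the columns appear in the order listed above.

```python
import csv
import io
from typing import NamedTuple


class Publication(NamedTuple):
    # Hypothetical record; field order is assumed to match the list above.
    id: str
    authors: str
    title: str
    year: str
    abstract: str


def read_publications(handle):
    """Parse header-less CSV rows into Publication records, skipping blank lines."""
    return [Publication(*row) for row in csv.reader(handle) if row]


# Made-up sample data standing in for an exported publications.csv.
sample = io.StringIO(
    '1,"Smith, J.",Deep learning for cats,2019,We study cats.\n'
    '2,"Smith, J.",Deep Learning for Cats,2019,We study cats again.\n'
)
records = read_publications(sample)
for record in records:
    print(record.id, record.title)
```

Fields that contain commas, such as author lists, need to be quoted in the export so that each row still splits into exactly five columns.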
Run with:
python3 report.py publications.csv
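
How `report.py` scores similarity is not described here. As a rough idea of what pairwise duplicate detection can look like, the sketch below compares normalized titles with `difflib.SequenceMatcher` from the standard library. The `normalize` and `similar_pairs` helpers, the 0.9 threshold, and the sample titles are all assumptions for illustration, not the project's actual method.

```python
from difflib import SequenceMatcher
from itertools import combinations


def normalize(title):
    """Lowercase and collapse whitespace so trivial differences do not matter."""
    return " ".join(title.lower().split())


def similar_pairs(titles, threshold=0.9):
    """Return (id_a, id_b, ratio) for every pair of titles at or above threshold.

    `titles` maps a publication ID to its title. The threshold is an assumed value.
    """
    pairs = []
    for (id_a, title_a), (id_b, title_b) in combinations(titles.items(), 2):
        ratio = SequenceMatcher(None, normalize(title_a), normalize(title_b)).ratio()
        if ratio >= threshold:
            pairs.append((id_a, id_b, ratio))
    return pairs


# Made-up sample titles keyed by publication ID.
titles = {
    "1": "Deep learning for cats",
    "2": "Deep Learning  for Cats",
    "3": "A survey of graph databases",
}
suspects = similar_pairs(titles)
for id_a, id_b, ratio in suspects:
    print(f"{id_a} ~ {id_b} (similarity {ratio:.2f})")
```

Comparing every pair is quadratic in the number of publications, so a large database would likely need some form of blocking first, for example only comparing records from the same year.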