Skip to content

Allows to delete aproximate duplicates in a pandas DataFrame using the Levenstein distance

Notifications You must be signed in to change notification settings

crsegerie/Duplicates

Repository files navigation

This library is aiming at deleting approximate string duplicate in a pandas dataframe.

You can open the jupyter example named : duplicates.ipynb in order to see how it works.

About

Allows to delete aproximate duplicates in a pandas DataFrame using the Levenstein distance

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published