Distance with special characters #114
Comments
Hi @erikradisch, thanks for reaching out!
Sure! I use the Python package to align historical place names to a gazetteer. Your algorithm has two huge pluses. First, it can be aborted once the Levenshtein distance reaches a limit. Second, you can define additional equalities, which is very important, because historical place names contain many predictable differences that are in fact equalities (a c instead of a k, for example).
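To illustrate the two features described above, here is a minimal pure-Python sketch. It does not use edlib and is not edlib's implementation: `bounded_distance`, its parameters `k` and `equalities`, and the convention of returning -1 when the limit is exceeded are all hypothetical names and choices made for this example.

```python
def bounded_distance(a, b, k=-1, equalities=()):
    """Levenshtein distance between a and b.

    Hypothetical helper for illustration only (not edlib's API).
    k < 0 means "no limit"; otherwise -1 is returned as soon as the
    distance is known to exceed k.
    """
    # Treat every listed pair as equal in both directions.
    equal = {(x, y) for x, y in equalities} | {(y, x) for x, y in equalities}
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            same = ca == cb or (ca, cb) in equal
            curr.append(min(prev[j] + 1,                      # deletion
                            curr[j - 1] + 1,                  # insertion
                            prev[j - 1] + (0 if same else 1)))  # match or substitution
        # The minimum of a row never decreases from one row to the next,
        # so once it exceeds k the final distance must exceed k as well.
        if 0 <= k < min(curr):
            return -1
        prev = curr
    distance = prev[-1]
    return -1 if 0 <= k < distance else distance


plain = bounded_distance("coeln", "koeln")                            # c and k differ
aligned = bounded_distance("coeln", "koeln", equalities=[("c", "k")])  # c and k treated as equal
aborted = bounded_distance("kitten", "sitting", k=2)                   # limit exceeded
```

With the pair `("c", "k")` declared equal, the two spellings no longer count as different, and the limit `k` lets a gazetteer lookup discard poor candidates without finishing the full computation.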
It seems to me that edlib does not calculate the correct distance when the strings contain special characters (with diacritics).
For example:
übund - ubung should have a distance of 1, but I end up with 3. Is this a bug, or is it intended?
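One possible explanation, offered here as an assumption and not confirmed in this thread, is that the strings are compared as UTF-8 bytes and not as characters: ü occupies two bytes in UTF-8, so it costs two edits against u. The sketch below uses a plain textbook Levenshtein function (the helper `levenshtein` is written for this example and is not edlib code) to compare both views of the same pair.

```python
def levenshtein(a, b):
    """Textbook Levenshtein distance over any two sequences (str or bytes)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,             # deletion
                            curr[j - 1] + 1,         # insertion
                            prev[j - 1] + (x != y)))  # match or substitution
        prev = curr
    return prev[-1]


query, target = "übund", "ubung"

# Compare Unicode code points: ü vs u and d vs g differ.
by_chars = levenshtein(query, target)

# Compare UTF-8 bytes: ü becomes the two bytes 0xC3 0xBC.
by_bytes = levenshtein(query.encode("utf-8"), target.encode("utf-8"))

print(by_chars, by_bytes)  # prints "2 3"
```

The byte-level result matches the reported 3. Counted by characters, this particular pair differs in two positions (ü/u and d/g), while übung - ubung would give 1.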