Skip to content

Why number of Analyzer Results is different from Anonymizer Results? #1056

Answered by omri374
neboduus asked this question in Q&A
Discussion options

You must be logged in to vote

Hi @neboduus. Thanks for the kind words.

One reason for not having a 1-1 mapping between the results coming from the analyzer and the results coming from the anonymizer, is conflict resolution happening within the anonymizer. In case of overlaps, the analyzer would return multiple results whereas the anonymizer could potentially return only one result. For example, the string [email protected] could produce two types of PII: EMAIL and URL, however during the anonymization process there is handling of overlaps, which would only keep one of them. More on this can be found here: https://microsoft.github.io/presidio/anonymizer/#handling-overlaps-between-entities

Could this be the re…

Replies: 1 comment 1 reply

Comment options

You must be logged in to vote
1 reply
@neboduus
Comment options

Answer selected by neboduus
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants