Training coreference resolver on Italian Ontonotes produces low scores #12913
Replies: 2 comments 1 reply
-
Hi @DvdGhl78, sorry for the late response! Which corpus are you using exactly? Can you provide a link? I was under the impression that OntoNotes isn't available in Italian.
Yes.
Can you clarify why you think your config contains errors? Also, can you tell me what exactly you changed in your config compared to https://github.com/explosion/projects/blob/v3/experimental/coref? |
Beta Was this translation helpful? Give feedback.
-
There's no public OntoNotes for IT but I requested a translated version from here.
Then I converted these files into spacy docbins with:
After checking that all was correct, I started the pipeline with the config above. I don't really know if the error is in the config or in the data conversion. Since I couldn't find a solution, in the mean time I tried the implementation of coref-hoi and achieved a 73% F1. Iterating the docbins I get the following sample output:
|
Beta Was this translation helpful? Give feedback.
-
I transformed Ontonotes in a format readable by Spacy but the training doesn't overcome 25% in the SCORE metric.
Data should be ok, so the problems could be the following two:
Beta Was this translation helpful? Give feedback.
All reactions