You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi, in cases like "3. Minute" the sentence wrongly ends at "3." according to Punkt.
I see you have an effective list of words (notably months) in packages/tokenizers/punkt_tab/german/collocations.tab but it is incomplete. It would be useful to add the following time expressions:
##number## sekunde
##number## minute
##number## stunde
##number## tag
##number## woche
##number## monat
##number## jahr
I'm not sure how to proceed, can I open a PR to change the file directly or are other steps involved?
The text was updated successfully, but these errors were encountered:
Hi, in cases like "3. Minute" the sentence wrongly ends at "3." according to Punkt.
I see you have an effective list of words (notably months) in
packages/tokenizers/punkt_tab/german/collocations.tab
but it is incomplete. It would be useful to add the following time expressions:I'm not sure how to proceed, can I open a PR to change the file directly or are other steps involved?
The text was updated successfully, but these errors were encountered: