Replies: 1 comment
-
Good question! Astred works at the sentence level (using word-level information). It doesn't work at the paragraph level, and I'm not sure which paragraph or document aligners are any good. Maybe have a look at the bitext miners that are used to crawl and create parallel corpora?
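To illustrate what a paragraph-level aligner has to deal with when the two sides don't split into the same number of sentences, here is a minimal sketch. This is not how Astred works and it is not a recommendation of a specific tool: it is a toy length-based dynamic program, loosely in the spirit of classic length-based sentence aligners. It assumes the sentences are already split (for example by spaCy) and that translated sentences are roughly proportional in character length. The names `align_sentences`, `BEADS` and `PENALTY` are made up for this example, and the penalty values are hand-picked assumptions, not tuned on data.

```python
from typing import List, Tuple

# Allowed group shapes: (source sentences consumed, target sentences consumed).
BEADS = [(1, 1), (1, 0), (0, 1), (2, 1), (1, 2)]
# Hand-picked penalties (assumption): prefer 1:1, then merges, then skips.
PENALTY = {(1, 1): 0.0, (2, 1): 1.0, (1, 2): 1.0, (1, 0): 3.0, (0, 1): 3.0}


def bead_cost(src_len: int, tgt_len: int, shape: Tuple[int, int]) -> float:
    """Relative length mismatch of the group plus a fixed penalty for its shape."""
    longest = max(src_len, tgt_len)
    if src_len == 0 or tgt_len == 0 or longest == 0:
        return PENALTY[shape]  # skipped sentence (or empty text): penalty only
    return abs(src_len - tgt_len) / longest + PENALTY[shape]


def align_sentences(src: List[str], tgt: List[str]) -> List[Tuple[List[int], List[int]]]:
    """Return groups of (source indices, target indices) with minimal total cost."""
    n, m = len(src), len(tgt)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    back = [[None] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(n + 1):
        for j in range(m + 1):
            if cost[i][j] == inf:
                continue
            for di, dj in BEADS:
                ni, nj = i + di, j + dj
                if ni > n or nj > m:
                    continue
                src_len = sum(len(s) for s in src[i:ni])
                tgt_len = sum(len(t) for t in tgt[j:nj])
                candidate = cost[i][j] + bead_cost(src_len, tgt_len, (di, dj))
                if candidate < cost[ni][nj]:
                    cost[ni][nj] = candidate
                    back[ni][nj] = (i, j)
    # Walk the back-pointers from the end to recover the chosen groups.
    groups = []
    i, j = n, m
    while (i, j) != (0, 0):
        pi, pj = back[i][j]
        groups.append((list(range(pi, i)), list(range(pj, j))))
        i, j = pi, pj
    groups.reverse()
    return groups
```

Each returned group pairs one or two source sentences with one or two target sentences (or with nothing, if a sentence had to be skipped), so the merged groups could then be passed to a sentence-level aligner. Character length is a crude signal; a bitext miner would typically compare sentence embeddings instead.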
-
Do you have any recommendations for aligning two paragraphs? From my usage of spaCy so far, there seems to be no guarantee that it will split each paragraph into the same number of sentences, so the sentences can't always be mapped 1:1 to each other.