-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Using spacyr for language processing instead of the current UD treebank #87
Comments
Thank you for your comment. We have already considered using spaCy and decided to continue with Udpipe. This is because spaCy is not native in R but requires a Python installation, which often leads to numerous errors and requires a lot of work on the part of the user. It took me a whole day to get a properly functioning Python environment on my Mac to be able to use spacyr. To improve Udpipe's performance, we plan to train updated models for the most commonly used languages. This will be done in the coming months. |
@massimoaria Thank you! |
I tried using spaCy to parse CJK languages. I can attach the files FYI. And the modified lines in Server.R as below:
|
I found UD treebank models performed very weakly for some languages, esp. for CJK languages. spaCy supports so many languages and performs much better than UD treebank models.
The text was updated successfully, but these errors were encountered: