Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parsing docs or examples to Multiprocessing #10

Open
kunansy opened this issue Aug 6, 2020 · 2 comments
Open

Parsing docs or examples to Multiprocessing #10

kunansy opened this issue Aug 6, 2020 · 2 comments
Labels
enhancement New feature or request

Comments

@kunansy
Copy link
Owner

kunansy commented Aug 6, 2020

Parsing in MultilingualParaCorpus gets a lot of time.
Which of the parse method to Multiprocessing: parse_doc or parse_example? Profile the project to know.

@kunansy kunansy added the enhancement New feature or request label Aug 6, 2020
@kunansy
Copy link
Owner Author

kunansy commented Aug 6, 2020

If the Multiprocessing gets more time, use it only in MultilingualParaCorpus .

@kunansy
Copy link
Owner Author

kunansy commented Dec 13, 2020

Move parse_page to Multiprocessing in the main Corpus class.
Set count of processes according to count of CPU cores.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant