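When generating the specially formatted files for pretraining, if the input file is large (e.g. news_zh_1.txt; mine is about 600 MB), running create_pretrain_data.sh takes a very long time (>4 hours), and the process gets killed once the machine's 96 GB of memory reaches 100% usage. How is everyone handling this? Is the only option to split the file and run the unsupervised pretraining in stages?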
Same question here.
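For anyone who wants to try the file-splitting route raised in the question, here is a minimal sketch of streaming a large text file into smaller shards without loading it into memory. It assumes the input is line-oriented and that create_pretrain_data.sh can be pointed at one shard at a time, which is not confirmed in this thread. The function name `split_text_file`, the shard naming scheme, and the 50 MB default are all hypothetical choices for illustration.

```python
import os


def split_text_file(src_path, out_dir, max_bytes=50 * 1024 * 1024):
    """Stream src_path line by line into shards of roughly max_bytes each.

    Hypothetical helper: names and the default shard size are illustrative.
    Lines are never cut in half; a new shard starts only at a line boundary.
    """
    os.makedirs(out_dir, exist_ok=True)
    shard_paths = []
    out = None
    written = 0
    # Binary mode avoids decoding the text, so any encoding passes through.
    with open(src_path, "rb") as src:
        for line in src:
            # Open the first shard, or roll over once the current one is full.
            if out is None or written >= max_bytes:
                if out is not None:
                    out.close()
                path = os.path.join(out_dir, f"shard_{len(shard_paths):04d}.txt")
                out = open(path, "wb")
                shard_paths.append(path)
                written = 0
            out.write(line)
            written += len(line)
    if out is not None:
        out.close()
    return shard_paths
```

Each shard can then be processed in a separate run, so peak memory depends on the shard size rather than on the full 600 MB file. If the preprocessing step treats blank lines as document boundaries, a hard cut at an arbitrary line may split one document across two shards, so this is worth checking before relying on it.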