Replies: 1 comment 5 replies
-
lower the batch size, and increase the gradient accumulation to account for the change. Also, you can change the maximum sequence length in the tokenizer to make the model size smaller |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
I am trying to fine-tune arabert for the sentiment analysis task, I am following the jupyter notebook here.
However, when It comes to training my Trainer object, I get this error:
RuntimeError: cuda runtime error (711) : peer mapping resources exhausted at /pytorch/aten/src/THC/THCGeneral.cpp:139
Thanks in advance for your help
Beta Was this translation helpful? Give feedback.
All reactions