I noticed that some examples explicitly set the environment variable `CUDA_DEVICE_MAX_CONNECTIONS` to 1. Why is that?
I've seen some explanations indicating that this is necessary for overlapping comms / compute when using TP / SP, while others suggest that it in fact interferes with comms / compute overlap when using ZeRO-DP / FSDP.
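For concreteness, this is roughly how the setting gets applied from a Python launcher. It is only a minimal sketch: the helper name `set_cuda_max_connections` is hypothetical and not taken from any of the examples, and it assumes the variable is set before the process makes its first CUDA call, since as far as I understand the value is read when CUDA initializes.

```python
import os


def set_cuda_max_connections(value: int = 1) -> str:
    """Hypothetical helper: set CUDA_DEVICE_MAX_CONNECTIONS for this process."""
    # Assumption: this runs before any CUDA work (e.g. before the first
    # tensor is moved to the GPU), so the setting is visible at CUDA init.
    os.environ["CUDA_DEVICE_MAX_CONNECTIONS"] = str(value)
    return os.environ["CUDA_DEVICE_MAX_CONNECTIONS"]


# The value the examples in question use.
set_cuda_max_connections(1)
```

The shell equivalent is exporting the variable in the launch script before starting the training command.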