You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We observed a loss discrepancy between FSDP1 and FSDP2 while training with the AdamW optimizer. Are you aware of any known issues with the AdamW optimizer and FSDP2 that might contribute to this behavior?
The text was updated successfully, but these errors were encountered:
We have not seen this issue. You may need to provide more details about the training setup. We ran long-running numeric testing to compare FSDP1 and FSDP2 before and saw parity.
We observed a loss discrepancy between FSDP1 and FSDP2 while training with the AdamW optimizer. Are you aware of any known issues with the AdamW optimizer and FSDP2 that might contribute to this behavior?
The text was updated successfully, but these errors were encountered: