support quantize_fp8_row for up to 4d non contiguous tensor #3508
This pull request was exported from Phabricator. Differential Revision: D67303282
…3508) Summary: X-link: facebookresearch/FBGEMM#589. Reland D66990975 with a fix for the NaN issue observed during a LLaMa4 17B model run with fp8_rowwise FFN. Specifically, the offset was not properly updated when loading/storing data. Differential Revision: D67303282
c863b9a to 9d0dd32
9d0dd32 to 0b9537a
0b9537a to 8495846
8495846 to dcbfabd
This pull request has been merged in eaa0961.
Summary:
Reland D66990975 with a fix for the NaN issue observed during a LLaMa4 17B model run with fp8_rowwise FFN.
Specifically, the offset was not properly updated when loading/storing data.
Differential Revision: D67303282
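
To illustrate why the offset matters, here is a minimal pure-Python sketch of row-wise scaling over a strided (possibly non-contiguous) view of up to 4 dimensions. It is not the FBGEMM kernel: the function name `quantize_rows_strided`, the `block` parameter, the e4m3 maximum of 448, and the convention that the scale is `row_max / 448` are all assumptions made for illustration, and no real FP8 rounding is performed. The point is only that each block's load offset has to be recomputed from the row base and the last-dimension stride; reusing a stale offset would read or write the wrong elements.

```python
from itertools import product

FP8_E4M3_MAX = 448.0  # largest finite float8 e4m3fn value (assumed target format)


def quantize_rows_strided(buf, shape, strides, block=4, eps=1e-12):
    """Hypothetical helper: scale each row of a strided view into the FP8 range.

    buf     -- flat list of floats backing the tensor
    shape   -- up to 4 sizes; the last one is the row length
    strides -- element strides matching `shape` (need not be contiguous)
    Returns (scales, rows), with one scale and one scaled row per row.
    """
    *lead_shape, k = shape
    *lead_strides, k_stride = strides
    scales, rows = [], []
    for idx in product(*(range(n) for n in lead_shape)):
        # Base offset of this row, derived from the leading strides.
        base = sum(i * s for i, s in zip(idx, lead_strides))

        # Pass 1: row-wise absolute maximum, read block by block.
        row_max = 0.0
        for start in range(0, k, block):
            offset = base + start * k_stride  # must advance with every block
            for j in range(min(block, k - start)):
                row_max = max(row_max, abs(buf[offset + j * k_stride]))

        scale = max(row_max, eps) / FP8_E4M3_MAX

        # Pass 2: scale and clamp, using the same offset arithmetic.
        row = []
        for start in range(0, k, block):
            offset = base + start * k_stride
            for j in range(min(block, k - start)):
                v = buf[offset + j * k_stride] / scale
                row.append(min(max(v, -FP8_E4M3_MAX), FP8_E4M3_MAX))

        scales.append(scale)
        rows.append(row)
    return scales, rows


# Transposed view of a contiguous 3x4 buffer: shape (4, 3), strides (1, 4).
buf = [float(v) for v in range(12)]
scales, rows = quantize_rows_strided(buf, shape=(4, 3), strides=(1, 4), block=2)
```

With `block=2` and a row length of 3, every row spans two blocks, so the second block is only read correctly if the offset is advanced by `start * k_stride`.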