-
Notifications
You must be signed in to change notification settings - Fork 501
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Enable E2E MoE INT4 BMM with loopover #3170
Conversation
✅ Deploy Preview for pytorch-fbgemm-docs ready!
To edit notification comments on pull requests, go to your Netlify site configuration. |
c503c93
to
5b0fbc9
Compare
This pull request was exported from Phabricator. Differential Revision: D63303484 |
This pull request was exported from Phabricator. Differential Revision: D63303484 |
5b0fbc9
to
bca97d0
Compare
This pull request was exported from Phabricator. Differential Revision: D63303484 |
bca97d0
to
a44fe1f
Compare
Summary: Pull Request resolved: pytorch#3170 X-link: facebookresearch/FBGEMM#264 - Enable E2E MoE INT4 BMM with loopover - Support 3d tensors in quantize_marlin_int4 - Add unit tests Reviewed By: jianyuh Differential Revision: D63303484
This pull request was exported from Phabricator. Differential Revision: D63303484 |
Summary: Pull Request resolved: pytorch#3170 X-link: facebookresearch/FBGEMM#264 - Enable E2E MoE INT4 BMM with loopover - Support 3d tensors in quantize_marlin_int4 - Add unit tests Reviewed By: jianyuh Differential Revision: D63303484
a44fe1f
to
79c45cb
Compare
Summary: Pull Request resolved: pytorch#3170 X-link: facebookresearch/FBGEMM#264 - Enable E2E MoE INT4 BMM with loopover - Support 3d tensors in quantize_marlin_int4 - Add unit tests Reviewed By: jianyuh Differential Revision: D63303484
This pull request was exported from Phabricator. Differential Revision: D63303484 |
79c45cb
to
d7432d8
Compare
This pull request has been merged in 2cb9b22. |
Summary:
Differential Revision: D63303484