-
Notifications
You must be signed in to change notification settings - Fork 501
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
- Required changes for kernels #3165
Conversation
✅ Deploy Preview for pytorch-fbgemm-docs ready!
To edit notification comments on pull requests, go to your Netlify site configuration. |
This pull request was exported from Phabricator. Differential Revision: D63151913 |
This pull request was exported from Phabricator. Differential Revision: D63151913 |
1 similar comment
This pull request was exported from Phabricator. Differential Revision: D63151913 |
955855c
to
3972d03
Compare
Summary: Pull Request resolved: pytorch#3165 X-link: facebookresearch/FBGEMM#259 Adding small changes to kernels for CompiledAutograd support. Adding `static constexpr bool is_traceable = true;` on kernels, making some kernels to use tensors instead of double and unrolling input shapes on GroupIndexSelectDim0GPUOp from vector into the ctx dict to help enablement of CompiledAutograd. Reviewed By: Microve Differential Revision: D63151913
This pull request was exported from Phabricator. Differential Revision: D63151913 |
Summary: Pull Request resolved: pytorch#3165 X-link: facebookresearch/FBGEMM#259 Adding small changes to kernels for CompiledAutograd support. Adding `static constexpr bool is_traceable = true;` on kernels, making some kernels to use tensors instead of double and unrolling input shapes on GroupIndexSelectDim0GPUOp from vector into the ctx dict to help enablement of CompiledAutograd. Reviewed By: Microve Differential Revision: D63151913
3972d03
to
f636ade
Compare
This pull request was exported from Phabricator. Differential Revision: D63151913 |
This pull request has been merged in d65942e. |
Summary:
Adding small changes to kernels for CompiledAutograd support.
Adding
static constexpr bool is_traceable = true;
on kernels, making some kernels to use tensors instead of double and unrolling input shapes on GroupIndexSelectDim0GPUOp from vector into the ctx dict to help enablement of CompiledAutograd.Differential Revision: D63151913