Add CUDA/HIP RCM kernels #1503
Codecov Report
Coverage diff, develop vs. #1503:

              develop    #1503     +/-
  Coverage     91.04%   91.05%
  Files           700      697      -3
  Lines         56996    56773    -223
  Hits          51894    51692    -202
  Misses         5102     5081     -21
Could you rebase this? There seem to be a lot of unrelated changes.
@MarcelKoch Not until #1485 is resolved.
83f67c5 to f47439a
Mostly LGTM. I left some minor comments.
One larger thing is that the CUDA/HIP implementations ignore the construct_inverse_permutation factory parameter. Either the parameter should be deprecated, or support for it should be added to CUDA/HIP.
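For reference, a minimal host-side sketch of what honoring construct_inverse_permutation would involve, assuming the permutation is stored as a plain index array. The helper name `invert_permutation` is hypothetical and not part of the library; this is not the code from this PR.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical helper (an illustration only, not the library's API):
// given a permutation perm of 0..n-1, return inv with inv[perm[i]] == i.
std::vector<int> invert_permutation(const std::vector<int>& perm)
{
    std::vector<int> inv(perm.size());
    for (std::size_t i = 0; i < perm.size(); ++i) {
        // Every target index is written exactly once, so the iterations
        // are independent; on a device this is a plain scatter.
        inv[perm[i]] = static_cast<int>(i);
    }
    return inv;
}
```

Since each entry is written exactly once, the same operation on the device side could likely be expressed as a single `thrust::scatter` with a counting iterator as the input and the permutation as the map, so adding it to the CUDA/HIP path should be cheap compared to the reordering itself.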
Co-authored-by: Marcel Koch <[email protected]>
Co-authored-by: Yuhsiang M. Tsai <[email protected]>
This adds Thrust-based kernels for RCM.
TODO: