-
Notifications
You must be signed in to change notification settings - Fork 501
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Fp8 hipstream fix #3127
Fp8 hipstream fix #3127
Conversation
❌ Deploy Preview for pytorch-fbgemm-docs failed.
|
0c9fc22
to
863f596
Compare
We use hipify script for this, is the change here needed? |
Rebase to resolve conflicts: also as Xiaodong mentioned, internally we don't need this (automatically hipify). Is this needed on OSS workflow? I guess hipify torch is not enabled? |
c948fe8
to
e94d9e9
Compare
@@ -12,10 +12,13 @@ | |||
#include <numeric> | |||
|
|||
#include <ATen/ATen.h> | |||
#if !defined(USE_ROCM) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
hipify script should hipify .h right?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
in any case, I think maybe we can just remove this block just to be clean.
@@ -12,11 +12,14 @@ | |||
#include <numeric> | |||
|
|||
#include <ATen/ATen.h> | |||
#include <c10/hip/HIPStream.h> | |||
#if !defined(USE_ROCM) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
is this block even needed? If USE_ROCM is not defined, it'll basically be an empty file. So we can just remove this block here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Let's just remove this include and move it down to the USE_ROCM. No need to include cuda header here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
changed as required. Please review.
@@ -12,10 +12,13 @@ | |||
#include <numeric> | |||
|
|||
#include <ATen/ATen.h> | |||
#if !defined(USE_ROCM) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
in any case, I think maybe we can just remove this block just to be clean.
ee380a4
to
05e9b76
Compare
@xw285cornell has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I guess we should never use nvcc to build those files so we should be ok?
@@ -201,4 +199,3 @@ at::Tensor f8f8bf16_rowwise_impl( | |||
return Y; | |||
} | |||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@acoskunses-AMD can you remove the blank lines at the end of each file? Otherwise we cannot land it internally due to some lint check
05e9b76
to
1e4e129
Compare
@xw285cornell has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator. |
@xw285cornell merged this pull request in 8e7beba. |
Pick hip stream where cuda is not defined