-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
remove refined recompute deep copy #9617
base: develop
Are you sure you want to change the base?
Conversation
Thanks for your contribution! |
@@ -484,13 +484,13 @@ def forward(self, hidden_states): | |||
|
|||
|
|||
class QWenBlock(nn.Layer): | |||
def __init__(self, config): | |||
def __init__(self, config, layer_idx: int = 0): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
QwenBlock的外部调用,缺少了layer_idx的输入,可以检查一下。
Codecov ReportAttention: Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## develop #9617 +/- ##
===========================================
+ Coverage 52.77% 52.80% +0.02%
===========================================
Files 718 718
Lines 112154 112225 +71
===========================================
+ Hits 59193 59262 +69
- Misses 52961 52963 +2 ☔ View full report in Codecov by Sentry. |
PR types
Function optimization
PR changes
APIs
Description
删除refined recompute对config的deep copy操作。
添加对应的文档介绍。