Question about gradient_checkpointing in the test code #208

Open
lixinyyang opened this issue Nov 22, 2024 · 1 comment

Comments

@lixinyyang

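Hello! Great work! I still have a question, though. During inference I set gradient_checkpointing to False, and the output quality degraded: overall, the individual clips are no longer continuous with one another. Why does this problem appear when gradient checkpointing is disabled?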

@xumingw
Contributor

xumingw commented Nov 26, 2024

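In the code, gradient_checkpointing being on and being off go through two different conditional branches. If you need to turn it off, please modify the code in the corresponding branch.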
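For readers unfamiliar with this pattern, here is a minimal sketch in plain Python of how two branches selected by a flag can drift apart. It is not the repository's code: `layer`, `fake_checkpoint`, `forward`, `forward_fixed`, and the `context` argument are all hypothetical names made up for illustration, and the actual difference between the branches in the project may be something else entirely.

```python
# Hypothetical illustration only: none of these names come from the repository.

def layer(x, context=None):
    """Stand-in for a network layer; `context` mimics information shared across clips."""
    return x + (context if context is not None else 0)


def fake_checkpoint(fn, *args):
    """Stand-in for a checkpoint wrapper: here it simply calls fn(*args)."""
    return fn(*args)


def forward(x, context, gradient_checkpointing):
    if gradient_checkpointing:
        # Branch A: all arguments are forwarded through the wrapper.
        return fake_checkpoint(layer, x, context)
    # Branch B: a separately written call that, in this sketch, drops `context`.
    return layer(x)


def forward_fixed(x, context, gradient_checkpointing):
    if gradient_checkpointing:
        return fake_checkpoint(layer, x, context)
    # Branch B brought in line with branch A.
    return layer(x, context)


out_on = forward(1, 10, gradient_checkpointing=True)
out_off = forward(1, 10, gradient_checkpointing=False)
fixed_off = forward_fixed(1, 10, gradient_checkpointing=False)
```

In this toy version, `out_on` and `out_off` differ only because the second branch was written with different arguments, and `fixed_off` matches `out_on` once the two calls agree. When comparing the real branches, the thing to look for is any argument or step that is present in one branch but missing in the other.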
