Model Series
Qwen2.5
What are the models used?
Qwen2.5-72B-Instruct
What is the scenario where the problem happened?
Fine-tuning Qwen2.5-72B-Instruct on reasoning data; the model fails to converge.
Is this badcase known and can it be solved using available techniques?
Information about environment
GPU: H100
DATASET: KingNish/reasoning-base-20k
batch size: 1
max_length:int = 3000
batch_size:int = 1
gradient_accumulation_steps:int = 16
log_iter:int = 10
max_lr:float = 2e-3
min_lr:float = 2e-4
warmup_steps:int = 1000
Description
Steps to reproduce
This happens to Qwen2.5-72B-Instruct.
The badcase can be reproduced with the following steps:
max_length:int = 3000
batch_size:int = 1
gradient_accumulation_steps:int = 16
log_iter:int = 10
max_lr:float = 2e-3
min_lr:float = 2e-4
warmup_steps:int = 1000
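Fine-tune a reasoning model with the parameters above. Dataset: KingNish/reasoning-base-20k. The same code and parameters train successfully on Qwen2.0-72B-Instruct, but Qwen2.5-72B-Instruct fails to converge, and the backward pass raises an error.
In addition, for Qwen2.0-72B-Instruct, lowering the learning rate by an order of magnitude also leads to a failure to converge.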
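For anyone trying to reproduce this, the sketch below shows what learning rate the parameters above would produce at each optimizer step. The report does not say which schedule the training code uses. Linear warmup followed by cosine decay is only an assumption here, and `lr_at` and `total_steps` are hypothetical names that do not come from the original script.

```python
import math

# Values copied from the report.
batch_size = 1
gradient_accumulation_steps = 16
max_lr = 2e-3
min_lr = 2e-4
warmup_steps = 1000


def lr_at(step: int, total_steps: int) -> float:
    """Learning rate at a 0-indexed optimizer step.

    Assumed shape (not confirmed by the report): linear warmup to max_lr
    over warmup_steps, then cosine decay down to min_lr at total_steps.
    """
    if step < warmup_steps:
        return max_lr * (step + 1) / warmup_steps
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    progress = min(1.0, progress)
    return min_lr + 0.5 * (max_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


# Sequences contributing to one optimizer update on a single device.
sequences_per_update = batch_size * gradient_accumulation_steps

if __name__ == "__main__":
    total_steps = 5000  # hypothetical run length, for illustration only
    for step in (0, 500, 999, 1000, 3000, 5000):
        print(step, lr_at(step, total_steps))
    print("sequences per update:", sequences_per_update)
```

Printing the schedule this way makes it easy to compare the learning rate at the step where the loss diverges or the backward error appears between the Qwen2.0 and Qwen2.5 runs.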