Skip to content

Issues: hiyouga/LLaMA-Factory

🚨FAQs | 常见问题🚨
#4614 opened Jun 28, 2024 by hiyouga
Open
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

the Difference of Memory Usage between Llama-factory and Transformer Trainer pending This problem is yet to be addressed
#6435 opened Dec 24, 2024 by Znull-1220
1 task done
lora微调Mamba-Codestral-7B-v0.1出现了问题 pending This problem is yet to be addressed
#6434 opened Dec 24, 2024 by tongzeliang
1 task done
寒武纪:咱们是否能支持寒武纪? pending This problem is yet to be addressed
#6429 opened Dec 24, 2024 by y149604146
1 task done
Ascend NPU 910B3采用deepspeed引擎训练,Q1:未调用NPU,Q2:NPU健康状态是否影响训练。 npu This problem is related to NPU devices pending This problem is yet to be addressed
#6428 opened Dec 24, 2024 by Lexlum
1 task done
奖励模型能否不是一个model,而是一个自己定义的函数 pending This problem is yet to be addressed
#6423 opened Dec 23, 2024 by cdhx
1 task done
ppo训练相关问题 pending This problem is yet to be addressed
#6419 opened Dec 22, 2024 by ccp123456789
Tokenizer does not derive the newer config pending This problem is yet to be addressed
#6415 opened Dec 21, 2024 by xiaosu-zhu
1 task done
Questions about resuming training form ckpt pending This problem is yet to be addressed
#6414 opened Dec 21, 2024 by Jiawei-Guo
1 task done
Why Speed per iteration slower when dataset is large pending This problem is yet to be addressed
#6410 opened Dec 20, 2024 by coding2debug
1 task done
sft have bug while lora run successfully pending This problem is yet to be addressed
#6405 opened Dec 20, 2024 by TimeFlysLeo
1 task done
How to reproduce the paper results? pending This problem is yet to be addressed
#6387 opened Dec 19, 2024 by StiphyJay
1 task done
LLaMA-Factory对话预期之外存在问题 pending This problem is yet to be addressed
#6386 opened Dec 19, 2024 by 3237522375
1 task done
如何把我训练的奖励模型放到ppo的工作管线里 pending This problem is yet to be addressed
#6385 opened Dec 19, 2024 by chcoo
1 task done
LLava Series (7B, 14B) freeze_vision_tower=false bug pending This problem is yet to be addressed
#6376 opened Dec 18, 2024 by xirui-li
1 task done
多节点使用zero3速度很慢 pending This problem is yet to be addressed
#6372 opened Dec 18, 2024 by HelloWorld506
1 task done
webui加载qwen2-vl-7b进行chat报错 pending This problem is yet to be addressed
#6371 opened Dec 18, 2024 by laoqiongsuan
1 task done
Can you support fast resume with streaming option? pending This problem is yet to be addressed
#6352 opened Dec 16, 2024 by JonghwanMun
1 task done
Support phi-4 released by msft on 2024-12-16 pending This problem is yet to be addressed
#6346 opened Dec 16, 2024 by yx-lamini
1 task done
支持Cohere2架構的c4ai-command-r7b-12-2024 pending This problem is yet to be addressed
#6338 opened Dec 15, 2024 by win10ogod
选择pissa微调时,当提示训练完成后,转换为lora时会报错 pending This problem is yet to be addressed
#6331 opened Dec 13, 2024 by therealoliver
1 task done
when will suport internvl2.5, minicpm, ovis or any newest vlm? pending This problem is yet to be addressed
#6328 opened Dec 13, 2024 by fuxuelinwudi
1 task done
InternVL2.5-8B enhancement New feature or request pending This problem is yet to be addressed
#6322 opened Dec 12, 2024 by saeedkhaki92
1 task done
NLG评估DPO,不输出结果 pending This problem is yet to be addressed
#6321 opened Dec 12, 2024 by sunxiaoyu12
1 task done
ProTip! Adding no:label will show everything without a label.