Issues: NVIDIA/NeMo-Aligner
Out of Memory (OOM) During Training a LLaMA 7B Reward Model (8 A800 40GB GPUs)
bug
Something isn't working
#444
opened Dec 11, 2024 by
qingyiaaaaa
use lightning or pytorch-lightning
bug
#438
opened Dec 10, 2024 by
better629
OOM when saving torch_dist checkpoint
bug
#436
opened Dec 7, 2024 by
Cppowboy
Fix dev branch's build after PTL upgrade
bug
#418
opened Nov 22, 2024 by
terrykong
How can I use nvidia/Llama-3.1-Nemotron-70B-Reward-HF directly for inference?
#360
opened Oct 25, 2024 by
arunasank
attribute_annotate.py is not worked by KeyError: 'exceeded'
bug
#349
opened Oct 18, 2024 by
AtsunoriFujita
[Question] Converting a Megatron-LM ckpt to nemo so we can use NeMo-Aligner for post-training
#340
opened Oct 10, 2024 by
abgoswam
Error during saving checkpoint with TensorRT-enabled PPO actor training
bug
#281
opened Sep 5, 2024 by
haizadinia
[Question] TransformerEngine and Apex dependencies
bug
#278
opened Sep 2, 2024 by
peri044
Does NeMo Aligner support tensor parallel and pipeline parallel?
#265
opened Aug 15, 2024 by
cizhenshi
GPTGenerateTRTLLM.trt_llm_exporter.refit failed due to empty weights in the refit engine during PPO actor training
bug
#264
opened Aug 10, 2024 by
renweizhukov
Job hangs or IndexError when training reward model with PP > 1
bug
#251
opened Jul 24, 2024 by
zirui
SFT not working on nemo:24.05.01 container
bug
#236
opened Jul 13, 2024 by
vecorro
Policy Log Probs and Reference Log Probs differ at 1st iteration of DPO/RPO
bug
#227
opened Jul 3, 2024 by
shengyangs