Block or Report
Block or report wangclnlp
Report abuse
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abusePinned Loading
-
DeepSpeed-Chat-Extension
DeepSpeed-Chat-Extension PublicThis repo contains some extensions of deepspeed-chat for fine-tuning LLMs (SFT+RLHF).
-
Vision-LLM-Alignment
Vision-LLM-Alignment PublicThis repo contains the codes for supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF) designed for vision LLMs.
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.