Popular repositories Loading
-
-
Megatron-DeepSpeed
Megatron-DeepSpeed PublicForked from bigscience-workshop/Megatron-DeepSpeed
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Python
-
Llama-X
Llama-X PublicForked from AetherCortex/Llama-X
Open Academic Research on Improving LLaMA to SOTA LLM
Python
-
openspg
openspg PublicForked from OpenSPG/openspg
OpenSPG is a Knowledge Graph Engine developed by Ant Group in collaboration with OpenKG, based on the SPG (Semantic-enhanced Programmable Graph) framework. Core Capabilities: 1) domain model constr…
Java
-
-
swift
swift PublicForked from modelscope/ms-swift
ms-swift: Use PEFT or Full-parameter to finetune 300+ LLMs or 50+ MLLMs. (Qwen2, GLM4v, Internlm2.5, Yi, Llama3.1, Llava-Video, Internvl2, MiniCPM-V, Deepseek, Baichuan2, Gemma2, Phi3-Vision, ...)
Python
If the problem persists, check the GitHub status page or contact support.