Popular repositories Loading
-
vllm
vllm PublicForked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
-
neural-compressor
neural-compressor PublicForked from intel/neural-compressor
Intel® Neural Compressor (formerly known as Intel® Low Precision Optimization Tool), targeting to provide unified APIs for network compression technologies, such as low precision quantization, spar…
Python
-
accelerate
accelerate PublicForked from huggingface/accelerate
🚀 A simple way to train and use PyTorch models with multi-GPU, TPU, mixed-precision
Python
-
-
optimum-intel
optimum-intel PublicForked from huggingface/optimum-intel
Accelerate inference of 🤗 Transformers with Intel optimization tools
Jupyter Notebook 1
-
transformers
transformers PublicForked from huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python 3
If the problem persists, check the GitHub status page or contact support.