Michael Goin mgoin

LLM inference optimization and HPC. Engineering Lead @neuralmagic. Committer @vllm-project.

78 followers · 53 following

Pinned

vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python · 32.1k stars · 4.9k forks
vllm-project/llm-compressor Public

Transformers-compatible library for applying various compression algorithms to LLMs for optimized deployment with vLLM

Python · 798 stars · 66 forks
neuralmagic/deepsparse Public

Sparsity-aware deep learning inference runtime for CPUs

Python · 3.1k stars · 175 forks
neuralmagic/sparseml Public

Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

Python · 2.1k stars · 148 forks
advos Public

RISC-V OS in Rust with hardware support for SiFive's HiFive1 board

Rust
torch_bitmask Public

Implementations for fast bitmask compression for weight sparsity in PyTorch

Python · 3 stars