shixianc

😁

i like mcdonalds

[email protected]

0 followers · 1 following

shixianc/README.md

Hi there, I'm Shixian Cui 👋

Interested in machine learning, especially model inference optimization.

Connect with me:

email: [email protected]

LinkedIn: Shixian Cui

school projects

Pinned

vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
ray-project/ray Public

Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Python 32.2k 5.5k
vllm-project/vllm Public

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 23.4k 3.3k
triton-inference-server/server Public

The Triton Inference Server provides an optimized cloud and edge inferencing solution.

Python 7.8k 1.4k
NVIDIA/TensorRT-LLM Public

TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…

C++ 7.6k 831
triton-inference-server/model_navigator Public

Triton Model Navigator is an inference toolkit designed for optimizing and deploying Deep Learning models with a focus on NVIDIA GPUs.

Python 169 24