LLM, CUDA/Systems, Distributed Training
KAUST (King Abdullah University of Science and Technology)
Pinned
- Tiny-DeepSpeed: a minimalistic re-implementation of the DeepSpeed library (see the ZeRO-style sketch below)
- Tiny-Megatron: a minimalistic re-implementation of the Megatron library (Python; see the tensor-parallel sketch below)
- TheCoreTeam/core_scheduler: CoreScheduler, a high-performance scheduler for large model training
- Flash-Attention-Implementation: an implementation of Flash-Attention (both forward and backward) with PyTorch, CUDA, and Triton (Python; see the tiled-attention sketch below)
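Tiny-DeepSpeed's one-line description doesn't say which DeepSpeed pieces it re-implements. DeepSpeed is best known for ZeRO-style partitioning of optimizer state across data-parallel ranks, so here is a minimal single-process sketch of that idea. Everything in it (the `ShardedSGD` name, the round-robin sharding, the SGD-with-momentum choice) is a hypothetical illustration, not code from the repo.

```python
import torch


class ShardedSGD:
    """Sketch of ZeRO-1-style sharding (assumed, not from Tiny-DeepSpeed):
    each rank keeps optimizer state (here: momentum buffers) only for its
    own shard of the parameter list, cutting state memory by ~world_size."""

    def __init__(self, params, rank, world_size, lr=0.01, momentum=0.9):
        self.params = list(params)
        self.lr, self.momentum = lr, momentum
        # Round-robin assignment of parameters to ranks.
        self.owned = [i for i in range(len(self.params)) if i % world_size == rank]
        # Momentum buffers exist only for owned parameters.
        self.buf = {i: torch.zeros_like(self.params[i]) for i in self.owned}

    @torch.no_grad()
    def step(self):
        for i in self.owned:
            p = self.params[i]
            if p.grad is None:
                continue
            self.buf[i].mul_(self.momentum).add_(p.grad)
            p.add_(self.buf[i], alpha=-self.lr)
            # In a real multi-process run, the owner would then broadcast
            # the updated parameter to the other ranks.
```

In full ZeRO the gradients are also reduce-scattered before the step and parameters all-gathered after it; that communication is elided here.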
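Megatron's signature technique is tensor (model) parallelism, for example splitting a linear layer's weight along the output dimension so each rank computes a slice of the output. The sketch below simulates two "ranks" in one process to show the idea; the class name and constructor are hypothetical, not Tiny-Megatron's API, and the all-gather collective is elided.

```python
import torch
import torch.nn as nn


class ColumnParallelLinear(nn.Module):
    """Holds only a 1/world_size slice of the full weight, split along the
    output dimension; each rank computes its slice of the output."""

    def __init__(self, full_weight, rank, world_size):
        super().__init__()
        out_features = full_weight.shape[0]
        assert out_features % world_size == 0
        shard = out_features // world_size
        # This rank's rows of the full (out_features, in_features) weight.
        self.weight = nn.Parameter(full_weight[rank * shard:(rank + 1) * shard].clone())

    def forward(self, x):
        # Local partial output; a real implementation all-gathers these
        # slices (or feeds them straight into a row-parallel layer).
        return x @ self.weight.t()


# Two simulated "ranks": their outputs concatenate to exactly what a
# single full linear layer would produce.
full = torch.randn(8, 4)   # (out_features, in_features)
x = torch.randn(2, 4)
parts = [ColumnParallelLinear(full, r, 2)(x) for r in range(2)]
assert torch.allclose(torch.cat(parts, dim=-1), x @ full.t())
```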
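The Flash-Attention repo's description names the algorithm directly: attention computed tile by tile with an online softmax, so the full (seq, seq) score matrix is never materialized. Below is a plain-PyTorch sketch of that recurrence for intuition only; the repo's CUDA/Triton kernels fuse this into single passes, and none of its actual API is reproduced here.

```python
import torch


def tiled_attention(q, k, v, block=64):
    """Online-softmax attention over key/value tiles. Shapes: (seq, d).
    Keeps a running row max and row sum instead of the full score matrix."""
    seq, d = q.shape
    scale = d ** -0.5
    out = torch.zeros_like(q)
    row_max = torch.full((seq, 1), float("-inf"))
    row_sum = torch.zeros(seq, 1)
    for start in range(0, seq, block):
        kb = k[start:start + block]           # key tile, (b, d)
        vb = v[start:start + block]           # value tile, (b, d)
        s = (q @ kb.t()) * scale              # scores for this tile, (seq, b)
        new_max = torch.maximum(row_max, s.max(dim=-1, keepdim=True).values)
        # Rescale previous accumulators to the new running max.
        correction = torch.exp(row_max - new_max)
        p = torch.exp(s - new_max)
        out = out * correction + p @ vb
        row_sum = row_sum * correction + p.sum(dim=-1, keepdim=True)
        row_max = new_max
    return out / row_sum


# Sanity check against naive attention on small inputs.
q, k, v = (torch.randn(128, 16) for _ in range(3))
ref = torch.softmax((q @ k.t()) * 16 ** -0.5, dim=-1) @ v
assert torch.allclose(tiled_attention(q, k, v), ref, atol=1e-5)
```

The same recurrence run in reverse order underlies the backward pass, which the repo also implements.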