πA curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. ππ
-
Updated
Dec 2, 2024
πA curated list of Awesome LLM/VLM Inference Papers with codes, such as FlashAttention, PagedAttention, Parallelism, etc. ππ
πA curated list of Awesome Diffusion Inference Papers with codes, such as Sampling, Caching, Multi-GPUs, etc. ππ
Add a description, image, and links to the open-sora topic page so that developers can more easily learn about it.
To associate your repository with the open-sora topic, visit your repo's landing page and select "manage topics."