FEAT: Support 1.58-bit LLMs training #114
Labels: enhancement, good first issue, help wanted, Low Priority
Hi there!
Microsoft has just released the full handbook for reproducing the 1-bit LLM paper: https://github.com/microsoft/unilm/blob/master/bitnet/The-Era-of-1-bit-LLMs__Training_Tips_Code_FAQ.pdf
It would be exciting to have an official implementation of that paper in nanotron, and to support 1-bit LLM inference directly in transformers for models trained with this method in nanotron.
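For context, the core change in the handbook's recipe is swapping `nn.Linear` for a `BitLinear` layer. Below is a minimal sketch, assuming PyTorch: absmean ternarization of weights to {-1, 0, +1}, per-token 8-bit absmax quantization of activations, and a straight-through estimator so gradients flow through the rounding. The full recipe also normalizes activations (RMSNorm) before quantization, which is omitted here for brevity.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


def weight_quant(w: torch.Tensor) -> torch.Tensor:
    """Ternarize weights to {-1, 0, +1} using the absmean scale."""
    scale = 1.0 / w.abs().mean().clamp(min=1e-5)
    return (w * scale).round().clamp(-1, 1) / scale


def activation_quant(x: torch.Tensor) -> torch.Tensor:
    """Quantize activations to 8 bits with per-token absmax scaling."""
    scale = 127.0 / x.abs().max(dim=-1, keepdim=True).values.clamp(min=1e-5)
    return (x * scale).round().clamp(-128, 127) / scale


class BitLinear(nn.Linear):
    """Drop-in replacement for nn.Linear during 1.58-bit training."""

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        w = self.weight
        # Straight-through estimator: the forward pass uses quantized
        # values, while the backward pass sees the identity because the
        # quantization residual is detached from the graph.
        x_q = x + (activation_quant(x) - x).detach()
        w_q = w + (weight_quant(w) - w).detach()
        return F.linear(x_q, w_q, self.bias)
```

Since `BitLinear` keeps the `nn.Linear` interface, in principle it could be patched into nanotron's linear layers without touching the surrounding model code; the quantized forward applies only at training time, with dedicated low-bit kernels needed for efficient inference.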
cc @NouamaneTazi @xrsrke @3outeille @thomwolf
cc original author: @shumingma