[RFC] Intx Tensor Subclasses Quantization #439

vayuda · 2024-06-25T18:32:37Z

Objective:

Implement sub-byte unsigned integer quantization baselines for bit widths 1 through 7, so that users can experiment with low-bit quantization in PyTorch.
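For readers new to the idea, here is a minimal sketch of what quantizing to an unsigned x-bit integer means, assuming simple min/max affine quantization over a flat list of floats. It is plain Python for illustration only, not the torchao implementation; the names `quantize_uintx` and `dequantize_uintx` are hypothetical.

```python
def quantize_uintx(values, nbits):
    """Affine-quantize floats to unsigned codes in [0, 2**nbits - 1].

    Hypothetical helper for illustration; assumes min/max calibration.
    """
    if not 1 <= nbits <= 7:
        raise ValueError("nbits must be between 1 and 7")
    qmax = (1 << nbits) - 1
    lo, hi = min(values), max(values)
    # Guard against a constant input, where hi == lo.
    scale = (hi - lo) / qmax if hi > lo else 1.0
    # Map each value to the nearest code and clamp to the valid range.
    codes = [min(qmax, max(0, round((v - lo) / scale))) for v in values]
    return codes, scale, lo


def dequantize_uintx(codes, scale, lo):
    """Map codes back to approximate float values."""
    return [c * scale + lo for c in codes]


values = [-1.0, -0.3, 0.2, 0.9, 1.0]
codes, scale, lo = quantize_uintx(values, nbits=3)
approx = dequantize_uintx(codes, scale, lo)
```

With 3 bits there are only 8 representable levels, so each reconstructed value can differ from the original by up to half a quantization step (`scale / 2`).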

Tracker:

Create a UIntx Tensor Subclass per [RFC] torchao Contributor Guide #391
Integrate with existing quant API + AQT
Profile performance with Llama 2 and Llama 3, noting metrics mentioned in The next tutorials #426
Add support for int_x as well
Integrate with existing uint dtypes
Add fused kernel for unpack + dequant
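The "unpack + dequant" item in the tracker can be illustrated with an unfused reference, sketched below in plain Python. This is only an assumption about the general technique: the bit layout (least-significant bits first, codes allowed to straddle byte boundaries), the `(code - zero_point) * scale` convention, and the function names are all hypothetical and are not taken from torchao, whose tensor subclass works on tensors and may pack differently.

```python
def pack_uintx(codes, nbits):
    """Pack nbits-wide unsigned codes into bytes, least-significant bits first."""
    if not 1 <= nbits <= 7:
        raise ValueError("nbits must be between 1 and 7")
    mask = (1 << nbits) - 1
    acc = 0
    for i, c in enumerate(codes):
        if not 0 <= c <= mask:
            raise ValueError(f"code {c} does not fit in {nbits} bits")
        acc |= c << (i * nbits)
    # Round the total bit count up to a whole number of bytes.
    nbytes = (len(codes) * nbits + 7) // 8
    return acc.to_bytes(nbytes, "little")


def unpack_uintx(packed, nbits, count):
    """Recover `count` codes from bytes produced by pack_uintx."""
    mask = (1 << nbits) - 1
    acc = int.from_bytes(packed, "little")
    return [(acc >> (i * nbits)) & mask for i in range(count)]


def unpack_dequant(packed, nbits, count, scale, zero_point):
    """Unfused reference: unpack first, then dequantize each code."""
    return [(c - zero_point) * scale for c in unpack_uintx(packed, nbits, count)]


codes = [1, 2, 3, 4, 5, 6, 7, 0]
packed = pack_uintx(codes, nbits=3)   # 8 codes * 3 bits = 24 bits = 3 bytes
restored = unpack_uintx(packed, nbits=3, count=len(codes))
```

A fused kernel would perform the shift, mask, and scale steps in one pass over the packed bytes instead of materializing the intermediate list of codes, which is where the memory-bandwidth savings would come from.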

Tasks

Use torch.uint1 to torch.uint7 for Uintx tensor subclass #672

jerryzh168 · 2024-06-26T00:28:45Z

This is great, @vayuda. After the Intx tensor subclass matures we can also merge this into PyTorch core, but we can keep it in torchao for a while to flesh out the extensibility stories (how to add a new op, layout, or implementation branch to these tensors).

HDCharles · 2024-07-02T19:21:58Z

@vayuda can you link the results/PRs for some of these checked off bits?

vayuda self-assigned this Jun 25, 2024

vayuda mentioned this issue Jul 2, 2024

Intx Quantization Tensor Class #468

Merged

jerryzh168 added the good first issue label Jul 29, 2024

vayuda assigned jerryzh168 Aug 14, 2024
