Question about quantize time for custom flux transformer #24

Open
chuck-ma opened this issue Nov 12, 2024 · 5 comments

@chuck-ma

I'm currently using an H800 to do Smooth Quantization for my custom flux transformer, and I'm wondering how long quantization should take. It has been running for 20 minutes, but the progress bar is still empty.

python -m deepcompressor.app.diffusion.ptq configs/model/flux.1-custom.yaml configs/svdquant/int4.yaml --save-model /root/autodl-tmp/flux.1-custom-svdquant-int4

@chuck-ma chuck-ma changed the title Question about quantize time Question about quantize time for custom flux transformer Nov 12, 2024
@adhikjoshi

Please share an update if it works.

@chuck-ma
Author

chuck-ma commented Nov 12, 2024

Well, it will take 70 hours to quantize, and I don't have the budget for that. Any ideas on how to speed it up? @synxlin @bobboli

@dome272

dome272 commented Nov 15, 2024

I'm also encountering this. It is taking 12 hours on an H100.

@synxlin
Contributor

synxlin commented Nov 25, 2024

Hi @chuck-ma @dome272,

We are working on improving our codebase to support fast calibration without online activation generation. We'll keep this issue updated.
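
As a rough illustration of the general idea of reusing stored calibration activations instead of regenerating them on every pass, here is a minimal sketch. It is not deepcompressor's implementation or API. The names `load_or_generate`, `expensive_generation`, and the cache file `calib_acts.pkl` are all hypothetical, and the "activations" are placeholder lists rather than real tensors.

```python
import pickle
import tempfile
from pathlib import Path


def load_or_generate(cache_path, generate_fn):
    """Return cached activations if the file exists; otherwise generate and store them.

    Hypothetical helper for illustration only, not part of deepcompressor.
    """
    cache_path = Path(cache_path)
    if cache_path.exists():
        # Cache hit: skip the expensive generation step entirely.
        with cache_path.open("rb") as f:
            return pickle.load(f)
    # Cache miss: run the expensive step once and persist the result.
    activations = generate_fn()
    with cache_path.open("wb") as f:
        pickle.dump(activations, f)
    return activations


calls = {"count": 0}


def expensive_generation():
    # Stand-in for running the model on calibration prompts (assumption).
    calls["count"] += 1
    return [[0.1, 0.2], [0.3, 0.4]]


with tempfile.TemporaryDirectory() as tmp:
    path = Path(tmp) / "calib_acts.pkl"
    first = load_or_generate(path, expensive_generation)   # generates and saves
    second = load_or_generate(path, expensive_generation)  # loads from disk
```

In this sketch the generation function runs only once; the second call reads the saved result. Whether a similar approach matches the planned codebase change is not stated in this thread.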

@chuck-ma
Author

Understood. I have read your paper and studied the general framework of your "nunchaku" project. If there is anything I can do to help you, I will be very happy to contribute. @synxlin

4 participants