Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
sayakpaul 
posted an update Jul 30
Post
3742
With larger and larger diffusion transformers coming up, it's becoming increasingly important to have some good quantization tools for them.

We present our findings from a series of experiments on quantizing different diffusion pipelines based on diffusion transformers.

We demonstrate excellent memory savings with a bit of sacrifice on inference latency which is expected to improve in the coming days.

Diffusers 🤝 Quanto ❤️

This was a juicy collaboration between @dacorvo and myself.

Check out the post to learn all about it
https://huggingface.co/blog/quanto-diffusers

Also worth mentioning is that you can also use stable-diffusion.cpp. They recently merged SD3 into their main branch. GitHub

P.S. Inference on CPU is still super slow.

No one has ever thought to Quantize Reverse Diffusion before? Really? What about the sausage one? Bravo and kudos to you either way!

Would LOVE to see TortoiseTTS quantizable!