You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Due to its high generation quality and fast inference, we believe integrating this model into diffusers will make diffusers more appealing to text-to-audio generation researchers and users! Thank you very much.
Open source status
The model implementation is available.
The model weights are available (Only relevant if addition is not a scheduler).
I am the main author of the code, and am more than happy to assist the integration.
The text was updated successfully, but these errors were encountered:
Bai-YT
changed the title
ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
[馃専 New Model] ConsistencyTTA: Accelerating Diffusion-Based Text-to-Audio Generation with Consistency Distillation
Jun 5, 2024
@Bai-YT Thank you for your awesome work! I just finished understanding the paper and think that I have a good grasp of the modeling and inference code to convert to diffusers.
@sayakpaul Could I pick this up if no one's working on it?
@Bai-YT Thank you for your awesome work! I just finished understanding the paper and think that I have a good grasp of the modeling and inference code to convert to diffusers.
@sayakpaul Could I pick this up if no one's working on it?
Appreciate everyone's time for helping!!! Massive thanks.
Model/Pipeline/Scheduler description
ConsistencyTTA, introduced in the paper Accelerating Diffusion-Based Text-to-Audio Generation
with Consistency Distillation, is an efficient text-to-audio generation model. Compared to a comparable diffusion-based TTA model, ConsistencyTTA achieves a 400x generation speed-up, while retaining the generation quality and diversity.
Due to its high generation quality and fast inference, we believe integrating this model into
diffusers
will makediffusers
more appealing to text-to-audio generation researchers and users! Thank you very much.Open source status
Provide useful links for the implementation
The open-source code implementation can be found at https://github.com/Bai-YT/ConsistencyTTA.
There is also a simplified implementation for inference only: https://github.com/Bai-YT/ConsistencyTTA/tree/main/easy_inference.
The model checkpoints can be found at https://huggingface.co/Bai-YT/ConsistencyTTA.
I am the main author of the code, and am more than happy to assist the integration.
The text was updated successfully, but these errors were encountered: