Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Convert vocos checkpoint to bigvgan #654

Open
4 tasks done
kmn1024 opened this issue Dec 20, 2024 · 3 comments
Open
4 tasks done

Convert vocos checkpoint to bigvgan #654

kmn1024 opened this issue Dec 20, 2024 · 3 comments
Labels
question Further information is requested

Comments

@kmn1024
Copy link

kmn1024 commented Dec 20, 2024

Checks

  • This template is only for question, not feature requests or bug reports.
  • I have thoroughly reviewed the project documentation and read the related paper(s).
  • I have searched for existing issues, including closed ones, no similar questions.
  • I confirm that I am using English to submit this report in order to facilitate communication.

Question details

Is there a fast way to convert a F5 checkpoint that was trained using vocos mels to one that is compatible with bigvgan? For example, can we freeze most of the parameters, and only continue-train a small number of parameters? Or must we train all parameters again?

@kmn1024 kmn1024 added the question Further information is requested label Dec 20, 2024
@ZhikangNiu
Copy link
Collaborator

But we also release BigVGAN F5 checkpoint in the huggingface ( Why we need to convert the checkpoint?

@kmn1024
Copy link
Author

kmn1024 commented Dec 20, 2024

Because I have finetuned many models on top of the vocos checkpoint. Now I wonder if I need to redo all that if I want bigvgan.

@ZhikangNiu
Copy link
Collaborator

Got it, this is a great idea. I think you can first compare the differences between these two MelSpec implementation methods, then add a preprocessing network or postprocessing network, and freeze all the dit blocks. You can try this approach to see if the model can map the vocos features space to the bigvgan features space on its own. (Intuitively, this seems to be the simplest approach

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants