How to fine-tune the VITS En TTS model for a different language? #5922
-
@treacker can you provide some advice for training/fine-tuning VITS?
-
@Ca-ressemble-a-du-fake,
-
I haven't researched fine-tuning, but I think the process will be pretty much the same. This checkpoint was trained with an IPA tokenizer, so TextEmbedding will likely still work, but check that your French phoneme set does not exceed the NeMo IPA set.
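For anyone who wants to run that check, here is a minimal sketch in plain Python. It only computes a set difference, so it does not depend on NeMo itself. The function name `missing_symbols` and both symbol lists are hypothetical and purely illustrative; in practice you would replace `checkpoint_tokens` with the token list taken from the checkpoint's tokenizer and `french_phonemes` with the symbols that actually occur in your phonemized dataset.

```python
from typing import Iterable, Set


def missing_symbols(dataset_phonemes: Iterable[str],
                    checkpoint_tokens: Iterable[str]) -> Set[str]:
    """Return the phoneme symbols used by the dataset that the
    checkpoint's vocabulary does not contain."""
    return set(dataset_phonemes) - set(checkpoint_tokens)


# Illustrative placeholder vocabulary (NOT the real NeMo IPA set).
checkpoint_tokens = ["a", "b", "d", "e", "i", "k", "l", "m",
                     "n", "o", "p", "s", "t", "u", "ʁ", "ʃ"]

# Illustrative placeholder symbols from a French dataset.
french_phonemes = ["b", "ɛ", "ʒ", "u", "ʁ"]

missing = missing_symbols(french_phonemes, checkpoint_tokens)
if missing:
    print("Symbols not covered by the checkpoint:", sorted(missing))
else:
    print("All dataset symbols are covered by the checkpoint.")
```

If the result is empty, the existing embedding table should cover your data. If it is not, those symbols would need to be remapped or the vocabulary extended before fine-tuning.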
-
Hi, I am currently fine-tuning the VITS model on my own voice, but I am getting the following error. I don't know exactly what causes it, but I think it is a padding issue. Even though I've made some changes to the code, the error keeps occurring.
It would help a lot if someone could guide me on this.
-
Hi,
I need VITS (https://catalog.ngc.nvidia.com/orgs/nvidia/teams/nemo/models/tts_en_lj_vits) to speak French, so I need to fine-tune it with my own dataset.
I read the tutorial on FastPitch fine-tuning (https://github.com/NVIDIA/NeMo/blob/main/tutorials/tts/FastPitch_Finetuning.ipynb), which clearly describes the fine-tuning process.
But I don't know where to start to fine-tune VITS (or even train it from scratch) on my French dataset.
Any guide or tutorial would help me a lot!