Making LJSpeech Tacotron2 model model say 'video'. #883

bduvenhage · 2021-03-23T22:07:19Z

Hi

This is possibly a question on TTS in general, but I'm fine tuning PyTorch Tacotron2 and I found that even the pre-trained LJSpeech model has trouble saying 'video'. I see the training and inference code supports arpabet, but I assume that the pre-trained models were trained on the plain text of LJSpeech and not phonetic sequences? I tried doing inference using arpabet sequences in curly brackets, but it doesn't work.

Do you have any recommendations for making the model say modern words that might not have been part of the LJSpeech corpus?

Thanks,

nv-kkudrynski assigned ghost Jun 15, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Making LJSpeech Tacotron2 model model say 'video'. #883

Making LJSpeech Tacotron2 model model say 'video'. #883

bduvenhage commented Mar 23, 2021 •

edited

Loading

Making LJSpeech Tacotron2 model model say 'video'. #883

Making LJSpeech Tacotron2 model model say 'video'. #883

Comments

bduvenhage commented Mar 23, 2021 • edited Loading

bduvenhage commented Mar 23, 2021 •

edited

Loading