Using the program for other voices #15

Open
StElysse opened this issue Feb 13, 2021 · 1 comment

Comments

@StElysse

Hello,

How can I use the FastPitch part of this program for a non-Bethesda game speaker?

I discovered this repo in my quest to create a personal modding project involving voice synthesis. Over the past week, I’ve been experimenting with the Real Time Voice Cloning repo and have successfully fine-tuned its pretrained models to a single speaker. The model continues to improve slowly, but my sole gripe is the inability to control the pitch of the generated audio.

Would it be possible for you to tell me how I can modify or use your repo for a non-Bethesda speaker? I know how to compile datasets with LibriTTS and train a pre-made synthesizer on them, but not much else.

If you can help, I’d be grateful!

@DanRuta
Owner

DanRuta commented Feb 13, 2021

Hi there. xVA is just an app for running inference (with editing) on FastPitch models. Your best bet is to head over to NVIDIA's GitHub page for FastPitch, where they give instructions on how to train the model. Once you have trained a model on your data, you can drop it into xVA to use the editor.
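
As a rough illustration of the data-preparation step that comes before training, here is a minimal sketch that turns (audio file, transcript) pairs into a training and a validation filelist. It assumes an LJSpeech-style layout with one `wavs/<file>|<transcript>` entry per line; that layout, the `wavs` directory name, and the helper names `build_filelist_lines` and `split_train_val` are assumptions for illustration only, so check NVIDIA's FastPitch instructions for the exact format their scripts expect.

```python
import random
from typing import Iterable, List, Tuple


def build_filelist_lines(
    pairs: Iterable[Tuple[str, str]], wav_dir: str = "wavs"
) -> List[str]:
    """Format (wav file name, transcript) pairs as 'wav_dir/name|text' lines.

    The pipe-separated layout is an assumption (LJSpeech-style filelist).
    """
    lines = []
    for wav_name, text in pairs:
        # Collapse runs of whitespace and newlines into single spaces.
        clean = " ".join(text.split())
        if not clean:
            continue  # skip clips with an empty transcript
        if "|" in clean:
            raise ValueError(f"transcript for {wav_name} contains the '|' separator")
        lines.append(f"{wav_dir}/{wav_name}|{clean}")
    return lines


def split_train_val(
    lines: List[str], val_fraction: float = 0.1, seed: int = 0
) -> Tuple[List[str], List[str]]:
    """Shuffle deterministically and hold out a fraction for validation."""
    shuffled = list(lines)
    random.Random(seed).shuffle(shuffled)
    n_val = max(1, int(len(shuffled) * val_fraction)) if shuffled else 0
    return shuffled[n_val:], shuffled[:n_val]
```

Each returned list can then be written out with `"\n".join(...)` to whatever filelist paths the training configuration points at.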
