MellotronCPU

Mellotron singing synthesizer using CPU

Insallation

Download pretrained model checkpoints from nvidia/mellotron repository and specify the paths here

Usage

Check this Google Colab.

About musicXML Format

The characters must be in [a-zA-Z]
Each word must start with an upper case
Every word must exist in the cmu_dictionary dictionary. https://en.wikipedia.org/wiki/ARPABET

Relevant notes 1

In reference to the GST part of mellotron, there is no 1:1 lock. You can use GST the same way as in other repos.

If you want to do inference with the mellotron model however, we additionally extract two things from a reference audio: the rhythm and the pitch which creates the 1:1 correspondence. It's the rhythm that creates the 1:1 correspondence actually. But your automatically-extracted pitch might not make sense if you do not additionally condition on the rhythm.

If you don't want rhythm (which you can disable by using model.inferece()) and pitch conditioning (which you can disable by sending zeros as the pitch), you get essentially tacotron 2 with GST and speaker ids.

Relevant notes 2

The paper states that "the target speaker, St, would always be found in the training set, while the source text, pitch and rhythm (Ts, Ps, Rs) could be from outside the training set." so I presume there is no need for speaker ids for source audios - it doesn't make sense after all for some arbitrary input audio outside the training set to have a valid speaker id. However in the examples_filelist.txt there is a column for speaker ids. What is the significance of this column?

The model expects a speaker id, so we give it a random speaker id.

NVIDIA/mellotron#18

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
checkpoints		checkpoints
mellotron		mellotron
musicXML		musicXML
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
run_mellotron.py		run_mellotron.py
train_utils.py		train_utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MellotronCPU

Insallation

Usage

About musicXML Format

Relevant notes 1

Relevant notes 2

About

Releases

Packages

Languages

mathigatti/MellotronCPU

Folders and files

Latest commit

History

Repository files navigation

MellotronCPU

Insallation

Usage

About musicXML Format

Relevant notes 1

Relevant notes 2

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages