Some questions about ScoreDraft #7

Open
melspectrum007 opened this issue May 8, 2018 · 1 comment

Comments

@melspectrum007

Nice work on the singing synthesizer. I have some questions about ScoreDraft:
(1) Which pitch-shifting method does ScoreDraft use?
(2) Which time-shifting method does ScoreDraft use?
(3) There seems to be some noise in the singing voice synthesis output. How can it be reduced or eliminated?
(4) From the source code, ScoreDraft appears to use a concatenative method. Is there any plan to use an HMM- or DNN-based method?
(5) What is the plan for the next version?

@fynv
Owner

fynv commented May 8, 2018

(1) Which pitch-shifting method does ScoreDraft use?
(2) Which time-shifting method does ScoreDraft use?

The current solution is a mixture of short-time FFT analysis and HNM (harmonic plus noise model).
By stretching and interpolating time-domain windows (harmonic part) or frequency-domain coefficients (noise part), pitch and duration can be adjusted independently.
I will write a blog post about the details when I have time.
If you can read Chinese, see the link below:
https://www.zhihu.com/question/26165668/answer/318431652
I wrote it earlier, when ScoreDraft didn't have HNM yet and consonants were treated the same way as vowels. That version is closer to PSOLA in concept.
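
For readers unfamiliar with the PSOLA idea mentioned above, here is a minimal sketch of it. This is not ScoreDraft's code: the function names `hann` and `psola_resynth`, the constant known pitch period, and the nearest-pitch-mark mapping are all assumptions made for illustration. It only shows how re-spacing windowed grains lets pitch and duration change independently.

```python
import math

def hann(n):
    """Periodic Hann window of length n."""
    return [0.5 - 0.5 * math.cos(2.0 * math.pi * i / n) for i in range(n)]

def psola_resynth(signal, src_period, pitch_ratio, time_ratio):
    """PSOLA-style overlap-add (illustrative only, not ScoreDraft's method).

    signal      : list of float samples, assumed voiced with a constant period
    src_period  : source pitch period in samples (assumed known in advance)
    pitch_ratio : > 1 raises the pitch (grains are placed closer together)
    time_ratio  : > 1 lengthens the output
    """
    out_period = max(1, round(src_period / pitch_ratio))
    out_len = int(round(len(signal) * time_ratio))
    win = hann(2 * src_period)          # each grain spans two source periods
    out = [0.0] * out_len
    weight = [0.0] * out_len            # summed window values, for normalization
    last_mark = (len(signal) - 1) // src_period
    center = 0
    while center < out_len:
        # Map the output time back to the source, then snap to a pitch mark.
        mark = min(last_mark, round(center / time_ratio / src_period))
        src_center = mark * src_period
        for i, w in enumerate(win):
            s = src_center - src_period + i   # read position in the source
            o = center - src_period + i       # write position in the output
            if 0 <= s < len(signal) and 0 <= o < out_len:
                out[o] += w * signal[s]
                weight[o] += w
        center += out_period            # grain spacing sets the new pitch
    return [x / w if w > 1e-9 else 0.0 for x, w in zip(out, weight)]
```

The grain spacing (`out_period`) controls the pitch, while the mapping from output time back to source pitch marks controls the duration, so the two ratios do not interfere with each other. With both ratios set to 1.0, the interior of the output reproduces the input.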
(3) There seems to be some noise in the singing voice synthesis output. How can it be reduced or eliminated?
The noise has several different causes, so it is very complicated to solve.
(4) From the source code, ScoreDraft appears to use a concatenative method. Is there any plan to use an HMM- or DNN-based method?
Not yet. That would involve a fundamental restructuring of voicebanks, so I am not going to do that in the foreseeable future.
(5) What is the plan for the next version?

  1. As you have noticed, there are still some quality issues when just using UTAU voicebanks.
    I think that should be the first thing to solve. There are also some kinds of UTAU voicebanks I have not studied yet.
  2. Instrumental synthesis is also too simple currently. I found an interesting project on GitHub:
    https://github.com/LMMS/lmms
    It is quite powerful music software for Linux. I will study it and improve ScoreDraft (add support for SF2, VST, etc.).
  3. Kanru has open-sourced some of the core algorithms behind Moresampler.
    https://github.com/Sleepwalking/libllsm2
    I believe that could be a shortcut to high-quality singing synthesis. I will study that code thoroughly.


2 participants