Speech Not Clear #339

MMingabc · 2024-03-06T02:40:59Z

MMingabc
Mar 6, 2024

I am using my own dataset which is a Chinese professional tts speech dataset of 8k utterances to train bert-vits2. The generated speech is not clear. It sounds like the speaker but the content cannot be understood. It's just mumbling. What could be the problem? Thank you. And I am using 16k sampling rate.

AngelGuevara7 · 2024-03-14T10:47:39Z

AngelGuevara7
Mar 14, 2024

Same problem here for spanish! The alignment matrix looks good but the audio it's just mumbling. I'm using 44,1k sampling rate.
Did you solve it?

2 replies

starmoon-1134 Mar 15, 2024

This maype help you.
#245 (comment)

AngelGuevara7 Mar 18, 2024

Thanks! I will take a look at it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speech Not Clear #339

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 1 comment 2 replies

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

Select a reply

Speech Not Clear #339

MMingabc Mar 6, 2024

Replies: 1 comment · 2 replies

AngelGuevara7 Mar 14, 2024

starmoon-1134 Mar 15, 2024

AngelGuevara7 Mar 18, 2024

MMingabc
Mar 6, 2024

Replies: 1 comment 2 replies

AngelGuevara7
Mar 14, 2024