Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

看起来大家对流式语音的生成以及长文字tts的速度不满, 大家有优化后的版本么 #757

Open
chg0901 opened this issue Dec 19, 2024 · 2 comments

Comments

@wanghuihhh
Copy link

其实大家主要不是不满,而是一开始提及150ms,吊足了胃口,但是最后开出来的版本距离这个值差太多,并且也没有提及任何相关信息,即使这部分不打算开源,也应该说一下

@shirubei
Copy link

shirubei commented Dec 23, 2024

插一句,https://github.com/Plachtaa/VITS-fast-fine-tuning 这个虽然功能比较少,速度很快。
用自己训练好的模型,20个汉字左右,第一次进行推理大约要6s(估计主要消耗的时间是模型加载)。
换别的文本(同样字数)再次进行推理,1s(或者更少)。之后一直保持1s左右。

不足之处就是,无法支持中英混合的情况,碰到英文它只能一个一个字母地读。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants