Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

提示语音过长生成效果变差。 #764

Open
APeiZou opened this issue Dec 20, 2024 · 2 comments
Open

提示语音过长生成效果变差。 #764

APeiZou opened this issue Dec 20, 2024 · 2 comments

Comments

@APeiZou
Copy link

APeiZou commented Dec 20, 2024

@passerbya @v3ucn @boji123 你好,我测试发现输入提示音色文件时长28s过长prompt-txt等于输入txt的时候生成的语音文件只有不到1个2个词。输入提示音色文件时长有限制吗?

@RHOWL3
Copy link

RHOWL3 commented Dec 20, 2024

image
在cosyvoice.py中,这意味着如果你的prompt音频文本太长,同时切分后的推理文本太短的话(不到prompt文本的一半长度),它就会报这个警告

@APeiZou
Copy link
Author

APeiZou commented Dec 20, 2024

@RHOWL3 如果输入的prompt音频文本内容跟说话prompt_speech_16k不一样而是用的设置为tts_text==prompt_text效果也不理想吧

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants