Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Very low accuracy #93

Open
zyc1310517843 opened this issue Jun 15, 2019 · 3 comments
Open

Very low accuracy #93

zyc1310517843 opened this issue Jun 15, 2019 · 3 comments

Comments

@zyc1310517843
Copy link

Hello, I downloaded the official Chinese speech model. It seems that the recognition rate is very low and the basic recognition is not correct. Thank you for your guidance.

@lvan-jone
Copy link

Where did you download the official Chinese package? If the recognition rate is low, you need to get tools like dictionary models yourself

@SwimmingTiger
Copy link

SwimmingTiger commented Apr 17, 2021

I think he downloaded this model:
https://sourceforge.net/projects/cmusphinx/files/Acoustic%20and%20Language%20Models/Mandarin/

I am also using this model. The dictation accuracy of this model is indeed very poor. In dictation mode, it can hardly generate any readable sentences. I can only get some unrelated fragments of words.

But if a JSGF grammar file is loaded, the accuracy is acceptable. Note: It seems that manual word segmentation is required for Chinese grammar file, that is, adding spaces between each word in the sentence. Otherwise, The dictionary is missing a phonetic transcription for the word 'xxxxxxxxxxxxxxxxxx' will be reported and you will not be able to identify any content.

@SwimmingTiger
Copy link

SwimmingTiger commented Apr 17, 2021

Or we can make some adjustments to the acoustic model or configuration to improve the accuracy of dictation.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Development

No branches or pull requests

3 participants