some different suggestions regarding the engine #56

Open
king-dahmanus opened this issue Jan 1, 2022 · 0 comments
Labels
enhancement New feature or request

king-dahmanus commented Jan 1, 2022

Hello developers. I'm not a developer myself, but I'd like to suggest some improvements and features for this engine.
First, the shorter one: to improve the voice quality, the vocoder needs to change. From what I've heard in the samples, this engine appears to use the Griffin-Lim algorithm to turn spectrograms into audio, which sounds robotic. I suggest replacing it with a neural vocoder such as HiFi-GAN, which sounds promising, or any other better-sounding one.
Second, I suggest making this engine available to Windows assistive technologies by shipping a SAPI 5 (Speech Application Programming Interface) distribution, so that screen readers like NVDA or JAWS, text readers like Balabolka or TextAloud, and many other programs can use it. For that, the voice has to be optimized for responsiveness: output faster than real time, with no lag or delay before or during speech. I hope you'll consider my suggestions. Thanks, and I hope we can discuss this.
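
To make the responsiveness requirement measurable, one common approach is to track two numbers: the delay before the first audio arrives, and the real-time factor (time spent synthesizing divided by the duration of the audio produced, where a value below 1.0 means faster than real time). The sketch below is only an illustration. `fake_synthesize`, the 22050 Hz sample rate, and the 16-bit mono chunk format are assumptions made for the example and are not taken from this engine.

```python
import time
from typing import Callable, Iterator, Tuple

SAMPLE_RATE = 22050  # assumed output sample rate in Hz (not taken from the engine)


def fake_synthesize(text: str) -> Iterator[bytes]:
    """Hypothetical stand-in for the engine: yields 16-bit mono PCM chunks."""
    samples_per_chunk = SAMPLE_RATE // 10  # about 100 ms of audio per chunk
    for _ in range(max(1, len(text) // 10)):
        time.sleep(0.005)  # pretend each chunk takes 5 ms to compute
        yield b"\x00\x00" * samples_per_chunk


def measure(
    synthesize: Callable[[str], Iterator[bytes]],
    text: str,
    sample_rate: int = SAMPLE_RATE,
) -> Tuple[float, float]:
    """Return (seconds until the first chunk, real-time factor)."""
    start = time.perf_counter()
    first_chunk_at = None
    total_samples = 0
    for chunk in synthesize(text):
        if first_chunk_at is None:
            first_chunk_at = time.perf_counter() - start
        total_samples += len(chunk) // 2  # 2 bytes per 16-bit sample
    elapsed = time.perf_counter() - start
    if first_chunk_at is None or total_samples == 0:
        raise ValueError("synthesizer produced no audio")
    audio_seconds = total_samples / sample_rate
    return first_chunk_at, elapsed / audio_seconds


if __name__ == "__main__":
    latency, rtf = measure(fake_synthesize, "Hello developers, this is a test sentence.")
    print(f"time to first audio: {latency * 1000:.1f} ms")
    print(f"real-time factor: {rtf:.3f} (below 1.0 is faster than real time)")
```

For screen reader use, the time to first audio probably matters as much as the real-time factor, since every keypress triggers a short utterance.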

king-dahmanus added the enhancement label Jan 1, 2022