Is there an option to run it on the GPU? #41

Open
yhown589 opened this issue Mar 1, 2024 · 2 comments
Labels
question Further information is requested

Comments


yhown589 commented Mar 1, 2024

When I run audio-to-text through the API, it always runs on my CPU while my GPU sits idle. I would like to run it on my GPU to improve speed.

Member

rotemdan commented Mar 1, 2024

The models are loaded via onnxruntime-node, which is a Node.js binding for Microsoft's ONNX Runtime.

onnxruntime-node doesn't currently support GPU execution on Node.js.

GPU support is one of their work items for 2024 and is still in development. They have added some early code, but it isn't fully deployed yet.

Also, onnxruntime-node doesn't support models larger than 2.0 GB, so the whisper-large models are not currently supported. I opened an issue about this on the ONNX Runtime repository several months ago.
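
As a rough illustration of the size constraint, here is a minimal sketch of a pre-load check. It is not part of Echogarden or onnxruntime-node: the function names are hypothetical, and the exact threshold is an assumption (the "2.0 GB" figure is read here as 2 GiB, which the thread does not confirm).

```typescript
// Assumption: "2.0 GB" is interpreted as 2 GiB (2 * 1024^3 bytes).
// The exact cutoff used by the runtime is not stated in this thread.
const ASSUMED_LIMIT_BYTES: number = 2 * 1024 ** 3;

// Hypothetical helper: true when a model of the given size would be
// over the assumed limit and therefore not loadable.
function exceedsAssumedLimit(sizeInBytes: number): boolean {
  if (!Number.isFinite(sizeInBytes) || sizeInBytes < 0) {
    throw new RangeError("sizeInBytes must be a non-negative finite number");
  }
  return sizeInBytes > ASSUMED_LIMIT_BYTES;
}

// Hypothetical helper: formats a byte count in GiB for log messages.
function formatGiB(sizeInBytes: number): string {
  return (sizeInBytes / 1024 ** 3).toFixed(2) + " GiB";
}
```

The byte count would typically come from something like `fs.statSync(modelPath).size`, so an oversized model can be rejected with a clear message before any attempt to load it.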

Once they add GPU support, it will be added to Echogarden as well: for Whisper recognition, possibly for synthesis with the VITS models, and maybe for other features like speech language recognition. The same goes for large model support.

If you are using Echogarden mostly for speech recognition, be aware that recognition isn't its strongest area (that is likely alignment). There are faster implementations of the OpenAI Whisper models, like whisper.cpp, that support NVIDIA GPUs and are also significantly faster on CPU thanks to quantization and other optimizations.

Author

yhown589 commented Mar 1, 2024

I got it. Thanks for your detailed explanation.

@rotemdan rotemdan added the question Further information is requested label Jul 4, 2024