
How can I run the realtime model locally on my Linux machine? #149

Open
CyberTimon opened this issue Nov 15, 2024 · 1 comment

Comments

@CyberTimon

Hello

Ultravox looks really interesting.

Is it possible to run this model on my own Linux machine, which has 2x RTX 3090 24 GB GPUs?
I couldn't find a Python inference server or anything similar.

Thank you!
Kind regards,
Timon

@johnwick123f

@CyberTimon I believe both vLLM and Transformers support Ultravox, with vLLM being the faster option. Here is the vLLM example script for running Ultravox:

https://github.com/vllm-project/vllm/blob/661a34fd4fdd700a29b2db758e23e4e243e7ff18/examples/offline_inference_audio_language.py#L23
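A minimal offline-inference sketch based on that example follows; it is not a drop-in script. The model revision `fixie-ai/ultravox-v0_3`, the `<|reserved_special_token_0|>` audio placeholder (taken from the linked example), and the `question.wav` file are assumptions for illustration. `tensor_parallel_size=2` shards the weights across your two 3090s:

```python
# Sketch of offline Ultravox inference with vLLM, adapted from the linked
# example. Assumes `pip install vllm librosa` and a local question.wav file.
import librosa
from transformers import AutoTokenizer
from vllm import LLM, SamplingParams

model_name = "fixie-ai/ultravox-v0_3"  # assumed model revision

# tensor_parallel_size=2 splits the model across both RTX 3090s.
llm = LLM(model=model_name, tensor_parallel_size=2)

# Build the prompt with the model's chat template; the audio placeholder
# token below is the one used in the linked vLLM example.
tokenizer = AutoTokenizer.from_pretrained(model_name)
messages = [{
    "role": "user",
    "content": "<|reserved_special_token_0|>\nWhat is said in this audio?",
}]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

# Load the audio as a (waveform, sample_rate) pair for multi_modal_data.
audio, sr = librosa.load("question.wav", sr=16000)  # placeholder path

outputs = llm.generate(
    {"prompt": prompt, "multi_modal_data": {"audio": (audio, sr)}},
    SamplingParams(temperature=0.2, max_tokens=64),
)
print(outputs[0].outputs[0].text)
```

For the "Python inference server" part of your question, vLLM also ships an OpenAI-compatible server (`vllm serve <model> --tensor-parallel-size 2`), which may cover that use case depending on your vLLM version.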
