
How can I run the realtime model locally on my Linux machine? #149

Open
CyberTimon opened this issue Nov 15, 2024 · 1 comment

Comments

@CyberTimon

Hello

Ultravox looks really interesting.

Is it possible to run this model on my own Linux machine, which has 2x RTX 3090 24 GB GPUs?
I couldn't find a Python inference server or anything similar.

Thank you!
Kind regards,
Timon

@johnwick123f

@CyberTimon I believe both vLLM and Transformers support Ultravox, with vLLM being the faster option. Here is the vLLM example script for running Ultravox:

https://github.com/vllm-project/vllm/blob/661a34fd4fdd700a29b2db758e23e4e243e7ff18/examples/offline_inference_audio_language.py#L23
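A minimal offline-inference sketch based on that example follows; it is not a drop-in script. The model revision `fixie-ai/ultravox-v0_3`, the `<|reserved_special_token_0|>` audio placeholder (taken from the linked example), and the `question.wav` file are assumptions for illustration. `tensor_parallel_size=2` shards the weights across your two 3090s:

```python
# Sketch of offline Ultravox inference with vLLM, adapted from the linked
# example. Assumes `pip install vllm librosa` and a local question.wav file.
import librosa
from transformers import AutoTokenizer
from vllm import LLM, SamplingParams

model_name = "fixie-ai/ultravox-v0_3"  # assumed model revision

# tensor_parallel_size=2 splits the model across both RTX 3090s.
llm = LLM(model=model_name, tensor_parallel_size=2)

# Build the prompt with the model's chat template; the audio placeholder
# token below is the one used in the linked vLLM example.
tokenizer = AutoTokenizer.from_pretrained(model_name)
messages = [{
    "role": "user",
    "content": "<|reserved_special_token_0|>\nWhat is said in this audio?",
}]
prompt = tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)

# Load the audio as a (waveform, sample_rate) pair for multi_modal_data.
audio, sr = librosa.load("question.wav", sr=16000)  # placeholder path

outputs = llm.generate(
    {"prompt": prompt, "multi_modal_data": {"audio": (audio, sr)}},
    SamplingParams(temperature=0.2, max_tokens=64),
)
print(outputs[0].outputs[0].text)
```

For the "Python inference server" part of your question, vLLM also ships an OpenAI-compatible server (`vllm serve <model> --tensor-parallel-size 2`), which may cover that use case depending on your vLLM version.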
