Releases: VectorInstitute/vector-inference

v0.4.0.post1

28 Nov 19:23
dba901b
  • Fixed an incorrect dependency
  • Updated README files

v0.4.0

28 Nov 18:21
d221dae
  • Onboarded several new models and two new model types: text embedding models and reward reasoning models.
  • Added a metrics command that streams performance metrics from a running inference server.
  • Added more launch command options: --max-num-seqs, --model-weights-parent-dir, --pipeline-parallelism, and --enforce-eager (see the sketch after this list).
  • Improved support for launching custom models.
  • Improved command response time.
  • Improved visuals for the list command.
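
A sketch of how the new options and the metrics command fit together, assuming metrics takes the Slurm job ID reported by launch (the model name, path, and boolean value shapes below are illustrative, not confirmed by these notes):

    # Launch with the new options
    vec-inf launch Meta-Llama-3.1-8B-Instruct \
        --max-num-seqs 256 \
        --model-weights-parent-dir /path/to/model/weights \
        --pipeline-parallelism True \
        --enforce-eager True

    # Stream performance metrics from the running server
    vec-inf metrics <slurm_job_id>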

v0.3.3

03 Sep 21:53
d10758d
  • Added a missing package to dependencies
  • Fixed pre-commit hooks
  • Linted and formatted code
  • Updated outdated examples

v0.3.2

03 Sep 18:27
39b98a2
  • Added support for custom models: users can now launch any model whose architecture is supported by vLLM (see the sketch after this list)
  • Minor updates to multi-node job launching to better support custom models
  • Added Llama3-OpenBioLLM-70B to the supported model list
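
A minimal sketch of the custom-model flow, assuming launch accepts an arbitrary model name (my-custom-model is hypothetical, and any extra resource flags a custom model might need are omitted here):

    # Works as long as the architecture is supported by vLLM
    vec-inf launch my-custom-model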

v0.3.1

29 Aug 13:41
f43d7bf
  • Added a model-name argument to the list command to show the default setup of a specific supported model (see the example after this list)
  • Improved command option descriptions
  • Restructured the models directory
  • Added default values for launching custom models
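
For example, assuming the model name is passed positionally (the name below is illustrative):

    # Show all available models
    vec-inf list

    # Show the default setup for one supported model
    vec-inf list Meta-Llama-3.1-8B-Instruct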

v0.3.0

29 Aug 06:09
156dfa5
  • Added vec-inf CLI (see the workflow sketch after this list):

    • Install vec-inf via pip
    • launch command to launch models
    • status command to check inference server status
    • shutdown command to stop inference server
    • list command to see all available models
  • Upgraded vLLM to 0.5.4

  • Added support for new model families:

    • Llama 3.1 (including 405B)
    • Gemma 2
    • Phi 3
    • Mistral Large
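
A sketch of the basic workflow with the new CLI, assuming status and shutdown take the Slurm job ID reported by launch (the model name is illustrative):

    pip install vec-inf

    # Launch an inference server for a supported model
    vec-inf launch Meta-Llama-3.1-8B-Instruct

    # Check on the server, and stop it when done (<job_id> comes from the launch output)
    vec-inf status <job_id>
    vec-inf shutdown <job_id>

    # See all available models
    vec-inf list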

v0.2.1

06 Jul 15:58
2c43a25
  • Added CodeLlama
  • Updated model variant names for Llama 2 in the README

v0.2.0

04 Jul 14:29
635e13f
  • Updated the default environment to use a Singularity container and added the associated Dockerfile
  • Updated vLLM to 0.5.0, added VLM support (LLaVA-1.5 and LLaVA-NeXT), and updated the example scripts
  • Refactored the repo structure to simplify the model onboarding and update process

v0.1.1

23 May 20:32
  • Updated vLLM to 0.4.2, which resolves the "flash attention package not found" issue
  • Updated instructions for using the default environment to prevent/resolve the "NCCL not found" error

v0.1.0

24 Apr 20:21
0784588

Easy-to-use high-throughput LLM inference on Slurm clusters using vLLM

Supported models and variants:

  • Command R+
  • DBRX: Instruct
  • Llama 2: 7b, 7b-chat, 13b, 13b-chat, 70b, 70b-chat
  • Llama 3: 8B, 8B-Instruct, 70B, 70B-Instruct
  • Mixtral: 8x7B-Instruct-v0.1, 8x22B-v0.1, 8x22B-Instruct-v0.1

Supported functionalities:

  • Completions and chat completions (see the request sketch after this list)
  • Logits generation
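
Once a server is running, requests go straight to it; a minimal sketch assuming the standard vLLM OpenAI-compatible completions endpoint (host, port, and model name are placeholders):

    curl http://<server_host>:<port>/v1/completions \
        -H "Content-Type: application/json" \
        -d '{
              "model": "Meta-Llama-3-8B-Instruct",
              "prompt": "The capital of France is",
              "max_tokens": 16
            }'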