Add support for vLLM embedding models #29

petermuller · 2024-06-13T23:23:43Z

This change adds explicit support for vLLM embedding models (all one of them so far) for use within LISA. Either LiteLLM or LangChain was defaulting the embeddings API to using base64 as the encoding format, which the intfloat model threw errors on. By changing the encoding format to float, we preserve the functionality of the existing RAG implementation, and prevent the intfloat model from failing when it's called with default parameters.

Tested by deploying to my account and using the intfloat model for document upload and search within the chat UI.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

Add support for vLLM embedding models

d4cec97

petermuller requested a review from estohlmann June 13, 2024 23:23

petermuller self-assigned this Jun 13, 2024

estohlmann approved these changes Jun 14, 2024

View reviewed changes

petermuller merged commit 52fb8ae into main Jun 14, 2024
2 checks passed

petermuller deleted the feature/vllm-embeddings branch June 14, 2024 18:36

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for vLLM embedding models #29

Add support for vLLM embedding models #29

petermuller commented Jun 13, 2024

Add support for vLLM embedding models #29

Add support for vLLM embedding models #29

Conversation

petermuller commented Jun 13, 2024