Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Documentation Updates for v2 #20

Merged
merged 3 commits into from
Jun 6, 2024
Merged

Conversation

petermuller
Copy link
Contributor

Update README for the updates that we are pushing for LISA V2.

Includes:

  • New architecture diagrams
  • vLLM docs
  • OpenAI Compatibility usage and examples
  • LiteLLM configuration + SageMaker and Bedrock documentation items were added in a previous commit: lines 93-112 if you would like to expand and provide feedback on those too

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@petermuller petermuller requested a review from estohlmann June 6, 2024 00:16
@petermuller petermuller self-assigned this Jun 6, 2024
curl -s -H 'Authorization: Bearer your-bearer-token' -X GET https://${lisa_serve_alb}/v2/serve/models
```

When using a library that requests an OpenAI-compatible base_url, you can provide `https://${lisa_serve_alb}/v2/serve` here. All of the OpenAI routes will
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just want to make sure lisa_serve_alb is still corect with the ALB complications we had.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, these requests still go to the load balancer and not the APIGW endpoint

@petermuller petermuller merged commit a90e35a into main Jun 6, 2024
2 checks passed
@petermuller petermuller deleted the feature/documentation-updates branch June 6, 2024 16:44
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants