Implement UpdateModel backend workflow #117

petermuller · 2024-09-27T01:19:17Z

Creates UpdateModel state machine implementation with frontend handler validations to force as many synchronous validations as possible before actually hitting the async workflow, which is a much user friendly experience, and easier to debug.

Model only polls for capacity between stopped and in-service states as the model is still functional in other scenarios. When starting a model, the state machine first waits for the desired number of instances to spin up, THEN it waits for the user-defined warmup time before adding the model to LiteLLM, that way customers can't try to make inference requires prior to the models actually spinning up. the model instances pop up healthy before the models are fully initialized, so this wait is necessary to ensure that the models are working before we open them back up to requests.

known issues:

UI does not seem to send the correct payload for updating ASG configuration to the backend
- UI is sending autoScalingConfig instead of autoScalingUpdateConfig
UI cannot update from embedding to textgen or from non-streaming to streaming
UI does not allow a user to start a stopped model without refreshing the page first if the model was stopped in that browser session

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

petermuller requested review from dustins and estohlmann September 27, 2024 01:19

petermuller self-assigned this Sep 27, 2024

petermuller force-pushed the feature/updatemodel-statemachine-impl branch from f7de2ff to 93fe870 Compare September 27, 2024 20:20

petermuller marked this pull request as ready for review September 27, 2024 20:21

petermuller changed the title ~~Feature/updatemodel statemachine impl~~ Implement UpdateModel backend workflow Sep 27, 2024

petermuller added 3 commits September 30, 2024 14:02

Add API validations for UpdateModel

9af2bc6

Add UpdateModel state machine implementation

2603a17

Bugfixes in update workflow

616bd97

petermuller force-pushed the feature/updatemodel-statemachine-impl branch from 93fe870 to 616bd97 Compare September 30, 2024 21:02

estohlmann approved these changes Oct 1, 2024

View reviewed changes

estohlmann merged commit 8d12ca6 into develop Oct 1, 2024
4 checks passed

estohlmann deleted the feature/updatemodel-statemachine-impl branch October 1, 2024 03:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement UpdateModel backend workflow #117

Implement UpdateModel backend workflow #117

petermuller commented Sep 27, 2024 •

edited

Loading

Implement UpdateModel backend workflow #117

Implement UpdateModel backend workflow #117

Conversation

petermuller commented Sep 27, 2024 • edited Loading

petermuller commented Sep 27, 2024 •

edited

Loading