Adds LLM Rest API compatibility for Studio 2.0 API mode. #2083

monorimet · 2024-02-03T05:46:38Z

TODO: rebase without SD changes or land sd-studio2 first.

UI/app structure and utility implementation. - Initializers for webui/API launch - Schedulers file for SD scheduling utilities - Additions to API-level utilities - Added embeddings module for LoRA, Lycoris, yada yada - Added image_processing module for resamplers, resize tools, transforms, and any image annotation (PNG metadata) - shared_cmd_opts module -- sorry, this is stable_args.py. It lives on. We still want to have some global control over the app exclusively from the command-line. At least we will be free from shark_args. - Moving around some utility pieces. - Try to make api+webui concurrency possible in index.py - SD UI -- this is just img2imgUI but hopefully a little better. - UI utilities for your nod logos and your gradio temps. Enable UI / bugfixes / tweaks

* Updates ProcessLoRA to use both embedded LoRA alpha, and lora_strength optional parameter (default 1.0) when applying LoRA weights. * Updates ProcessLoRA to cover more dim cases. * This bring ProcessLoRA into line with PR #2015 against Studio1

* Remove duplicate os import * Remove duplicate parse_seed_input function Migrating to JSON requests in SD UI More UI and app flow improvements, logging, shared device cache Model loading Complete SD pipeline. Tweaks to VAE, pipeline states Pipeline tweaks, add cmd_opts parsing to sd api

* Streaming LLM * Update precision and add gpu support * (studio2) Separate weights generation for quantization support * Adapt prompt changes to studio flow * Remove outdated flag from llm compile flags. * (studio2) use turbine vmfbRunner * tweaks to prompts * Update CPU path and llm api test. * Change device in test to cpu. * Fixes to runner, device names, vmfb mgmt * Use small test without external weights.

…2080) * HF-Reference LLM mode. * Fixup test to match current output from Turbine. * lint * Fix test error message + Only initialize HF torch model when used. * Remove redundant format_out change.

monorimet and others added 12 commits January 17, 2024 12:14

Add test for SD

a43c559

Small cleanup

019ba70

HF-Reference LLM mode + Update test result to match latest Turbine. (#…

934c352

…2080) * HF-Reference LLM mode. * Fixup test to match current output from Turbine. * lint * Fix test error message + Only initialize HF torch model when used. * Remove redundant format_out change.

Add rest API endpoint from LanguageModel API

83a1446

Merge branch 'main' into llm-rest-api

c4f3526

Formatting and init files.

198c42c

Remove unused import.

f009c8b

Merge branch 'main' into llm-rest-api

1dba5dd

monorimet requested review from dan-garvey and IanNod February 12, 2024 17:30

monorimet added 3 commits February 12, 2024 11:31

Merge branch 'main' into llm-rest-api

b983cb3

Merge branch 'main' into llm-rest-api

32557ab

Merge branch 'main' into llm-rest-api

859624e

monorimet closed this May 31, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds LLM Rest API compatibility for Studio 2.0 API mode. #2083

Adds LLM Rest API compatibility for Studio 2.0 API mode. #2083

monorimet commented Feb 3, 2024

Adds LLM Rest API compatibility for Studio 2.0 API mode. #2083

Adds LLM Rest API compatibility for Studio 2.0 API mode. #2083

Conversation

monorimet commented Feb 3, 2024