Adds LLM Rest API compatibility for Studio 2.0 API mode. #2083

Closed
wants to merge 15 commits into from

Commits on Jan 17, 2024

  1. (WIP): Studio2 app infra and SD API

    UI/app structure and utility implementation.
    
    - Initializers for webui/API launch
    - Schedulers file for SD scheduling utilities
    - Additions to API-level utilities
    - Added embeddings module for LoRA, Lycoris, yada yada
    - Added image_processing module for resamplers, resize tools,
      transforms, and any image annotation (PNG metadata)
    - shared_cmd_opts module -- sorry, this is stable_args.py. It lives on.
      We still want to have some global control over the app exclusively
      from the command-line. At least we will be free from shark_args.
    - Moving around some utility pieces.
    - Try to make api+webui concurrency possible in index.py
    - SD UI -- this is just img2imgUI but hopefully a little better.
    - UI utilities for your nod logos and your gradio temps.
    
    Enable UI / bugfixes / tweaks
    monorimet committed Jan 17, 2024
    dbacc36
  2. Studio2/SD: Use more correct LoRA alpha calculation (#2034)

    * Updates ProcessLoRA to use both embedded LoRA alpha, and lora_strength
    optional parameter (default 1.0) when applying LoRA weights.
    * Updates ProcessLoRA to cover more dim cases.
    * This brings ProcessLoRA into line with PR #2015 against Studio1
    one-lithe-rune authored and monorimet committed Jan 17, 2024
    cdf2eb5
  3. Studio2: Remove duplications from api/utils.py (#2035)

    * Remove duplicate os import
    * Remove duplicate parse_seed_input function
    
    Migrating to JSON requests in SD UI
    
    More UI and app flow improvements, logging, shared device cache
    
    Model loading
    
    Complete SD pipeline.
    
    Tweaks to VAE, pipeline states
    
    Pipeline tweaks, add cmd_opts parsing to sd api
    one-lithe-rune authored and monorimet committed Jan 17, 2024
    7a0017d
  4. Add test for SD

    monorimet committed Jan 17, 2024
    a43c559
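
For readers unfamiliar with the LoRA alpha change in commit cdf2eb5, here is a minimal sketch of how an embedded alpha and an optional strength multiplier are commonly combined when merging LoRA weights. This is not the ProcessLoRA code from this PR: `merge_lora`, `matmul`, and every parameter name except `lora_strength` are hypothetical. The `alpha / rank` scaling is the usual LoRA convention and is assumed here rather than confirmed by the commit. Only the plain 2-D case is shown, not the additional dim cases the commit mentions.

```python
def matmul(a, b):
    """Multiply two matrices given as lists of rows (hypothetical helper)."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_lora(weight, lora_up, lora_down, alpha=None, lora_strength=1.0):
    """Return weight + lora_strength * (alpha / rank) * (lora_up @ lora_down).

    weight:    out_features x in_features
    lora_up:   out_features x rank
    lora_down: rank x in_features
    alpha:     value embedded in the LoRA file; assumed to fall back to the
               rank (scale of 1.0) when missing, which is a common convention
    """
    rank = len(lora_down)
    if alpha is None:
        alpha = rank
    scale = lora_strength * alpha / rank
    delta = matmul(lora_up, lora_down)
    return [
        [w + scale * d for w, d in zip(w_row, d_row)]
        for w_row, d_row in zip(weight, delta)
    ]


base = [[1.0, 1.0], [1.0, 1.0]]
up = [[1.0], [2.0]]      # 2 x 1, so rank is 1
down = [[3.0, 4.0]]      # 1 x 2
merged = merge_lora(base, up, down, alpha=2.0, lora_strength=1.0)
```

With the default `lora_strength` of 1.0 the result depends only on the embedded alpha and the rank, so a caller who never passes a strength gets the file's own scaling.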

Commits on Feb 2, 2024

  1. Small cleanup

    monorimet committed Feb 2, 2024
    019ba70
  2. Add StreamingLLM support to studio2 chat (#2060)

    * Streaming LLM
    
    * Update precision and add gpu support
    
    * (studio2) Separate weights generation for quantization support
    
    * Adapt prompt changes to studio flow
    
    * Remove outdated flag from llm compile flags.
    
    * (studio2) use turbine vmfbRunner
    
    * tweaks to prompts
    
    * Update CPU path and llm api test.
    
    * Change device in test to cpu.
    
    * Fixes to runner, device names, vmfb mgmt
    
    * Use small test without external weights.
    monorimet committed Feb 2, 2024
    2935166
  3. HF-Reference LLM mode + Update test result to match latest Turbine. (#2080)
    
    * HF-Reference LLM mode.
    
    * Fixup test to match current output from Turbine.
    
    * lint
    
    * Fix test error message + Only initialize HF torch model when used.
    
    * Remove redundant format_out change.
    raikonenfnu authored and monorimet committed Feb 2, 2024
    934c352

Commits on Feb 3, 2024

  1. 83a1446

Commits on Feb 5, 2024

  1. c4f3526
  2. Formatting and init files.

    monorimet committed Feb 5, 2024
    198c42c
  3. Remove unused import.

    monorimet committed Feb 5, 2024
    f009c8b

Commits on Feb 7, 2024

  1. 1dba5dd

Commits on Feb 12, 2024

  1. b983cb3

Commits on Feb 19, 2024

  1. 32557ab

Commits on Apr 19, 2024

  1. 859624e