Pull requests: openvinotoolkit/openvino.genai
Pull requests list
Add a command for whisper quantization
no-match-files
#1422
opened Dec 22, 2024 by
nikita-savelyevv
Pl bench
category: cmake / build
Cmake scripts
category: continuous batching
Continuous batching
category: GenAI C++ API
Changes in GenAI C++ public headers
category: GHA
CI based on Github actions
category: LLM
LLM pipeline (stateful, static)
category: Python API
Python API for GenAI
category: samples
GenAI samples
category: sampling
Sampling / Decoding algorithms
category: speculative decoding
Speculative decoding
no-match-files
remove redundant .tolist()
bug
Something isn't working
category: llm_bench
Label for tool/llm_bench folder
Use get_max_new_tokens() instead of max_new_tokens field when stopping…
category: sampling
Sampling / Decoding algorithms
#1417
opened Dec 20, 2024 by
michalkulakowski
Support unfixed kv heads number
category: continuous batching
Continuous batching
no-match-files
#1416
opened Dec 20, 2024 by
mangguo321
[test] Ensure that the first token generation is not included into TPOT
bug
Something isn't working
category: LLM
LLM pipeline (stateful, static)
no-match-files
port to LTS
PR needs to be ported to LTS
[Samples] merge LLM samples to "text_generation" folder
category: cmake / build
Cmake scripts
category: GHA
CI based on Github actions
category: samples
GenAI samples
#1411
opened Dec 19, 2024 by
olpipi
add performance statistics for image generation
category: GenAI C++ API
Changes in GenAI C++ public headers
category: Python API
Python API for GenAI
category: samples
GenAI samples
category: text to image
Text 2 image pipeline
#1405
opened Dec 18, 2024 by
xufang-lisa
Draft
Add performance statistics for speculative decoding
category: continuous batching
Continuous batching
category: samples
GenAI samples
category: speculative decoding
Speculative decoding
#1403
opened Dec 18, 2024 by
xufang-lisa
Draft
Cross referencing blogs in genai samples readme
category: samples
GenAI samples
#1399
opened Dec 17, 2024 by
DimaPastushenkov
Removed WAs for OpenVINO: pass properties as is
category: continuous batching
Continuous batching
category: GHA
CI based on Github actions
category: LLM
LLM pipeline (stateful, static)
category: Python API
Python API for GenAI
category: speculative decoding
Speculative decoding
category: text to image
Text 2 image pipeline
category: tokenizers
Tokenizer class or submodule update
category: visual language
Visual language pipeline
category: whisper
Whisper pipeline
no-match-files
[GHA] Samples tests
category: cmake / build
Cmake scripts
category: GHA
CI based on Github actions
no-match-files
[LLM Bench] Allow Image Generation Models to Run in BF16
category: llm_bench
Label for tool/llm_bench folder
[CB]Support 4-bit cache
category: continuous batching
Continuous batching
do_not_merge
no-match-files
#1366
opened Dec 12, 2024 by
zhangYiIntel
Draft
Dynamic KV cache allocation
category: continuous batching
Continuous batching
category: LLM
LLM pipeline (stateful, static)
category: samples
GenAI samples
category: speculative decoding
Speculative decoding
no-match-files
[LLM Bench] Defining Framework in Torch Compile Benchmarking
category: llm_bench
Label for tool/llm_bench folder
[WIP] LoRA for FLUX
category: GenAI C++ API
Changes in GenAI C++ public headers
category: text to image
Text 2 image pipeline
Drop check of 'import openvino'
category: cmake / build
Cmake scripts
category: Python API
Python API for GenAI
#1299
opened Dec 4, 2024 by
ilya-lavrenov
Draft
Add slice before matmul transformation for CB scenario
category: continuous batching
Continuous batching
category: LLM
LLM pipeline (stateful, static)
category: sampling
Sampling / Decoding algorithms
category: speculative decoding
Speculative decoding
no-match-files
[VLM] Image resize model
category: GHA
CI based on Github actions
category: tokenizers
Tokenizer class or submodule update
category: visual language
Visual language pipeline
Parallel sampling with threadpool
category: continuous batching
Continuous batching
category: sampling
Sampling / Decoding algorithms
no-match-files