Batch inference #1278

VietDunghacker · 2024-07-03T06:40:50Z

How to perform batch inference with swift? I don't see it mentioned anywhere in the docs and I cannot find it in the code either.

Jintao-Huang · 2024-07-03T07:27:21Z

Using the infer_backend vllm allows for batch inference.

https://github.com/modelscope/swift/blob/main/docs/source/LLM/VLLM%E6%8E%A8%E7%90%86%E5%8A%A0%E9%80%9F%E4%B8%8E%E9%83%A8%E7%BD%B2.md

The "inference_vllm" can take a "request_list" as input.

VietDunghacker · 2024-07-03T08:47:25Z

Thank you.

VietDunghacker · 2024-07-03T16:50:52Z

@Jintao-Huang
vllm is great, but unfortunately vllm does not support all models in this repo. For instance, Phi-3 Vision is supported in their Github repo but not in the official pip version.
I really think it will be helpful if the feature is implemented natively in swift instead of relying on vllm.

tastelikefeet · 2024-07-09T03:39:59Z

Thanks for you suggestion! We have added batch inference for pytorch native to our todo list. This requirement will be accomplished in one sprint

VietDunghacker closed this as completed Jul 3, 2024

VietDunghacker reopened this Jul 3, 2024

tastelikefeet added the enhancement New feature or request label Jul 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batch inference #1278

Batch inference #1278

VietDunghacker commented Jul 3, 2024

Jintao-Huang commented Jul 3, 2024

VietDunghacker commented Jul 3, 2024

VietDunghacker commented Jul 3, 2024 •

edited

Loading

tastelikefeet commented Jul 9, 2024

Batch inference #1278

Batch inference #1278

Comments

VietDunghacker commented Jul 3, 2024

Jintao-Huang commented Jul 3, 2024

VietDunghacker commented Jul 3, 2024

VietDunghacker commented Jul 3, 2024 • edited Loading

tastelikefeet commented Jul 9, 2024

VietDunghacker commented Jul 3, 2024 •

edited

Loading