Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Batch inference #1278

Open
VietDunghacker opened this issue Jul 3, 2024 · 4 comments
Open

Batch inference #1278

VietDunghacker opened this issue Jul 3, 2024 · 4 comments
Labels
enhancement New feature or request

Comments

@VietDunghacker
Copy link

How to perform batch inference with swift? I don't see it mentioned anywhere in the docs and I cannot find it in the code either.

@Jintao-Huang
Copy link
Collaborator

Using the infer_backend vllm allows for batch inference.

https://github.com/modelscope/swift/blob/main/docs/source/LLM/VLLM%E6%8E%A8%E7%90%86%E5%8A%A0%E9%80%9F%E4%B8%8E%E9%83%A8%E7%BD%B2.md

The "inference_vllm" can take a "request_list" as input.

@VietDunghacker
Copy link
Author

Thank you.

@VietDunghacker
Copy link
Author

VietDunghacker commented Jul 3, 2024

@Jintao-Huang
vllm is great, but unfortunately vllm does not support all models in this repo. For instance, Phi-3 Vision is supported in their Github repo but not in the official pip version.
I really think it will be helpful if the feature is implemented natively in swift instead of relying on vllm.

@VietDunghacker VietDunghacker reopened this Jul 3, 2024
@tastelikefeet
Copy link
Collaborator

Thanks for you suggestion! We have added batch inference for pytorch native to our todo list. This requirement will be accomplished in one sprint

@tastelikefeet tastelikefeet added the enhancement New feature or request label Jul 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants