Large Model High-Concurrency Deployment Investigate and Discuss #12113
xueshuai0922
announced in
General
Replies: 1 comment 1 reply
-
anyone have a idea which will solve this problem? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Large Model High-Concurrency Deployment Investigate and Discuss
Overview
Prerequisites:
Key Points:
VRAM Requirement Analysis:
Hardware Configuration Recommendations:
Deployment Solutions:
Comparison of Inference Tools:
LMDeploy vs vLLM:
Beta Was this translation helpful? Give feedback.
All reactions