🐑
i++
USTC | ADSL | MLSYS | MLPerf | LLM
- Shanghai, China
- 18:09 (UTC +08:00)
Pinned
- vllm (Public, forked from vllm-project/vllm)
  A high-throughput and memory-efficient inference and serving engine for LLMs
  Python
- DeepSpeed (Public, forked from deepspeedai/DeepSpeed)
  DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
  Python
- microsoft/pai (Public archive)
  Resource scheduling and cluster management for AI
- DeepSpeed-MII (Public, forked from deepspeedai/DeepSpeed-MII)
  MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
  Python