SqueezeAILab
SqueezeAI is part of the Berkeley AI Research Lab at UC Berkeley and focuses on AI systems research.
Popular repositories
- LLMCompiler (Public)
  [ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
- SqueezedAttention (Public)
  SQUEEZED ATTENTION: Accelerating Long Prompt LLM Inference
Repositories
Showing 10 of 13 repositories
- reward-under-attack (Public)
- QuantSpec (Public)
- KVQuant (Public)
  [NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Top languages
Python