Skip to content

Commit d3f4127

Browse files
fix for quality check and update dep ver suggestions
Signed-off-by: cliu-us <[email protected]>
1 parent 5bafb22 commit d3f4127

File tree

2 files changed

+1
-3
lines changed

2 files changed

+1
-3
lines changed

examples/quantization_w8a8_fp8/README_granite4.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,7 @@ pip install llmcompressor
2424
```
2525

2626
This checkpoint format will need the latest vllm (ver >= 0.10.1.1) to run correctly. Additional dependencies and environment variables needed are:
27-
1. Dependencies: `vllm=0.10.1.1, lm_eval=0.4.9.1, flash-attn=2.7.3, torch=2.7.1`
27+
1. Dependencies: `vllm>=0.10.1.1, lm_eval>=0.4.9.1, flash-attn=2.7.3, torch>=2.7.1`
2828
2. ENV VAR: `VLLM_USE_V1=0, VLLM_WORKER_MULTIPROC_METHOD=spawn`
2929

3030
## Quickstart

src/llmcompressor/modeling/granite4.py

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,4 @@
11
import torch
2-
from compressed_tensors.quantization import QuantizationStatus
3-
from compressed_tensors.utils import register_offload_parameter
42
from transformers.models.granitemoehybrid.modeling_granitemoehybrid import (
53
GraniteMoeHybridParallelExperts,
64
)

0 commit comments

Comments
 (0)