[WIP] Support for more gguf format and float zp for Q*_1 #560

n1ck-guo · 2025-05-14T01:20:42Z

No description provided.

wenhuach21 · 2025-05-14T01:26:06Z

auto_round/autoround.py

@@ -162,6 +162,7 @@ def __init__(
            device_map: Union[str, dict] = None,
            super_bits: int = None,
            super_group_size: int = None,
+            float_zp: bool = False,


When the user exports only to GGUF in a q*_1 format, switch to float zero points directly (by adding a new data type); there is no need to add extra arguments to the API or to `WrapperLinear`. If GPTQ/AWQ formats are also exported, use int zero points and emit a warning that this may reduce the accuracy of the GGUF output.
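
A minimal sketch of the selection logic suggested above, so that no `float_zp` argument is needed. The helper name `resolve_zero_point_mode`, the `"gguf:q4_1"`-style format strings, and the returned `"float"`/`"int"` labels are all assumptions for illustration; none of them are taken from this PR or from the auto_round API.

```python
import warnings


def resolve_zero_point_mode(export_formats):
    """Pick the zero-point datatype from the requested export formats.

    Hypothetical helper (not part of auto_round). Assumes format names
    such as "gguf:q4_1", "gguf:q5_1", "auto_gptq", "auto_awq".
    """
    formats = [f.lower() for f in export_formats]

    # GGUF q*_1 formats: names that start with "gguf" and end with "_1".
    gguf_q1 = [f for f in formats if f.startswith("gguf") and f.endswith("_1")]
    # Anything that is not a GGUF format (e.g. GPTQ/AWQ) needs int zero points.
    non_gguf = [f for f in formats if not f.startswith("gguf")]

    if gguf_q1 and not non_gguf:
        # Only GGUF is exported: use float zero points directly.
        return "float"

    if gguf_q1 and non_gguf:
        # Mixed export: keep int zero points and warn about GGUF accuracy.
        warnings.warn(
            "Exporting %s together with %s: using int zero points, "
            "which may reduce the accuracy of the GGUF output."
            % (", ".join(gguf_q1), ", ".join(non_gguf))
        )

    return "int"
```

With this shape, the decision is derived once from the export formats, and the quantization wrapper would only need to receive the resulting datatype rather than a separate boolean flag.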

n1ck-guo and others added 3 commits May 13, 2025 21:19

support for float zp and q5_0/1

6fd5a4a

Signed-off-by: n1ck-guo <[email protected]>

Merge branch 'main' into hengguo/gguf_extend

9027e94

[pre-commit.ci] auto fixes from pre-commit.com hooks

6709896

for more information, see https://pre-commit.ci

wenhuach21 reviewed May 14, 2025

n1ck-guo added 2 commits May 14, 2025 03:05

support for q5_k_s

489b557

Signed-off-by: n1ck-guo <[email protected]>

support for q3_k

48669db

Signed-off-by: n1ck-guo <[email protected]>
