@timBoML timBoML commented Nov 15, 2024

Hello Byaldi Team!

Description

I added BitsAndBytes support for all of us GPU-poor people. This enables 4-bit/8-bit quantization so the models can run on smaller GPUs or, in my case, leave VRAM free for a bigger LLM.

Changes Made

  • Added BitsAndBytes quantization options to model loading
  • Updated dependencies to include bitsandbytes
  • Added quant_strategy in the example notebook
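Usage of the new option might look like the sketch below. The `quant_strategy` name comes from the PR description; the mapping of each strategy to BitsAndBytes-style loading kwargs, and the helper function itself, are illustrative assumptions, not Byaldi's confirmed API.

```python
def quantization_kwargs(quant_strategy):
    """Map a quant_strategy string to BitsAndBytes-style loading kwargs.

    Hypothetical helper showing the PR's idea; the exact option names
    Byaldi exposes may differ.
    """
    if quant_strategy is None:
        # No quantization: load the model in full precision.
        return {}
    if quant_strategy == "4bit":
        # NF4 with bfloat16 compute is a common default for 4-bit loading.
        return {
            "load_in_4bit": True,
            "bnb_4bit_quant_type": "nf4",
            "bnb_4bit_compute_dtype": "bfloat16",
        }
    if quant_strategy == "8bit":
        return {"load_in_8bit": True}
    raise ValueError(f"Unknown quant_strategy: {quant_strategy!r}")
```

These kwargs would then be forwarded to the underlying model's `from_pretrained` call when a quantization strategy is requested.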

Testing

  • I am using Byaldi in a commercial setting, and 4-bit quantization did not noticeably degrade retrieval performance in my use.

@timBoML timBoML closed this Nov 15, 2024
@timBoML commented Nov 15, 2024

I found a typo.

@timBoML timBoML reopened this Nov 15, 2024