Releases: LLukas22/llm-rs-python
Custom RoPE support & small LangChain bugfixes
Better Hugging Face Hub Integration
Simplified the interaction with other GGML-based repos on the Hub, such as TheBloke/Llama-2-7B-GGML.
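As a minimal sketch of the Hub workflow, assuming `AutoModel.from_pretrained` accepts a Hub repo id (the `model_file` keyword, the file name, and the `.text` attribute on the result are illustrative, not confirmed API):

```python
from llm_rs import AutoModel

# Download the GGML weights from the Hugging Face Hub and load them locally.
model = AutoModel.from_pretrained(
    "TheBloke/Llama-2-7B-GGML",
    model_file="llama-2-7b.ggmlv3.q4_0.bin",  # pick one quantized file from the repo
)

result = model.generate("The Rust programming language is")
print(result.text)  # assumes the result exposes the generated text
```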
Stable GPU Support
Fixed many GPU acceleration bugs in rustformers/llm and improved performance to match native GGML.
Experimental GPU support
Adds support for Metal, CUDA and OpenCL acceleration for LLaMA-based models.
Adds CI for the different acceleration backends to create prebuilt binaries.
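A sketch of enabling acceleration, assuming a `SessionConfig` with a `use_gpu` flag as in the rustformers/llm bindings (the exact field and constructor names are assumptions):

```python
from llm_rs import Llama, SessionConfig

# Route supported layers to the available backend (Metal, CUDA or OpenCL).
config = SessionConfig(use_gpu=True)

model = Llama(
    "path/to/llama-2-7b.ggmlv3.q4_0.bin",
    session_config=config,
)
```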
Added 🌾🔱 Haystack Support + BigCode-Models
- Added support for the Haystack library
- Support "BigCode"-like models (e.g. WizardCoder) via the gpt2 architecture (see the sketch after this list)
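A sketch of loading a BigCode-style model through the gpt2 architecture, assuming a `Gpt2` model class is exported; the class name and file name are assumptions:

```python
from llm_rs import Gpt2

# WizardCoder and similar BigCode models reuse the gpt2 architecture.
model = Gpt2("path/to/wizardcoder-15b.ggmlv3.q4_0.bin")

result = model.generate("def fibonacci(n):")
```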
Added 🦜️🔗 LangChain support
Merged #21 from LLukas22/feat/langchain: adds LangChain support.
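A sketch of the integration, assuming the bindings expose a `RustformersLLM` wrapper under `llm_rs.langchain` (the wrapper and `model_path_or_repo_id` parameter names are assumptions):

```python
from llm_rs.langchain import RustformersLLM
from langchain.prompts import PromptTemplate
from langchain.chains import LLMChain

# Wrap a local GGML model as a LangChain-compatible LLM.
llm = RustformersLLM(model_path_or_repo_id="path/to/model.bin")

prompt = PromptTemplate(
    template="Q: {question}\nA:",
    input_variables=["question"],
)
chain = LLMChain(llm=llm, prompt=prompt)

print(chain.run("What is GGML?"))
```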
Added Hugging Face Tokenizer Support
AutoModel-compatible models now use the official tokenizers library, which improves decoding accuracy, especially for non-LLaMA-based models.
To specify a tokenizer manually, set it via the tokenizer_path_or_repo_id parameter. To use the default GGML tokenizer instead, Hugging Face tokenizer support can be disabled via the use_hf_tokenizer parameter.
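A sketch of both options; the two parameter names come from the release notes, while their placement on `from_pretrained` and the repo/file names are assumptions:

```python
from llm_rs import AutoModel

# Option 1: point at a specific Hugging Face tokenizer.
model = AutoModel.from_pretrained(
    "TheBloke/Llama-2-7B-GGML",
    model_file="llama-2-7b.ggmlv3.q4_0.bin",
    tokenizer_path_or_repo_id="meta-llama/Llama-2-7b-hf",
)

# Option 2: fall back to the built-in GGML tokenizer.
model = AutoModel.from_pretrained(
    "TheBloke/Llama-2-7B-GGML",
    model_file="llama-2-7b.ggmlv3.q4_0.bin",
    use_hf_tokenizer=False,
)
```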
Fixed GPT-J quantization
0.2.8: GPT-J quantization bugfix
Added other quantization formats
Added support for the q5_0, q5_1 and q8_0 formats.
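A sketch only, illustrating the newly supported formats; the `quantize` helper and `QuantizationType` enum are hypothetical names, not confirmed API:

```python
from llm_rs import quantize, QuantizationType  # hypothetical import

# Re-quantize an f16 GGML file into one of the new formats.
quantize(
    source="models/llama-7b-f16.bin",
    destination="models/llama-7b-q5_1.bin",
    quantization=QuantizationType.Q5_1,  # newly added: Q5_0, Q5_1, Q8_0
)
```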
Streaming support
Added the stream method to each model, which returns a generator that can be consumed to stream a response token by token.
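The stream method itself is named in the release; the loading call and prompt below are illustrative:

```python
from llm_rs import AutoModel

model = AutoModel.from_pretrained(
    "TheBloke/Llama-2-7B-GGML",
    model_file="llama-2-7b.ggmlv3.q4_0.bin",
)

# Consume the generator to print tokens as they are produced.
for token in model.stream("Explain GGML in one sentence:"):
    print(token, end="", flush=True)
```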