llm-fine-tuning

Star

Here are 33 public repositories matching this topic...

intel / neural-speed

Star

An innovative library for efficient LLM inference via low-bit quantization

Updated Aug 30, 2024
C++

pdaicode / awesome-LLMs-finetuning

Star

Collection of resources for finetuning Large Language Models (LLMs).

multimodal llm llm-finetuning llm-fine-tuning mmllm

Updated Jan 12, 2025

Datalore-ai / datalore-localgen-cli

Star

synthetic dataset generation workflow using local file resources for finetuning llms.

python openai synthetic-dataset-generation finetuning local-files llm-training llm-fine-tuning agentic-ai mistral-ocr

Updated Sep 16, 2025
Python

BY571 / DistRL-LLM

Star

Distributed Reinforcement Learning for LLM Fine-Tuning with multi-GPU utilization

reinforcement-learning pg r1 multi-gpu-training multi-gpu-inference llm llm-training llm-finetuning llm-fine-tuning grpo reinforcement-learning-fine-tuning

Updated Mar 12, 2025
Python

HewlettPackard / sustain-lc

Star

Sustain-LC is a benchmarking environment for traditional and reinforcement learning based controls as well as LLM based control

benchmarking benchmark control sustainability modelica scaling benchmark-framework servers benchmark-suite datacenter modelica-models modelpredictivecontrol multihead-selection liquid-cooling llm-inference llm-finetuning llm-fine-tuning reinforement-learning neurips-2025

Updated Aug 7, 2025
Jupyter Notebook

studentiz / ConversationsWithGod

Star

A sacred space for heartfelt conversations, where wisdom flows freely and memories gently fade like whispers at sunset.

journaling html-only self-reflection privacy-focused llm-fine-tuning

Updated Apr 15, 2025
HTML

xphot / app

Star

Análise Avançada de Dados com Causalidade e Aprendizado por Reforço

reinforcement-learning bug-tracker data-preprocessing experimental-psychology causal-machine-learning shap-analysis hypergraph-neural-network llms-reasoning llm-fine-tuning explainability-metric unsloth gguf-quantization

Updated Feb 27, 2025
Jupyter Notebook

ethicalabs-ai / FlowerTune-Qwen2.5-Coder-0.5B-Instruct

Sponsor

Star

FlowerTune LLM on Coding Dataset

machine-learning ai ml transformers federated-learning federated-learning-framework transformers-models llm-training llm-finetuning llm-fine-tuning qwen2 qwen2-5

Updated Feb 18, 2025
Python

Eden-Eldith / ChatInsights

Sponsor

Star

The Personal Knowledge Graph You Didn’t Know You Already Wrote

open-source obsidian pkm knowledge-management personal-knowledge-graph chatgpt llm-fine-tuning ai-history chatgpt-export cognitive-tools

Updated May 12, 2025
Python

BY571 / ARC-TTT

Star

ARC-Test-Time-Training (ARC-TTT)

arc fine-tuning test-time-augmentation abstract-reasoning test-time-adaptation test-time-training llm-training llm-finetuning llm-fine-tuning arc-agi

Updated Jan 15, 2025
Python

gdsmith1 / Replicant

Star

Clone your Discord friends with AI!

discord aws-s3 openai aws-ec2 trolling voice-cloning elevenlabs llm-fine-tuning

Updated May 27, 2025
Python

TDRoss / DNA-LLM

Star

Chaining thoughts and LLMs to learn DNA structural biophysics

dna llm-fine-tuning

Updated Mar 5, 2024
Python

Jatin-Mehra119 / Essay-Scoring-Modeling

Star

This repository contains all the notebooks, resources, and documentation used to develop and evaluate models for the Automated Essay Scoring (AES) Kaggle competition. The project aims to build an open-source solution for automated essay evaluation to support educators and provide timely feedback to students.

text-classification linear-regression lightgbm essay-scoring llm-fine-tuning

Updated Dec 30, 2024
Jupyter Notebook

sivakiran7 / Finetuning_LLM

Star

lora quantization quantization-aware-training qlora llm-fine-tuning

Updated Aug 19, 2025
Jupyter Notebook

ethicalabs-ai / FlowerTune-phi-4-NLP

Sponsor

Star

FlowerTune LLM on NLP Dataset

microsoft machine-learning ai ml federated-learning federated-learning-framework llm-training llm-finetuning llm-fine-tuning phi-4 microsoft-phi-4

Updated Jan 15, 2025
Python

hishamp3 / codeDetection

Star

Django implementation of CodeBERT for detecting vulnerable code.

django-framework html-css codebert large-language-models llm-fine-tuning

Updated Dec 29, 2023
Python

Zeldrizz / EmpathyLLM-Finetuning-RU-FEELIX

Star

llama llm-fine-tuning empathy-llm-fine-tunning

Updated Mar 22, 2025
Python

ksm26 / Reinforcement-Fine-Tuning-LLMs-with-GRPO

Star

The course teaches how to fine-tune LLMs using Group Relative Policy Optimization (GRPO)—a reinforcement learning method that improves model reasoning with minimal data. Learn RFT concepts, reward design, LLM-as-a-judge evaluation, and deploy jobs on the Predibase platform.

reinforcement-learning machine-learning-algorithms language-model reward-design rft ai-training deeplearning-ai-courses ai-optimization multi-step-reasoning ai-evaluation rlhf llm-fine-tuning opensource-ai llm-as-judge predibase grpo llm-development token-level-control

Updated Jun 13, 2025
Jupyter Notebook

ethicalabs-ai / FlowerTune-Qwen2.5-7B-Instruct-Medical

Sponsor

Star

FlowerTune LLM on Medical Dataset

machine-learning ai ml federated-learning federated-learning-framework llm-training llm-finetuning llm-fine-tuning qwen2-5

Updated Dec 3, 2024
Python

v-ade-r / LLM-Finetuning-using-LORA

Star

Schematic Blueprint for Finetuning LLM (e.g. Qwen or Llama) for text classification using LORA. Output model can have original or modified head (e.g. for SequenceClassification).

lora sequence-classification llm llm-fine-tuning

Updated Jan 20, 2025
Python

Improve this page

Add a description, image, and links to the llm-fine-tuning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the llm-fine-tuning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

llm-fine-tuning

Here are 33 public repositories matching this topic...

intel / neural-speed

pdaicode / awesome-LLMs-finetuning

Datalore-ai / datalore-localgen-cli

BY571 / DistRL-LLM

HewlettPackard / sustain-lc

studentiz / ConversationsWithGod

xphot / app

ethicalabs-ai / FlowerTune-Qwen2.5-Coder-0.5B-Instruct

Eden-Eldith / ChatInsights

BY571 / ARC-TTT

gdsmith1 / Replicant

TDRoss / DNA-LLM

Jatin-Mehra119 / Essay-Scoring-Modeling

sivakiran7 / Finetuning_LLM

ethicalabs-ai / FlowerTune-phi-4-NLP

hishamp3 / codeDetection

Zeldrizz / EmpathyLLM-Finetuning-RU-FEELIX

ksm26 / Reinforcement-Fine-Tuning-LLMs-with-GRPO

ethicalabs-ai / FlowerTune-Qwen2.5-7B-Instruct-Medical

v-ade-r / LLM-Finetuning-using-LORA

Improve this page

Add this topic to your repo