Fine-tune LLMs on your Mac with Apple Silicon. SFT, DPO, GRPO, Vision, TTS, STT, Embedding, and OCR fine-tuning — natively on MLX. Unsloth-compatible API.

Updated May 31, 2026
Python

always-further / deepfabric

Star

Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline

python open-source data-science machine-learning ai open evaluation source dataset agents synthetic synthetic-data fine-tuning distillation huggingface huggingface-datasets unsloth

Updated Jun 15, 2026
Python

Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc GRPO log diagnostics, evaluation, and export end-to-end. Part of the Gaslamp AI platform.

transformer lora fine-tuning sft dpo huggingface apple-silicon rlhf qlora unsloth grpo claude-code gaslamp

Updated May 7, 2026
Python

ToolBrain / ToolBrain

Star

A framework for agentic tool use training with reinforcement learning

reinforcement-learning fine-tuning dpo tool-use langchain llm-agents unsloth agentic-ai smolagents grpo

Updated Apr 8, 2026
Python

riyanshibohra / TuneKit

Star

Upload your data → Get a fine-tuned SLM. Free.

open-source machine-learning fine-tuning slms llm llms unsloth

Updated Jan 16, 2026
Python

GAD-cell / vlm-grpo

Star

An implementation of GRPO for Unsloth's VLMs training

reinforcement-learning vlm huggingface trl unsloth grpo grpotrainer

Updated Aug 7, 2025
Python

qqqqqf-q / MirrorFlow

Star

从对话数据到训练:数字分身 + 模型蒸馏 From Dialogue Data to Training Closed-Loop: Digital Twin + Model Distillation

ai chatbot openai finetune open-ai digital-twin distill huggingface personal-ai llm ai-avatar qlora qwen finetune-llm unsloth 4o

Updated Mar 19, 2026
Python

JuliusBrussee / cavegemma

Sponsor

Star

LoRA fine-tune Gemma 4 31B to speak caveman-mode natively. Style: github.com/JuliusBrussee/caveman

style-transfer lora gemma caveman fine-tuning llm qlora unsloth

Updated May 17, 2026
Python

sinanuozdemir / oreilly-pytorch-dl

Star

Code for Deep Learning for Modern AI

deep-learning mnist neural-networks llama quantization clip bert diffusion distillation multimodal llms dreambooth unsloth llama3 dreambooth-finetuning

Updated Apr 23, 2026
Jupyter Notebook

Breeze648 / MedCoT-7B

Star

本项目利用医学领域的 CoT 数据对 Deepseek-R1-Distill-Qwen-7B 进行微调，通过 QLoRA 量化和 Unsloth 加速训练，显著提升模型在复杂医学推理任务中的慢思考能力。知识蒸馏技术使轻量级模型获得大模型的推理优势，实现高效、准确且具有解释性的医学问答系统。

nlp ai lora medical-application distillation llm chain-of-thought qlora qwen unsloth deepseek-r1 4-bit-quantization

Updated Mar 10, 2025
Python

Cre4T3Tiv3 / unsloth-llama3-alpaca-lora

Sponsor

Star

Custom model training using modern architectures. 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs. Published adapter on HuggingFace. From training pipeline to deployed model.

open-source transformers colab lora gradio 4bit alpaca peft finetuning huggingface llm instruction-tuning qlora unsloth llama3

Updated Apr 2, 2026
Jupyter Notebook

pranavkumaarofficial / nlcli-wizard

Star

Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)

nlp machine-learning slm quantization gemma fine-tuning cli-tools local-first llm llama-cpp qlora unsloth

Updated Apr 10, 2026
Python

shaheennabi / Production-Ready-Instruction-Finetuning-of-Meta-Llama-3.2-3B-Instruct-Project

Sponsor

Star

Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-aware responses based on conversational inputs. Using the Kannada Instruct dataset for fine-tuning! Happy Finetuning 🎋

Updated Jan 30, 2025
Jupyter Notebook

shlokchorge / Fine-Tuning-QWEN2vl

Star

Fine-tuned Qwen2-VL-7B for LaTeX OCR using LoRA and Unsloth on the LaTeX OCR dataset. Built augmentation pipeline (rotation, noise, contrast jitter), ran LoRA rank sweep (r=8/16/32), and evaluated across CER, Token F1, BLEU-4, and Exact Match. Deployed as a Gradio Space with live metric computation.

ocr latex computer-vision deep-learning transformers lora gradio fine-tuning qwen2 unsloth

Updated May 5, 2026
Jupyter Notebook

Yog-Sotho / LLM-fine-tuner

Star

Powerful no-code LLM fine-tuner: upload data → train → deploy in minutes. Unsloth 2-5× acceleration · QLoRA/DPO/RLHF/PPO/ORPO · Reward Model training · GGUF export · vLLM inference · BLEU/ROUGE/BERTScore · full CLI · Heretic Mode to unlock full model potential

python ai gradio fine-tuning peft dpo llm llms local-llm local-ai ollama lm-studio gguf unsloth abliteration

Updated Jun 18, 2026
Python

Eviltr0N / Make-AI-Clone-of-Yourself

Star

Cloning Yourself using your whatsapp chat history and training a model on it.

ai whatsapp-bot finetuning ai-project whatsapp-python whatsapp-clone llm ollama llm-project unsloth llama3 llama3-rag llama3-finetune ai-clones llama3-8b

Updated Aug 14, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to the unsloth topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the unsloth topic, visit your repo's landing page and select "manage topics."

Learn more

CS Knowledge Base

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

unsloth

Here are 383 public repositories matching this topic...

unslothai / unsloth

AlexsJones / llmfit

unslothai / notebooks

R6410418 / Jackrong-llm-finetuning-guide

ARahim3 / mlx-tune

always-further / deepfabric

TYH-labs / unsloth-buddy

ToolBrain / ToolBrain

riyanshibohra / TuneKit

GAD-cell / vlm-grpo

qqqqqf-q / MirrorFlow

JuliusBrussee / cavegemma

sinanuozdemir / oreilly-pytorch-dl

Breeze648 / MedCoT-7B

Cre4T3Tiv3 / unsloth-llama3-alpaca-lora

pranavkumaarofficial / nlcli-wizard

shaheennabi / Production-Ready-Instruction-Finetuning-of-Meta-Llama-3.2-3B-Instruct-Project

shlokchorge / Fine-Tuning-QWEN2vl

Yog-Sotho / LLM-fine-tuner

Eviltr0N / Make-AI-Clone-of-Yourself

Improve this page

Add this topic to your repo