unsloth
Here are 383 public repositories matching this topic...
Unsloth Studio is a web UI for training and running open models such as Gemma 4, Qwen3.6, DeepSeek, and gpt-oss locally.
Updated Jun 18, 2026 - Python
250+ Fine-tuning & RL Notebooks for text, vision, audio, embedding, TTS models.
Updated Jun 17, 2026 - Jupyter Notebook
Fine-tune LLMs on your Mac with Apple Silicon. SFT, DPO, GRPO, Vision, TTS, STT, Embedding, and OCR fine-tuning — natively on MLX. Unsloth-compatible API.
Updated May 31, 2026 - Python
Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline
Updated Jun 15, 2026 - Python
Zero-friction LLM fine-tuning skill for Claude Code, Gemini CLI & any ACP agent. Unsloth on NVIDIA · TRL+MPS/MLX on Apple Silicon. Automates env setup, LoRA training (SFT, DPO, GRPO, vision), post-hoc GRPO log diagnostics, evaluation, and export end-to-end. Part of the Gaslamp AI platform.
Updated May 7, 2026 - Python
A framework for agentic tool use training with reinforcement learning
Updated Apr 8, 2026 - Python
Upload your data → Get a fine-tuned SLM. Free.
Updated Jan 16, 2026 - Python
An implementation of GRPO for training VLMs with Unsloth
Updated Aug 7, 2025 - Python
From Dialogue Data to Training in a Closed Loop: Digital Twin + Model Distillation
Updated Mar 19, 2026 - Python
LoRA fine-tune Gemma 4 31B to speak caveman-mode natively. Style: github.com/JuliusBrussee/caveman
Updated May 17, 2026 - Python
Code for Deep Learning for Modern AI
Updated Apr 23, 2026 - Jupyter Notebook
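This project fine-tunes Deepseek-R1-Distill-Qwen-7B on medical-domain chain-of-thought (CoT) data, using QLoRA quantization and Unsloth to accelerate training, and significantly improves the model's slow-thinking ability on complex medical reasoning tasks. Knowledge distillation gives the lightweight model the reasoning strengths of larger models, yielding an efficient, accurate, and interpretable medical question-answering system.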
Updated Mar 10, 2025 - Python
Custom model training using modern architectures. 4-bit QLoRA fine-tuning pipeline for LLaMA 3 8B with production-grade optimization. Memory-efficient training on consumer GPUs. Published adapter on HuggingFace. From training pipeline to deployed model.
Updated Apr 2, 2026 - Jupyter Notebook
Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)
Updated Apr 10, 2026 - Python
Instruction Fine-Tuning of Meta Llama 3.2-3B Instruct on Kannada Conversations. Tailoring the model to follow specific instructions in Kannada, enhancing its ability to generate relevant, context-aware responses based on conversational inputs. Using the Kannada Instruct dataset for fine-tuning! Happy Finetuning 🎋
Updated Jan 30, 2025 - Jupyter Notebook
Fine-tuned Qwen2-VL-7B for LaTeX OCR using LoRA and Unsloth on the LaTeX OCR dataset. Built augmentation pipeline (rotation, noise, contrast jitter), ran LoRA rank sweep (r=8/16/32), and evaluated across CER, Token F1, BLEU-4, and Exact Match. Deployed as a Gradio Space with live metric computation.
Updated May 5, 2026 - Jupyter Notebook
Powerful no-code LLM fine-tuner: upload data → train → deploy in minutes. Unsloth 2-5× acceleration · QLoRA/DPO/RLHF/PPO/ORPO · Reward Model training · GGUF export · vLLM inference · BLEU/ROUGE/BERTScore · full CLI · Heretic Mode to unlock full model potential
Updated Jun 18, 2026 - Python
Clone yourself by training a model on your WhatsApp chat history.
Updated Aug 14, 2024 - Jupyter Notebook