Unsloth

Fine-tuning & trainingBreakout · Show HN

Signal summary

Category	Fine-tuning & training
Breakout	Show HN
Launched via	Show HN, Company blog, Founder X post
Owned	Brand X, Founder X, Company blog, Docs-as-SEO, Substack/newsletter, YouTube +2
Distribution	GitHub repo, PyPI, Hugging Face Hub, Colab notebooks
Integrations	Hugging Face ecosystem, Ollama, llama.cpp, TRL/PEFT/transformers
Amplifiers	AI-lab official account, Tech press, NVIDIA, Hugging Face, r/LocalLLaMA

Overview

Open-source LLM fine-tuning (and now local training/inference) library that hand-writes Triton kernels and a manual autograd engine to make finetuning 2-30x faster with up to 70-90% less VRAM, no accuracy loss. Founded by Australian brothers Daniel Han and Michael Han; YC S24; based in San Francisco. Current scale (June 2026): ~66.5K GitHub stars and ~6K forks on unslothai/unsloth; ~68,841 X followers on @UnslothAI (verified via Apify scrape, 2026-06-12); ~24,965 followers and ~1,346 models on their Hugging Face org (one of the most-followed orgs on HF); per their YC profile, 10M+ monthly model downloads.

First public appearance

Two-stage. The genuine first appearance was a soft, low-traction launch on Nov 30, 2023, the same day the X account and the company were created.

Earliest discoverable item: a self-post Show HN, Show HN: Unsloth: finetune Llama 2x faster 50% less memory on your GPU (HN objectID 38475955), submitted by user unsloth on 2023-11-30 17:08 UTC. It got 3 points, 0 comments (effectively flopped). A near-simultaneous submission by danielhanchen, Finetune language models 30x faster, 2023-11-30 15:38 UTC, also flopped at 2 points.

Format: Show HN / link post to the GitHub repo, paired with the unsloth.ai/introducing launch page.

KSP / hero copy they led with: the launch page headline was "Introducing Unsloth: 30x faster LLM training," with the concrete framing "30x faster. Alpaca takes 3 hours instead of 85," "60% less memory usage, allowing 6x larger batches," and "0% loss in accuracy." The open-source free tier was pitched as "2x faster finetuning with 50% less memory," with the 30x reserved for paid Pro/Max tiers. The selling point was a single dramatic speed/memory multiplier on a task (LLM finetuning) that was painful and GPU-poor for the hobbyist audience.

Original launch copy recovered via Wayback (snapshot 20231209081256; 38 captures, first 2023-11-30): confirms the live-page copy verbatim. Headline "Introducing Unsloth - 30x faster AI training," dated "Nov 30, 2023 - By Daniel Han." Highlights as published: "30x faster. Alpaca takes 3 hours instead of 85," "60% less memory usage, allowing 6x larger batches," "0% loss in accuracy or +20% increased accuracy with our Max offering," "No need for new hardware - only software changes," and "Free open source version makes finetuning 2x faster with 50% less memory." Details the live page no longer shows: a Discord invite was live at launch, the company branded itself "a Moonshot company" (moonshotai.org, Substack @moonshotai), it shipped Alpaca + SlimOrca Colab notebooks and a LAION Kaggle notebook on day one, and it closed with an explicit bootstrapping/investment ask ("We're trying to bootstrap our startup ... willing to chat investment - email us!"). The launch perf tables: Alpaca on one T4 cut 23h15m (HF) to 2h34m (Max), 8.8x; SlimOrca 391h to 51h, 7.6x; 2x T4 DDP on LAION Chip2 164h to 5h, 31x; memory on Open Assistant cut 16.7GB to 6.9GB on an A10 (59% less).

Note: the X account @UnslothAI was created 2023-11-30 (account ID 1730159888402395136), confirming the coordinated Nov 30 - Dec 1 launch window.

Launch sequence

2023-11-30
Soft launch. Show HN + "30x faster" link both flop (2-3 points). X account created same day. (HN objectIDs 38475955, 38474791.)
2023-12-01
The breakout. danielhanchen re-posts a reframed Show HN, Show HN: 80% faster, 50% less memory, 0% loss of accuracy Llama finetuning (objectID 38487199). This hit 385 points and 119 comments and reached the HN front page. Notable: the headline dropped the suspicious "30x" hero number (which had drawn skepticism) and led instead with the credible open-source numbers (80% faster / 50% less memory / 0% accuracy loss). Comments were heavily skeptical of the 30x paid claim and debated the open-core model; Daniel engaged extensively in the thread, explaining the Hyperlearn backstory (big companies used his prior open-source work uncredited) and the kernel-level optimizations. Two same-day reshares by others (Tomte, bratao) also appeared; the bratao repost separately drew 132 points.
2023-12-14 / 2023-12-16
Follow-on technical posts by Daniel: Reducing FLOPs for transformers and "CodeLlama-34B 13x faster finetuning" (both low traction on HN, 1-2 points). Build-in-public cadence of benchmark posts begins.
2024-01-10
Major earned-media moment: official Hugging Face blog post Make LLM Fine-tuning 2x faster with Unsloth and TRL, authored by the Unsloth team but published on HF's own blog and crediting deep compatibility with transformers/PEFT/TRL (SFTTrainer, DPOTrainer, PPOTrainer). Hero numbers: up to 2.7x faster, up to 74% memory reduction, 0% accuracy loss across 59 benchmark runs. This put Unsloth in front of the entire HF ecosystem audience.
2024-04 onward
"Day-of" model support cadence begins. Unsloth ships finetuning/GGUF support for new open models (Llama 3, etc.) on or near release day, capturing the search and social traffic of each model launch. Reflected in their Substack ("Llama 3 is now on Unsloth," "Unsloth December Update").
2024-05
Selected for the 2024 GitHub Accelerator (11 open-source AI projects). Institutional credibility + GitHub-channel distribution.
2024-06 → S24
Accepted into Y Combinator Summer 2024. Funding: a ~$500K seed (Sept 2024) with backers including GitHub Accelerator, Microsoft M12, Transpose Platform, Redpoint, Samsung NEXT (per Crunchbase/secondary sources; see Gaps).
2024-08-12
Milestone-marketing tweet, 2 million monthly downloads on Hugging Face, publicly celebrating traction and crediting the community + model teams.
2025-01-27
The second, larger inflection. Blog Run DeepSeek R1 Dynamic 1.58-bit + tweet Introducing 1.58bit DeepSeek-R1 GGUFs: they shrank the 671B DeepSeek-R1 from 720GB to 131GB (80% reduction) while keeping it functional (69.2% on their Flappy Bird benchmark vs 0% for naive quant). Timed to ride the global DeepSeek-R1 news wave. Credited llama.cpp, Bartowski's imatrix, Open WebUI, Ollama. Spawned heavy r/LocalLLaMA and press coverage (GIGAZINE, Medium guides).
2025-02 (around 2025-02-20)
Train your own R1 reasoning model locally with GRPO: made GRPO work with QLoRA/LoRA, reproducing R1-Zero's "aha moment" on 7GB (then 5GB) VRAM, 80% less GRPO memory. Rode the reasoning-model hype cycle.
2025-08
Fine-tune gpt-oss with Unsloth, day-of support for OpenAI's open-weight gpt-oss (14GB VRAM, free Colab); some of their fixes were pushed upstream into OpenAI's official HF model. Positions Unsloth as the default finetuning path for major new open models.
2026-02-28
Unsloth Dynamic 2.0 GGUFs hits HN front page (237 points, 68 comments).
2026-03-17
Unsloth Studio launch, a no-code local web UI for training/running/exporting models. HN story hit 388 points / 82 comments (top-voted Unsloth item ever); a companion Show HN by danielhanchen also posted same day. Marks the pivot from "library" to "product."
2026-05-07
Making LLM Training Faster with Unsloth and NVIDIA, an NVIDIA co-marketing collaboration (129 points on HN). Plus a PyTorch Ecosystem inclusion. Both are institutional-credibility plays.

Channels & accounts

GitHub: unslothai/unsloth, ~66.5K stars, ~6K forks, 200+ contributors. The primary owned asset and the main growth metric. They also run a fork unslothai/llama.cpp.
X / Twitter: @UnslothAI, ~68,841 followers (created 2023-11-30). Founder amp account @danielhanchen. Current per-tweet engagement is strong: June 2026 posts routinely hit 150K-210K views and 700-2,200 likes (e.g., Gemma 4 MTP post 2,159 likes / 212K views; DiffusionGemma post 1,723 likes / 162K views).
Hugging Face: huggingface.co/unsloth, ~24,965 followers, ~1,346 models. Individual GGUF/4-bit model repos pull huge download volume (e.g., a Qwen3.6 GGUF at ~800K downloads). HF is arguably their highest-leverage distribution channel: every model they re-upload is a discovery surface.
Website / blog: unsloth.ai and unsloth.ai/blog, plus extensive docs. Tutorial-style "how to run X locally" docs double as SEO landing pages.
Substack newsletter: unslothai.substack.com (model-launch and monthly-update posts).
Colab notebooks: free, copy-pasteable finetuning notebooks per model are a core distribution mechanism (referenced across blog/docs and conference talks).
YouTube / conference presence: Daniel Han gives talks (e.g., aiDotEngineer Advanced RL/Kernels/Reasoning workshop; "Can RL Lead to AGI?" talk).
Reddit: no official subreddit; presence is organic via r/LocalLLaMA (community-driven, not an owned channel). See Gaps.
Discord / LinkedIn: Daniel maintains LinkedIn; a community Discord is referenced but its size was not confirmed (see Gaps).

Amplification & KOLs

Hugging Face (lab/platform): earned, then institutional. HF published the Unsloth+TRL post on its own blog (2024-01-10), an outsized endorsement for an unknown project. Ongoing: HF org status and being among the most-followed orgs amplifies every model upload.
Open-model labs reposting / being reposted: Unsloth's day-of GGUF uploads create a symbiotic amplification loop with model teams. Observed in June 2026 scrape: Unsloth quote-tweets and is engaged by @MiniMax_AI, @googlegemma / Google DeepMind, when their models drop. Mix of organic and earned.
DeepSeek news wave (2025-01): not a single KOL, but the global DeepSeek-R1 moment was the carrier; r/LocalLLaMA, Medium writers, and tech press (GIGAZINE) amplified the 1.58-bit work organically because it solved a real "run R1 at home" problem the whole community wanted.
NVIDIA: co-marketing collaboration (2026-05), earned/partnership. PyTorch Ecosystem inclusion, earned/institutional.
GitHub / YC / Microsoft M12: accelerator and batch selection are earned third-party validation that fed distribution and credibility.; No evidence of paid influencer or paid-ads amplification was found. The pattern is consistently earned + organic, driven by genuinely useful releases.

Traction inflection

Primary inflection (the breakout from zero): the 2023-12-01 reframed Show HN, Show HN: 80% faster, 50% less memory, 0% loss of accuracy Llama finetuning, 385 points / 119 comments, HN front page. This is the single action that most plausibly converted a flopped Nov 30 launch into visible traction. Evidence: the Nov 30 launch with the "30x faster" hero number scored 2-3 points and died; literally the next day, the same founder re-posted the same product with a more credible, skepticism-proof headline (dropping 30x, leading with the verifiable open-source numbers) and it hit the front page with 385 points. The A/B is almost clean: same product, same week, only the framing changed. Confidence: high that this was the from-zero ignition, because the before/after is so tightly controlled and HN is a measurable surface. (Caveat: HN front page alone bought attention and early stars, not the eventual 60K+; it lit the fuse.)

Primary inflection (the breakout to scale / mainstream): the 2025-01-27 DeepSeek-R1 Dynamic 1.58-bit release. Evidence: the star-history curve shows minimal slope through most of 2024 and a clear steepening from late 2024 into 2025; the 1.58-bit work landed at the exact peak of global DeepSeek-R1 attention, solved the universally-wanted "run 671B R1 at home" problem, and generated a wave of r/LocalLLaMA threads, Medium tutorials, and press (GIGAZINE) plus their own viral tweet. The signal: trend-timing + a hero artifact (720GB → 131GB, with a benchmark proving it still works) riding the biggest open-model news event of the period. Confidence: medium-high. The star curve and the contemporaneous press/Reddit wave strongly support a 2025 step-change, and DeepSeek-R1 is the most plausible specific cause, and the DeepSeek-R1 launch tweet's engagement is now recovered: @UnslothAI "Introducing 1.58bit DeepSeek-R1 GGUFs" (2025-01-27 15:25 UTC) drew 687,401 views, 3,434 likes, 578 reposts, 158 quotes, 128 replies, and 1,841 bookmarks (scraped via Apify 2026-06-16), an order-of-magnitude spike over their baseline and strong corroboration that the DeepSeek-R1 work drove the scale inflection. The only remaining unknown is the precise per-week GitHub star delta.

Net read: two distinct inflections. HN (Dec 2023) took them from unknown to "known in the LLM-tinkerer niche"; DeepSeek-R1 1.58-bit (Jan 2025) took them from niche-popular to broadly known and is the more likely driver of the steep star ramp. The repeatable engine underneath both is "be first and best to make the hottest new open model runnable/trainable on consumer hardware, then publish a hero number."

Techniques & tactics

Hero-number headlines (Xx faster, Y% less memory, 0% accuracy loss) as the lead in every announcement.
Headline A/B / reframing on relaunch (dropped the skepticism-magnet "30x" for credible "80% / 50% / 0%" on HN day 2).
Show HN as the ignition channel; heavy founder participation in the comment thread (transparency, technical depth, backstory).
Open-core / open-source-as-distribution: free library drives adoption; the GitHub star count becomes the public scoreboard.
Build-in-public benchmark cadence (regular blog/X posts with reproducible benchmarks).
Trend-surfing / newsjacking: day-of support for each major open model (Llama 3, DeepSeek-R1, gpt-oss, Gemma, Qwen, MiniMax), capturing each launch's attention wave.
Free Colab notebooks as zero-friction onboarding and shareable artifacts.
Distribution via the platform that already has the audience: re-uploading optimized GGUF/4-bit models to Hugging Face, turning each into a discovery + download surface.
Hero-artifact engineering as marketing (the 1.58-bit DeepSeek-R1 quant; Dynamic 2.0 GGUFs) with a benchmark to prove it isn't broken.
Symbiotic amplification with model labs (quote-tweet loops with MiniMax, Google Gemma/DeepMind).
Earned media via the ecosystem incumbent (Hugging Face publishing their post; later NVIDIA co-marketing, PyTorch Ecosystem).
Credibility stacking: YC S24, GitHub Accelerator, Microsoft M12, NVIDIA, PyTorch.
Founder-as-technical-authority (Daniel Han's public bug-fixing of Gemma/Llama/Mistral/Phi, conference talks, NVIDIA/Hyperlearn pedigree).
Milestone-marketing tweets (2M downloads, etc.) to manufacture momentum narrative.
Docs-as-SEO ("How to run/finetune X locally" tutorial pages).
Product expansion to widen the funnel (Unsloth Studio, the no-code UI, 2026).