The Kaitchup – AI on a Budget

The Kaitchup – AI on a Budget

Home
Notes
AI Notebooks
The Kaitchup's Book
Weekly Kaitchup
Tutorials
Archive
About

Weekly Kaitchup

New DiffusionGemma and MoQ GGUFs for Gemma 4 12B and LFM2.5 8B A1B
The Weekly Kaitchup #146
Jun 12 • Benjamin Marie
MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better
The Weekly Kaitchup #145
Jun 5 • Benjamin Marie
Qwen3.5 9B MoQ: Inside a Strong 3.6-bit GGUF
The Weekly Kaitchup #144
May 29 • Benjamin Marie
Gated DeltaNet-2: Better Memory Editing for Linear Attention
The Weekly Kaitchup #143
May 22 • Benjamin Marie
SlimQwen Compression, Elastic Models, and Aurora Optimization
The Weekly Kaitchup #142
May 15 • Benjamin Marie
MTP Layers for Gemma 4 and My Projects in Progress
The Weekly Kaitchup #141
May 8 • Benjamin Marie
Nemotron 3 Omni Explained: Architecture, Training, and How to Run It
The Weekly Kaitchup #140
May 1 • Benjamin Marie
Summary of Qwen3.6 GGUF Evals
The Weekly Kaitchup #139
Apr 24 • Benjamin Marie
DFlash for Qwen3.5, EAGLE for Gemma 4, and the MiniMax M2.7 License Debate
The Weekly Kaitchup #138
Apr 17 • Benjamin Marie
GLM 5.1 Is Here, MiniMax M2.7 and Qwen3.6 Are Coming Soon!
The Weekly Kaitchup #137
Apr 10 • Benjamin Marie
Gemma 4 31B and 26B A4B: Architecture and Memory Consumption
The Weekly Kaitchup #136
Apr 3 • Benjamin Marie
TurboQuant: Finally, Fast and Widely Available Low-Bit KV Cache Quantization?
The Weekly Kaitchup #135
Mar 27 • Benjamin Marie
© 2026 The Kaitchup · Privacy ∙ Terms ∙ Collection notice
Start your SubstackGet the app
Substack is the home for great culture