The Kaitchup – AI on a Budget
Subscribe
Sign in
Home
Notes
AI Notebooks
The Kaitchup's Book
Weekly Kaitchup
Tutorials
Archive
About
Weekly Kaitchup
Latest
Top
Discussions
New DiffusionGemma and MoQ GGUFs for Gemma 4 12B and LFM2.5 8B A1B
The Weekly Kaitchup #146
Jun 12
•
Benjamin Marie
8
1
1
MoQ GGUFs and GSQ: Low-Bit GGUFs Are About to Get Much Better
The Weekly Kaitchup #145
Jun 5
•
Benjamin Marie
8
Qwen3.5 9B MoQ: Inside a Strong 3.6-bit GGUF
The Weekly Kaitchup #144
May 29
•
Benjamin Marie
12
Gated DeltaNet-2: Better Memory Editing for Linear Attention
The Weekly Kaitchup #143
May 22
•
Benjamin Marie
4
SlimQwen Compression, Elastic Models, and Aurora Optimization
The Weekly Kaitchup #142
May 15
•
Benjamin Marie
9
1
MTP Layers for Gemma 4 and My Projects in Progress
The Weekly Kaitchup #141
May 8
•
Benjamin Marie
10
7
1
Nemotron 3 Omni Explained: Architecture, Training, and How to Run It
The Weekly Kaitchup #140
May 1
•
Benjamin Marie
7
2
Summary of Qwen3.6 GGUF Evals
The Weekly Kaitchup #139
Apr 24
•
Benjamin Marie
20
1
1
DFlash for Qwen3.5, EAGLE for Gemma 4, and the MiniMax M2.7 License Debate
The Weekly Kaitchup #138
Apr 17
•
Benjamin Marie
11
3
GLM 5.1 Is Here, MiniMax M2.7 and Qwen3.6 Are Coming Soon!
The Weekly Kaitchup #137
Apr 10
•
Benjamin Marie
8
4
Gemma 4 31B and 26B A4B: Architecture and Memory Consumption
The Weekly Kaitchup #136
Apr 3
•
Benjamin Marie
11
1
TurboQuant: Finally, Fast and Widely Available Low-Bit KV Cache Quantization?
The Weekly Kaitchup #135
Mar 27
•
Benjamin Marie
10
7
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts