The Kaitchup – AI on a Budget
Subscribe
Sign in
Home
Notes
Start Here
AI Notebooks
The Kaitchup's Book
Weekly Kaitchup
Tutorials
The Kaitchup Index
Archive
About
Weekly Kaitchup
Latest
Top
Discussions
2026 Predictions: Much Faster Inference, Pre-Training with RL, and FP4 Everywhere
The Weekly Kaitchup #124
Jan 2
•
Benjamin Marie
8
Encoder–Decoders and Byte LLMs: T5Gemma 2 and AI2’s New Models
The Weekly Kaitchup #123
Dec 19, 2025
•
Benjamin Marie
7
2
2
Notes on RNJ-1, K2-V2, Devstral 2, and GLM-4.6V
The Weekly Kaitchup #122
Dec 12, 2025
•
Benjamin Marie
1
Mistral Large 3: Not a Reasoning Model
The Weekly Kaitchup #121
Dec 6, 2025
•
Benjamin Marie
5
Scaling RL and Self-Verifiable Reasoning: INTELLECT-3 and DeepSeekMath-V2
The Weekly Kaitchup #120
Nov 28, 2025
•
Benjamin Marie
3
Olmo 3 Is Here!
The Weekly Kaitchup #119
Nov 21, 2025
•
Benjamin Marie
7
5
The Limits of GRPO-like Methods for Reinforcement Learning
The Weekly Kaitchup #118
Nov 14, 2025
•
Benjamin Marie
5
BF16 vs FP16 for Reinforcement Learning: Where Are We?
The Weekly Kaitchup #117
Nov 7, 2025
•
Benjamin Marie
4
MiniMax M2 and Kimi-Linear: Why Full Attention Still Wins
The Weekly Kaitchup #116
Oct 31, 2025
•
Benjamin Marie
4
The Weekly Kaitchup #115
Hi Everyone,
Oct 24, 2025
•
Benjamin Marie
5
1
DGX Spark: Use It for Fine-Tuning
The Weekly Kaitchup #114
Oct 17, 2025
•
Benjamin Marie
9
1
3
Tiny Recursive Models for Very Specific Problems
The Weekly Kaitchup #113
Oct 11, 2025
•
Benjamin Marie
8
1
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts