The Kaitchup – AI on a Budget
Subscribe
Sign in
Home
Notes
Chat
Start Here
AI Notebooks
The Kaitchup's Book
Weekly Kaitchup
Tutorials
The Kaitchup Index
Archive
About
Weekly Kaitchup
Latest
Top
Discussions
This Week: GLM 4.7 Flash's Huge KV Cache and LFM2.5 Thinking
The Weekly Kaitchup #127
Jan 23
•
Benjamin Marie
8
MMLU-Pro Has an Answer Leak (and It’s Just Whitespace)
The Weekly Kaitchup #126
Jan 16
•
Benjamin Marie
6
LFM2.5 and Falcon H1R-7B: New Hybrid Models with Strong Benchmark Scores
The Weekly Kaitchup #125
Jan 9
•
Benjamin Marie
4
3
2
2026 Predictions: Much Faster Inference, Pre-Training with RL, and FP4 Everywhere
The Weekly Kaitchup #124
Jan 2
•
Benjamin Marie
12
Encoder–Decoders and Byte LLMs: T5Gemma 2 and AI2’s New Models
The Weekly Kaitchup #123
Dec 19, 2025
•
Benjamin Marie
8
2
2
Notes on RNJ-1, K2-V2, Devstral 2, and GLM-4.6V
The Weekly Kaitchup #122
Dec 12, 2025
•
Benjamin Marie
1
Mistral Large 3: Not a Reasoning Model
The Weekly Kaitchup #121
Dec 6, 2025
•
Benjamin Marie
5
Scaling RL and Self-Verifiable Reasoning: INTELLECT-3 and DeepSeekMath-V2
The Weekly Kaitchup #120
Nov 28, 2025
•
Benjamin Marie
3
Olmo 3 Is Here!
The Weekly Kaitchup #119
Nov 21, 2025
•
Benjamin Marie
7
5
The Limits of GRPO-like Methods for Reinforcement Learning
The Weekly Kaitchup #118
Nov 14, 2025
•
Benjamin Marie
5
BF16 vs FP16 for Reinforcement Learning: Where Are We?
The Weekly Kaitchup #117
Nov 7, 2025
•
Benjamin Marie
4
MiniMax M2 and Kimi-Linear: Why Full Attention Still Wins
The Weekly Kaitchup #116
Oct 31, 2025
•
Benjamin Marie
4
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts