The Kaitchup – AI on a Budget
2-bit VPTQ: 6.5x Smaller LLMs, >95% Accuracy
Very accurate 2-bit quantization for running 70B LLMs on a 24 GB GPU
Jan 27 • Benjamin Marie
4
2
The Weekly Kaitchup #76
GRPO/TRL - SmolVLM - Coconut
Jan 24 • Benjamin Marie
5
DeepSeek-R1: Reinforcement Learning with Simple and Verifiable Rewards
Qwen2.5 and Llama 3.x are good students
Jan 22 • Benjamin Marie
9
1
Estimating Memory Usage for LLMs During Inference (V2)
KV cache, GQA, FlashAttention, activations, batching...
Jan 20 • Benjamin Marie
7
6
The Weekly Kaitchup #75
MiniMax-01 - Qwen2.5 PRM - Kokoro and OuteTTS
Jan 17 • Benjamin Marie
5
MMLU: Do LLMs Really Know?
When 1. = 1. = A. = A. = a. = A) disrupts LLM's world knowledge and language understanding capabilities
Jan 16 • Benjamin Marie
5
3
Local Agentic AI with smolagents and Qwen2.5 Coder
When to use it and when does it fail
Jan 13 • Benjamin Marie
9
3
The Weekly Kaitchup #74
Open Phi-4 - OLMo 2 Paper
Jan 10 • Benjamin Marie
4
RTX 50 and DIGITS: What Does It Mean for Local AI?
Fine-tuning a 2-bit Llama 4 70B with a single consumer GPU
Jan 8 • Benjamin Marie
8
1
DeepSeek-V3: Understanding and Running the Best Open LLM Locally
A huge but efficient MoE
Jan 6 • Benjamin Marie
13
1
The Weekly Kaitchup #73
2025 - Smolagents - Bamba
Jan 3 • Benjamin Marie
11
1
Reasoning with QwQ and QvQ on Your Computer
When "preview" becomes meaningful
Jan 2 • Benjamin Marie
7