The Kaitchup – AI on a Budget
The Weekly Kaitchup #100
The Kaitchup, Year 3
Weekly tutorials, tips, and news on fine-tuning, running, and serving large language models on your computer. The Kaitchup also publishes two new AI notebooks every week.
Recent posts
Better Packing for Fine-Tuning LLMs with the First Fit Decreasing (FFD) Strategy
No more over-segmentation, no more cross-contamination
Jul 10 • Benjamin Marie
vLLM vs Ollama: How They Differ and When To Use Them
With Examples of Offline and Online Inference
Jul 7 • Benjamin Marie
The Weekly Kaitchup #99
IFBench - ERNIE 4.5 - Gemma 3n
Jul 4 • Benjamin Marie
Gemma 3n: Fine-Tuning, Inference, and Submodel Extraction
Running Gemma 3n with vLLM and fine-tuning with TRL
Jun 30 • Benjamin Marie
The Weekly Kaitchup #98
Survey - Mistral 3.2 - And More News
Jun 27 • Benjamin Marie
Get the Best from GGUF Models: Optimize Your Inference Hyperparameters
The default hyperparameters are suboptimal for quantized models
Jun 23 • Benjamin Marie
Top posts
Multimodal RAG with ColPali and Qwen2-VL on Your Computer
Sep 16, 2024 • Benjamin Marie
RAG with Qwen3 Embedding and Qwen3 Reranker
Jun 19 • Benjamin Marie
LoRA Adapters: When a Naive Merge Leads to Poor Performance
Sep 7, 2023 • Benjamin Marie
GRPO: Train LLMs with DeepSeek-R1's Reinforcement Learning Method
Feb 10 • Benjamin Marie
Combine Multiple LoRA Adapters for Llama 2
Nov 27, 2023 • Benjamin Marie
Recommendations
AI Horizon Forecast
Nikos Kafritsas
💎DiamantAI
Nir Diamant
AI Tidbits
Sahar Mor
Artificial Ignorance
Charlie Guo
Why Try AI
Daniel Nest
150+ AI notebooks now, plus 2 new ones each week.
Tutorials
Gemma 3n: Fine-Tuning, Inference, and Submodel Extraction
Running Gemma 3n with vLLM and fine-tuning with TRL
Jun 30 • Benjamin Marie
RAG with Qwen3 Embedding and Qwen3 Reranker
How to use embedding and reranker models to efficiently retrieve only the most relevant chunks or documents given a user query
Jun 19 • Benjamin Marie
RTX 6000 Pro vs H100 & A100: Best Single-GPU Choice for Fast, Low-Cost LLM Fine-Tuning
Faster, cheaper single-GPU training
Jun 16 • Benjamin Marie
Fine-Tuning 2-Bit Qwen3 Models on Your Computer
Code and best practices
Jun 9 • Benjamin Marie
Qwulu 3: Fine-Tuning Qwen3 Base with LoRA and TULU 3's Supervised Fine-Tuning Recipe
Can a supervised fine-tuning recipe that works effectively on Llama 3.1 be applied directly to Qwen3?
Jun 5 • Benjamin Marie