The Kaitchup – AI on a Budget
Subscribe
Sign in
Home
Notes
AI Notebooks
The Kaitchup's Book
Weekly Kaitchup
Tutorials
Archive
About
Latest
Top
Discussions
The Weekly Kaitchup #100
The Kaitchup, Year 3
Jul 11
•
Benjamin Marie
4
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #100
Copy link
Facebook
Email
Notes
More
Better Packing for Fine-Tuning LLMs with the First Fit Decreasing (FFD) Strategy
No more over-segmentation, no more cross-contamination
Jul 10
•
Benjamin Marie
7
Share this post
The Kaitchup – AI on a Budget
Better Packing for Fine-Tuning LLMs with the First Fit Decreasing (FFD) Strategy
Copy link
Facebook
Email
Notes
More
vLLM vs Ollama: How They Differ and When To Use Them
With Examples of Offline and Online Inference
Jul 7
•
Benjamin Marie
27
Share this post
The Kaitchup – AI on a Budget
vLLM vs Ollama: How They Differ and When To Use Them
Copy link
Facebook
Email
Notes
More
The Weekly Kaitchup #99
IFBench - ERNIE 4.5 - Gemma 3n
Jul 4
•
Benjamin Marie
5
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #99
Copy link
Facebook
Email
Notes
More
June 2025
Gemma 3n: Fine-Tuning, Inference, and Submodel Extraction
Running Gemma 3n with vLLM and fine-tuning with TRL
Jun 30
•
Benjamin Marie
9
Share this post
The Kaitchup – AI on a Budget
Gemma 3n: Fine-Tuning, Inference, and Submodel Extraction
Copy link
Facebook
Email
Notes
More
The Weekly Kaitchup #98
Survey - Mistral 3.2 - And More News
Jun 27
•
Benjamin Marie
3
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #98
Copy link
Facebook
Email
Notes
More
Get the Best from GGUF Models: Optimize Your Inference Hyperparameters
The default hyperparameters are suboptimal for quantized models
Jun 23
•
Benjamin Marie
9
Share this post
The Kaitchup – AI on a Budget
Get the Best from GGUF Models: Optimize Your Inference Hyperparameters
Copy link
Facebook
Email
Notes
More
2
The Weekly Kaitchup #97
Survey - Axolotl + LLM Compressor
Jun 20
•
Benjamin Marie
4
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #97
Copy link
Facebook
Email
Notes
More
2
RAG with Qwen3 Embedding and Qwen3 Reranker
How to use embedding and reranker models to efficiently retrieve only the most relevant chunks or documents given a user query
Jun 19
•
Benjamin Marie
20
Share this post
The Kaitchup – AI on a Budget
RAG with Qwen3 Embedding and Qwen3 Reranker
Copy link
Facebook
Email
Notes
More
4
RTX 6000 Pro vs H100 & A100: Best Single-GPU Choice for Fast, Low-Cost LLM Fine-Tuning
Faster, cheaper single-GPU training
Jun 16
•
Benjamin Marie
10
Share this post
The Kaitchup – AI on a Budget
RTX 6000 Pro vs H100 & A100: Best Single-GPU Choice for Fast, Low-Cost LLM Fine-Tuning
Copy link
Facebook
Email
Notes
More
1
The Weekly Kaitchup #96
Magistral - Text-to-LoRA
Jun 13
•
Benjamin Marie
7
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #96
Copy link
Facebook
Email
Notes
More
1
Fine-Tuning 2-Bit Qwen3 Models on Your Computer
Code and best practices
Jun 9
•
Benjamin Marie
11
Share this post
The Kaitchup – AI on a Budget
Fine-Tuning 2-Bit Qwen3 Models on Your Computer
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts