The Kaitchup – AI on a Budget
Subscribe
Sign in
Home
Notes
AI Notebooks
The Kaitchup's Book
Weekly Kaitchup
Tutorials
Archive
About
Latest
Top
Discussions
vLLM vs Ollama: How They Differ and When To Use Them
With Examples of Offline and Online Inference
Jul 7
•
Benjamin Marie
23
Share this post
The Kaitchup – AI on a Budget
vLLM vs Ollama: How They Differ and When To Use Them
Copy link
Facebook
Email
Notes
More
The Weekly Kaitchup #99
IFBench - ERNIE 4.5 - Gemma 3n
Jul 4
•
Benjamin Marie
4
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #99
Copy link
Facebook
Email
Notes
More
June 2025
Gemma 3n: Fine-Tuning, Inference, and Submodel Extraction
Running Gemma 3n with vLLM and fine-tuning with TRL
Jun 30
•
Benjamin Marie
9
Share this post
The Kaitchup – AI on a Budget
Gemma 3n: Fine-Tuning, Inference, and Submodel Extraction
Copy link
Facebook
Email
Notes
More
The Weekly Kaitchup #98
Survey - Mistral 3.2 - And More News
Jun 27
•
Benjamin Marie
3
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #98
Copy link
Facebook
Email
Notes
More
Get the Best from GGUF Models: Optimize Your Inference Hyperparameters
The default hyperparameters are suboptimal for quantized models
Jun 23
•
Benjamin Marie
9
Share this post
The Kaitchup – AI on a Budget
Get the Best from GGUF Models: Optimize Your Inference Hyperparameters
Copy link
Facebook
Email
Notes
More
2
The Weekly Kaitchup #97
Survey - Axolotl + LLM Compressor
Jun 20
•
Benjamin Marie
4
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #97
Copy link
Facebook
Email
Notes
More
2
RAG with Qwen3 Embedding and Qwen3 Reranker
How to use embedding and reranker models to efficiently retrieve only the most relevant chunks or documents given a user query
Jun 19
•
Benjamin Marie
19
Share this post
The Kaitchup – AI on a Budget
RAG with Qwen3 Embedding and Qwen3 Reranker
Copy link
Facebook
Email
Notes
More
4
RTX 6000 Pro vs H100 & A100: Best Single-GPU Choice for Fast, Low-Cost LLM Fine-Tuning
Faster, cheaper single-GPU training
Jun 16
•
Benjamin Marie
10
Share this post
The Kaitchup – AI on a Budget
RTX 6000 Pro vs H100 & A100: Best Single-GPU Choice for Fast, Low-Cost LLM Fine-Tuning
Copy link
Facebook
Email
Notes
More
1
The Weekly Kaitchup #96
Magistral - Text-to-LoRA
Jun 13
•
Benjamin Marie
7
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #96
Copy link
Facebook
Email
Notes
More
1
Fine-Tuning 2-Bit Qwen3 Models on Your Computer
Code and best practices
Jun 9
•
Benjamin Marie
11
Share this post
The Kaitchup – AI on a Budget
Fine-Tuning 2-Bit Qwen3 Models on Your Computer
Copy link
Facebook
Email
Notes
More
The Weekly Kaitchup #95
Qwen3 Embeddings/Reranker - Packing Improved - Unsloth's Notebooks - SGLang vs. vLLM
Jun 6
•
Benjamin Marie
7
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #95
Copy link
Facebook
Email
Notes
More
Qwulu 3: Fine-Tuning Qwen3 Base with LoRA and TULU 3's Supervised Fine-Tuning Recipe
Can a supervised fine-tuning recipe that works effectively on Llama 3.1 be applied directly to Qwen3?
Jun 5
•
Benjamin Marie
6
Share this post
The Kaitchup – AI on a Budget
Qwulu 3: Fine-Tuning Qwen3 Base with LoRA and TULU 3's Supervised Fine-Tuning Recipe
Copy link
Facebook
Email
Notes
More
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts