The Kaitchup – AI on a Budget
Tutorials
Local Agentic AI with smolagents and Qwen2.5 Coder
When to use it and when does it fail
3 hrs ago • Benjamin Marie
Deploy Your Fine-Tuned LoRA Adapters with Ollama
Probably the easiest way to run adapters offline and online
Dec 30, 2024 • Benjamin Marie
Fast and Memory-Efficient Text-to-SQL with Qwen2.5 Coder 32B Instruct on Your GPU
Quantization and prompting with vLLM
Dec 23, 2024 • Benjamin Marie
Schedule-Free Optimizer: Does It Work for LLMs?
Experiments with Llama 3.2: schedule-free vs. standard AdamW
Dec 16, 2024 • Benjamin Marie
Fine-Tuning Llama 3.3 70B with a Single GPU
And how to fix a poorly accurate 2-bit model
Dec 12, 2024 • Benjamin Marie
Quantize and Run Llama 3.3 70B Instruct on Your GPU
4-bit👍, 3-bit👎, and 2-bit👎 quantization
Dec 9, 2024 • Benjamin Marie
LLM Alignment: Searching for Optimal ORPO Hyperparameters
Higher learning rate and beta
Dec 2, 2024 • Benjamin Marie
The Recipe for Extremely Accurate and Cheap Quantization of 70B+ LLMs
Cost and accuracy for quantizing large models to 4-bit and 2-bit
Nov 25, 2024 • Benjamin Marie
DPO Full Training vs. LoRA: How Good is LoRA for DPO Training?
One model, two adapters
Nov 18, 2024 • Benjamin Marie
Torch Compile: 2x Faster Llama 3.2 with Low Effort
But it will depend on your GPU
Nov 11, 2024 • Benjamin Marie
LLM as a Judge: Evaluate Your LLMs with Another LLM
A good evaluation framework for quick feedback and monitoring
Nov 7, 2024 • Benjamin Marie
Llama 3.2 Embeddings: Training and Evaluation with LLM2Vec
A step-by-step tutorial
Nov 4, 2024 • Benjamin Marie