The Kaitchup – AI on a Budget
Subscribe
Sign in
Home
Notes
AI Notebooks
The Kaitchup Pro
The Kaitchup's Book
AI Toolboxes
Tutorials
Models
Archive
About
Latest
Top
Discussions
Phi-4 Multimodal: A Mixture of Audio and Vision LoRA Adapters
A multimodal mixture of LoRA
23 hrs ago
•
Benjamin Marie
4
Share this post
The Kaitchup – AI on a Budget
Phi-4 Multimodal: A Mixture of Audio and Vision LoRA Adapters
Copy link
Facebook
Email
Notes
More
February 2025
The Weekly Kaitchup #81
DeepSeek’s #OpenSourceWeek - olmOCR - Phi-4 Mini
Feb 28
•
Benjamin Marie
3
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #81
Copy link
Facebook
Email
Notes
More
Don’t Let QLoRA Merging Undo Your Fine-Tuning Work
Revisiting an old recipe with today's frameworks.
Feb 24
•
Benjamin Marie
8
Share this post
The Kaitchup – AI on a Budget
Don’t Let QLoRA Merging Undo Your Fine-Tuning Work
Copy link
Facebook
Email
Notes
More
The Weekly Kaitchup #80
MoBA - Step-Video-T2V - PaliGemma 2 Mix
Feb 21
•
Benjamin Marie
6
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #80
Copy link
Facebook
Email
Notes
More
vLLM and Zero-Shot for Low-Cost LLM Evaluation
How to reduce the cost of your LLM evaluations
Feb 19
•
Benjamin Marie
5
Share this post
The Kaitchup – AI on a Budget
vLLM and Zero-Shot for Low-Cost LLM Evaluation
Copy link
Facebook
Email
Notes
More
Mistral Small 3: An Excellent 24B-Parameter Wide-Shallow LLM
Fine-tuning, quantization, and evaluation
Feb 17
•
Benjamin Marie
6
Share this post
The Kaitchup – AI on a Budget
Mistral Small 3: An Excellent 24B-Parameter Wide-Shallow LLM
Copy link
Facebook
Email
Notes
More
1
The Weekly Kaitchup #79
DeepScaleR-1.5B-Preview - TULU 3.1 - OpenR1 Math
Feb 14
•
Benjamin Marie
4
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #79
Copy link
Facebook
Email
Notes
More
"Thinking" LLMs with Simple Fine-tuning and Budget Forcing
How to activate "reasoning" in LLMs
Feb 13
•
Benjamin Marie
10
Share this post
The Kaitchup – AI on a Budget
"Thinking" LLMs with Simple Fine-tuning and Budget Forcing
Copy link
Facebook
Email
Notes
More
5
GRPO: Train LLMs with DeepSeek-R1's Reinforcement Learning Method
With a single consumer GPU!
Feb 10
•
Benjamin Marie
16
Share this post
The Kaitchup – AI on a Budget
GRPO: Train LLMs with DeepSeek-R1's Reinforcement Learning Method
Copy link
Facebook
Email
Notes
More
5
The Weekly Kaitchup #78
s1 - SmolLM2 Trainig Recipe - TULU 3 405B
Feb 7
•
Benjamin Marie
5
Share this post
The Kaitchup – AI on a Budget
The Weekly Kaitchup #78
Copy link
Facebook
Email
Notes
More
Qwen2.5-VL: What's New and How Good Are They for Chart and Table Analysis
Can Qwen2.5-VL analyze cosmology data without any context?
Feb 5
•
Benjamin Marie
15
Share this post
The Kaitchup – AI on a Budget
Qwen2.5-VL: What's New and How Good Are They for Chart and Table Analysis
Copy link
Facebook
Email
Notes
More
Fine-Tuning Your LLM to "Think" Like DeepSeek R1, on Your Computer
Experiments with SFT, Llama 3.2 3B, and Training Data Generated by DeepSeek R1
Feb 3
•
Benjamin Marie
7
Share this post
The Kaitchup – AI on a Budget
Fine-Tuning Your LLM to "Think" Like DeepSeek R1, on Your Computer
Copy link
Facebook
Email
Notes
More
6
Share
Copy link
Facebook
Email
Notes
More
This site requires JavaScript to run correctly. Please
turn on JavaScript
or unblock scripts