

RAG with Qwen3 Embedding and Qwen3 Reranker
How to use embedding and reranker models to efficiently retrieve only the most relevant chunks or documents given a user query

The Kaitchup – AI on a Budget
Weekly tutorials, tips, and news on fine-tuning, running, and serving large language models on your computer. The Kaitchup also publishes two new AI notebooks every week.