Table of Contents

Fine-tuning

LoRA and QLoRA

Direct Preference Optimization (DPO) and Identity Preference Optimization (IPO)

Reinforcement Learning from Human Feedback (RLHF)

Optimization

Quantization

AQLM

GPTQ

AWQ

bitsandbytes NF4

ExLlama

SqueezeLLM

Efficient Loading and Inference

Pre-training

Merge and Mixture of Experts

Benchmarking

LLM Focus

Llama 2

Falcon

Mistral 7B

Microsoft phi-1.5 and phi-2

Google Gemma

Qwen

Machine Translation

Fine-tuning

GPT

Evaluation

Data Processing

*: Articles with sections only accessible to paid subscribers