Along with Llama 3 405B, Meta also released new versions of Llama 3 8B and 70B (“Llama 3.1”). You can find them here:
The main differences from Llama 3 include official support for German, French, Italian, Portuguese, Hindi, Spanish, and Thai, along with function calling. These new versions have also been post-trained on very long sequences: they can handle contexts of up to 128k tokens without a noticeable drop in accuracy.
How does fine-tuning differ for this new version? I found a couple of changes that make fine-tuning Llama 3.1 easier and more effective.
In this article, we will fine-tune Llama 3.1 with LoRA and QLoRA and discuss the changes in the code and learning curves compared to Llama 3. We will see that the padding side chosen for fine-tuning has a significant and unexpected impact on the results.
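To make the padding-side question concrete before we get to the training code, here is a minimal sketch of what left vs. right padding does to a batch of variable-length sequences. This uses plain Python lists and a hypothetical pad token id; in practice the Hugging Face tokenizer handles this via `tokenizer.padding_side = "left"` or `"right"`.

```python
PAD = 0  # hypothetical pad token id for illustration

def pad_batch(sequences, side="right"):
    """Pad variable-length token-id sequences to a uniform length.

    side="right" appends pad tokens after the sequence;
    side="left" prepends them before it.
    """
    max_len = max(len(s) for s in sequences)
    padded = []
    for s in sequences:
        pads = [PAD] * (max_len - len(s))
        padded.append(s + pads if side == "right" else pads + s)
    return padded

batch = [[5, 6, 7], [8, 9]]
print(pad_batch(batch, side="right"))  # [[5, 6, 7], [8, 9, 0]]
print(pad_batch(batch, side="left"))   # [[5, 6, 7], [0, 8, 9]]
```

The point of the sketch: with right padding, the real tokens stay aligned at the start of each row, while left padding shifts them toward the end. For a causal language model, which side the pad tokens sit on changes which positions carry real content during training, which is why the choice can affect the learning curves.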
The code for fine-tuning Llama 3.1 with LoRA and QLoRA is implemented in this notebook: