In this notebook, we will fine-tune Llama-3.1-8B on the Alpaca Financial dataset.
LoRA is a technique designed to make fine-tuning more efficient by reducing the number of trainable parameters. This approach speeds up the fine-tuning process and produces much smaller fine-tuned checkpoints.
Rather than adjusting all of the model's weights during fine-tuning, LoRA freezes the base model and selectively trains only a few layers, typically within the attention mechanisms. Instead of directly modifying the weights of these layers, LoRA introduces two smaller low-rank matrices whose product is added to the original weights. These smaller matrices are the only parts updated during fine-tuning and are saved separately. This method preserves the model's original parameters, allowing the LoRA weights to be merged seamlessly into the base model later. Unloading the LoRA adapter and reverting to the original base model is also possible.
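To make the idea concrete, here is a minimal NumPy sketch of the low-rank update (the dimensions and scaling constant are illustrative toy values, far smaller than a real Llama-3.1-8B projection, and are not taken from any particular library):

```python
import numpy as np

# Toy dimensions: one frozen weight matrix (hypothetical sizes).
d_out, d_in, r = 64, 64, 4   # r is the LoRA rank
alpha = 8                    # LoRA scaling factor

rng = np.random.default_rng(0)
W = rng.normal(size=(d_out, d_in))   # frozen base weight, never updated

# The two small trainable matrices. B starts at zero so the adapter
# initially contributes nothing and training begins from the base model.
A = rng.normal(size=(r, d_in)) * 0.01
B = np.zeros((d_out, r))

def lora_forward(x):
    # Base path plus scaled low-rank update: (W + (alpha/r) * B @ A) @ x
    return W @ x + (alpha / r) * (B @ (A @ x))

# Merging the adapter into W reproduces the same output; "unloading"
# just means dropping the B @ A term to recover the original W.
W_merged = W + (alpha / r) * (B @ A)

x = rng.normal(size=d_in)
assert np.allclose(lora_forward(x), W_merged @ x)

full_params = W.size
lora_params = A.size + B.size
print(f"trainable params: {lora_params} vs full fine-tuning: {full_params}")
```

Note how the adapter trains only `A.size + B.size = r * (d_in + d_out)` parameters instead of `d_in * d_out`, which is where the checkpoint-size savings come from.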
For more information about LoRA, please refer to the original paper, "LoRA: Low-Rank Adaptation of Large Language Models" (Hu et al., 2021).