Llama-recipes is a companion project to the Llama models. It's goal is to provide examples to quickly get started with fine-tuning for domain adaptation and how to run inference for the fine-tuned models.

These details have not been verified by PyPI

Project links

License
- Other/Proprietary License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

Llama Recipes: Examples to get started using the Llama models from Meta

The 'llama-recipes' repository is a companion to the Meta Llama models. We support the latest version, Llama 3.2 Vision and Llama 3.2 Text, in this repository. This repository contains example scripts and notebooks to get started with the models in a variety of use-cases, including fine-tuning for domain adaptation and building LLM-based applications with Llama and other tools in the LLM ecosystem. The examples here use Llama locally, in the cloud, and on-prem.

[!IMPORTANT] Llama 3.2 follows the same prompt template as Llama 3.1, with a new special token <|image|> representing the input image for the multimodal models.

Token Description

<|begin_of_text|> Specifies the start of the prompt.

<|image|> Represents the image tokens passed as an input to Llama.

<|eot_id|> This token signifies the end of a turn i.e. the end of the model's interaction either with the user or tool executor.

<|eom_id|> End of Message. A message represents a possible stopping point where the model can inform the execution environment that a tool call needs to be made.

<|python_tag|> A special tag used in the model’s response to signify a tool call.

<|finetune_right_pad_id|> Used for padding text sequences in a batch to the same length.

<|start_header_id|>{role}<|end_header_id|> These tokens enclose the role for a particular message. The possible roles can be: system, user, assistant and ipython.

<|end_of_text|> This is equivalent to the EOS token. For multiturn-conversations it's usually unused, this token is expected to be generated only by the base models.

More details on the prompt templates for image reasoning, tool-calling and code interpreter can be found on the documentation website.

Token	Description
`<\|begin_of_text\|>`	Specifies the start of the prompt.
`<\|image\|>`	Represents the image tokens passed as an input to Llama.
`<\|eot_id\|>`	This token signifies the end of a turn i.e. the end of the model's interaction either with the user or tool executor.
`<\|eom_id\|>`	End of Message. A message represents a possible stopping point where the model can inform the execution environment that a tool call needs to be made.
`<\|python_tag\|>`	A special tag used in the model’s response to signify a tool call.
`<\|finetune_right_pad_id\|>`	Used for padding text sequences in a batch to the same length.
`<\|start_header_id\|>{role}<\|end_header_id\|>`	These tokens enclose the role for a particular message. The possible roles can be: system, user, assistant and ipython.
`<\|end_of_text\|>`	This is equivalent to the EOS token. For multiturn-conversations it's usually unused, this token is expected to be generated only by the base models.

Llama Recipes: Examples to get started using the Llama models from Meta

Getting Started

These instructions will get you a copy of the project up and running on your local machine for development and testing purposes. See deployment for notes on how to deploy the project on a live system.

Prerequisites

PyTorch Nightlies

If you want to use PyTorch nightlies instead of the stable release, go to this guide to retrieve the right --extra-index-url URL parameter for the pip install commands on your platform.

Installing

Llama-recipes provides a pip distribution for easy install and usage in other projects. Alternatively, it can be installed from source.

[!NOTE] Ensure you use the correct CUDA version (from nvidia-smi) when installing the PyTorch wheels. Here we are using 11.8 as cu118. H100 GPUs work better with CUDA >12.0

Install with pip

pip install llama-recipes

Install with optional dependencies

Llama-recipes offers the installation of optional packages. There are three optional dependency groups. To run the unit tests we can install the required dependencies with:

pip install llama-recipes[tests]

For the vLLM example we need additional requirements that can be installed with:

pip install llama-recipes[vllm]

To use the sensitive topics safety checker install with:

pip install llama-recipes[auditnlg]

Some recipes require the presence of langchain. To install the packages follow the recipe description or install with:

pip install llama-recipes[langchain]

Optional dependencies can also be combines with [option1,option2].

Install from source

To install from source e.g. for development use these commands. We're using hatchling as our build backend which requires an up-to-date pip as well as setuptools package.

git clone git@github.com:meta-llama/llama-recipes.git
cd llama-recipes
pip install -U pip setuptools
pip install -e .

For development and contributing to llama-recipes please install all optional dependencies:

git clone git@github.com:meta-llama/llama-recipes.git
cd llama-recipes
pip install -U pip setuptools
pip install -e .[tests,auditnlg,vllm]

Getting the Llama models

You can find Llama models on Hugging Face hub here, where models with hf in the name are already converted to Hugging Face checkpoints so no further conversion is needed. The conversion step below is only for original model weights from Meta that are hosted on Hugging Face model hub as well.

Model conversion to Hugging Face

If you have the model checkpoints downloaded from the Meta website, you can convert it to the Hugging Face format with:

## Install Hugging Face Transformers from source
pip freeze | grep transformers ## verify it is version 4.45.0 or higher

git clone git@github.com:huggingface/transformers.git
cd transformers
pip install protobuf
python src/transformers/models/llama/convert_llama_weights_to_hf.py \
   --input_dir /path/to/downloaded/llama/weights --model_size 3B --output_dir /output/path

Repository Organization

Most of the code dealing with Llama usage is organized across 2 main folders: recipes/ and src/.

`recipes/`

Contains examples are organized in folders by topic:

Subfolder	Description
quickstart	The "Hello World" of using Llama, start here if you are new to using Llama.
use_cases	Scripts showing common applications of Meta Llama3
3p_integrations	Partner owned folder showing common applications of Meta Llama3
responsible_ai	Scripts to use PurpleLlama for safeguarding model outputs
experimental	Meta Llama implementations of experimental LLM techniques

`src/`

Contains modules which support the example recipes:

Subfolder	Description
configs	Contains the configuration files for PEFT methods, FSDP, Datasets, Weights & Biases experiment tracking.
datasets	Contains individual scripts for each dataset to download and process. Note
inference	Includes modules for inference for the fine-tuned models.
model_checkpointing	Contains FSDP checkpoint handlers.
policies	Contains FSDP scripts to provide different policies, such as mixed precision, transformer wrapping policy and activation checkpointing along with any precision optimizer (used for running FSDP with pure bf16 mode).
utils	Utility files for: - `train_utils.py` provides training/eval loop and more train utils. - `dataset_utils.py` to get preprocessed datasets. - `config_utils.py` to override the configs received from CLI. - `fsdp_utils.py` provides FSDP wrapping policy for PEFT methods. - `memory_utils.py` context manager to track different memory stats in train loop.

Supported Features

The recipes and modules in this repository support the following features:

Feature
HF support for inference	✅
HF support for finetuning	✅
PEFT	✅
Deferred initialization ( meta init)	✅
Low CPU mode for multi GPU	✅
Mixed precision	✅
Single node quantization	✅
Flash attention	✅
Activation checkpointing FSDP	✅
Hybrid Sharded Data Parallel (HSDP)	✅
Dataset packing & padding	✅
BF16 Optimizer (Pure BF16)	✅
Profiling & MFU tracking	✅
Gradient accumulation	✅
CPU offloading	✅
FSDP checkpoint conversion to HF for inference	✅
W&B experiment tracker	✅

Contributing

Please read CONTRIBUTING.md for details on our code of conduct, and the process for submitting pull requests to us.

License

See the License file for Meta Llama 3.2 here and Acceptable Use Policy here

See the License file for Meta Llama 3.1 here and Acceptable Use Policy here

See the License file for Meta Llama 3 here and Acceptable Use Policy here

See the License file for Meta Llama 2 here and Acceptable Use Policy here

Project details

These details have not been verified by PyPI

Project links

License
- Other/Proprietary License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

0.0.5.post2

Jan 22, 2025

0.0.5.post1

Jan 22, 2025

0.0.5

Jan 22, 2025

0.0.4.post1

Sep 26, 2024

This version

0.0.4

Sep 25, 2024

0.0.3

Jul 23, 2024

0.0.2

May 17, 2024

0.0.1

Sep 6, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llama_recipes-0.0.4.tar.gz (24.6 MB view details)

Uploaded Sep 25, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llama_recipes-0.0.4-py3-none-any.whl (71.1 kB view details)

Uploaded Sep 25, 2024 Python 3

File details

Details for the file llama_recipes-0.0.4.tar.gz.

File metadata

Download URL: llama_recipes-0.0.4.tar.gz
Upload date: Sep 25, 2024
Size: 24.6 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for llama_recipes-0.0.4.tar.gz
Algorithm	Hash digest
SHA256	`4299f5d049074600bbee9f3295eb98f79e0f1f845115c3962a70bc1b9f8cadd3`
MD5	`2f3b46ab1f1dba33304fe1892a6bae71`
BLAKE2b-256	`bb02d9a8331a671b37c50c27e7c2e5c5909daf0682e01720b12bcd458fbb8e0c`

See more details on using hashes here.

File details

Details for the file llama_recipes-0.0.4-py3-none-any.whl.

File metadata

Download URL: llama_recipes-0.0.4-py3-none-any.whl
Upload date: Sep 25, 2024
Size: 71.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for llama_recipes-0.0.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`783469db284f251dbc8b5b9427cba1b09f2c87e137f6d777220bbd7e337c96d2`
MD5	`186c34c401868a64227ba126e9b8c317`
BLAKE2b-256	`05fa58415c3ae96dba7c678134ef3efddbb9d4c71bdd864cca4bd21ff5c228ea`

See more details on using hashes here.

llama-recipes 0.0.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Llama Recipes: Examples to get started using the Llama models from Meta

Table of Contents

Getting Started

Prerequisites

PyTorch Nightlies

Installing

Install with pip

Install with optional dependencies

Install from source

Getting the Llama models

Model conversion to Hugging Face

Repository Organization

recipes/

src/

Supported Features

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`recipes/`

`src/`