## Llama-Recipes Quickstart

If you are new to developing with Meta Llama models, this is the place to start. This folder contains introductory-level notebooks covering a range of techniques for working with Meta Llama.
* The [Running Llama 3 Anywhere](./Running_Llama3_Anywhere/) notebooks demonstrate how to run Llama inference on Linux, macOS, and Windows using the appropriate tooling for each platform.
* The [Prompt Engineering with Llama 3](./Prompt_Engineering_with_Llama_3.ipynb) notebook showcases different ways to elicit the output you want from Llama. Take it for a spin to get a feel for how Llama responds to different inputs and generation parameters (a minimal generation sketch appears after the feature table below).
* The [inference](./inference/) folder contains scripts to deploy Llama for inference on servers and mobile devices. See also [vllm](../3p_integration/vllm/) and [tgi](../3p_integration/tgi/) for hosting Llama on open-source model servers; a short vLLM sketch also follows the feature table below.
* The [RAG](./RAG/) folder contains a simple Retrieval-Augmented Generation application built with Llama 3.
* The [finetuning](./finetuning/) folder contains resources to help you finetune Llama 3 on your custom datasets, in both single- and multi-GPU setups. The scripts use the native llama-recipes finetuning code in [finetuning.py](../../src/llama_recipes/finetuning.py), which supports the following features (a launch sketch follows the table):

| Feature                                        | Supported |
| ---------------------------------------------- | --------- |
| HF support for finetuning                      | ✅ |
| Deferred initialization (meta init)            | ✅ |
| HF support for inference                       | ✅ |
| Low CPU mode for multi-GPU                     | ✅ |
| Mixed precision                                | ✅ |
| Single-node quantization                       | ✅ |
| Flash attention                                | ✅ |
| PEFT                                           | ✅ |
| Activation checkpointing (FSDP)                | ✅ |
| Hybrid Sharded Data Parallel (HSDP)            | ✅ |
| Dataset packing & padding                      | ✅ |
| BF16 optimizer (pure BF16)                     | ✅ |
| Profiling & MFU tracking                       | ✅ |
| Gradient accumulation                          | ✅ |
| CPU offloading                                 | ✅ |
| FSDP checkpoint conversion to HF for inference | ✅ |
| W&B experiment tracker                         | ✅ |
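
As a quick orientation for the [finetuning](./finetuning/) folder, here is a minimal sketch of launching the native finetuning entry point from Python with LoRA (PEFT). The keyword arguments mirror the `train_config` fields at the time of writing and may differ between releases, so treat the exact names and values (`model_name`, `dataset`, `output_dir`, ...) as placeholders and check the finetuning folder for the authoritative launch commands.

```python
# Minimal sketch: single-GPU LoRA finetuning via the llama-recipes entry point.
# Keyword names follow the train_config fields and may vary across releases.
from llama_recipes.finetuning import main as finetune

finetune(
    model_name="meta-llama/Meta-Llama-3-8B",  # HF model ID or local checkpoint path
    use_peft=True,                            # train LoRA adapters instead of full weights
    peft_method="lora",
    dataset="samsum_dataset",                 # swap in your custom dataset config
    output_dir="finetuned/llama3-8b-lora",    # where the PEFT checkpoint is saved
    num_epochs=1,
    batch_size_training=2,
)
```

Multi-GPU runs are launched with `torchrun` and FSDP enabled; the finetuning folder documents the exact commands.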
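
For a taste of what the prompt engineering notebook covers, the sketch below shows how sampling parameters shape a completion using the Hugging Face `transformers` text-generation pipeline. The model ID and parameter values are only examples, not a substitute for the notebook itself; any Llama 3 checkpoint you have access to will do.

```python
# Minimal sketch: how generation parameters affect Llama's output via the
# Hugging Face transformers text-generation pipeline. Assumes access to the
# gated Llama 3 weights (or substitute a local checkpoint path).
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="meta-llama/Meta-Llama-3-8B-Instruct",
    device_map="auto",
)

prompt = "Explain retrieval-augmented generation in two sentences."

# do_sample=True enables sampling; lower temperature -> more deterministic output,
# lower top_p -> a narrower nucleus of candidate tokens.
result = generator(
    prompt,
    max_new_tokens=128,
    do_sample=True,
    temperature=0.6,
    top_p=0.9,
)
print(result[0]["generated_text"])
```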
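
And as a small preview of the model-server route mentioned in the inference bullet, here is an offline batched-inference sketch with vLLM. It assumes `vllm` is installed and again uses an example model ID; the [vllm](../3p_integration/vllm/) and [tgi](../3p_integration/tgi/) folders cover full server deployments.

```python
# Minimal sketch: offline batched inference with the vLLM Python API.
from vllm import LLM, SamplingParams

llm = LLM(model="meta-llama/Meta-Llama-3-8B-Instruct")  # example model ID
sampling = SamplingParams(temperature=0.7, top_p=0.9, max_tokens=256)

prompts = [
    "Summarize the benefits of parameter-efficient finetuning.",
    "List three considerations for serving LLMs on mobile devices.",
]

# Each prompt yields a RequestOutput; print the first completion for each.
for request_output in llm.generate(prompts, sampling):
    print(request_output.outputs[0].text)
```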