```bash
llmcompressor.transformers.text_generation.train \
    --model PATH_TO_MODEL \
    --distill_teacher PATH_TO_TEACHER \
    --dataset DATASET_NAME \
    --recipe PATH_TO_RECIPE \
    --output_dir PATH_TO_OUTPUT \
    --num_train_epochs 1 \
    --splits "train"
```

Also supported:
- `llmcompressor.transformers.text_generation.finetune` (alias for `train`)
- `llmcompressor.transformers.text_generation.oneshot`
- `llmcompressor.transformers.text_generation.eval`
- `llmcompressor.transformers.text_generation.apply` (for running in sequential stage mode)
- `llmcompressor.transformers.text_generation.compress` (alias for `apply`)
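As an illustration, the `eval` pathway accepts the same style of arguments; the exact flags below are assumptions carried over from the shared argument parser rather than a verified invocation:

```bash
llmcompressor.transformers.text_generation.eval \
    --model PATH_TO_MODEL \
    --dataset DATASET_NAME \
    --splits "test"
```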
To launch with FSDP instead, use `accelerate`:

```bash
accelerate launch \
    --config_file example_fsdp_config.yaml \
    --no_python llmcompressor.transformers.text_generation.finetune \
    --model PATH_TO_MODEL \
    --distill_teacher PATH_TO_TEACHER \
    --dataset DATASET_NAME \
    --recipe PATH_TO_RECIPE \
    --output_dir PATH_TO_OUTPUT \
    --num_train_epochs 1 \
    --splits "train"
```

See configure_fsdp.md for additional instructions on setting up FSDP configuration.
Finetuning can also be launched directly from Python:

```python
from llmcompressor.transformers import train
model = "./obcq_deployment"
teacher_model = "Xenova/llama2.c-stories15M"
dataset_name = "open_platypus"
concatenate_data = False
output_dir = "./output_finetune"
recipe = "test_trainer_recipe.yaml"
num_train_epochs = 2
overwrite_output_dir = True
splits = {
    "train": "train[:50%]",
}
train(
    model=model,
    distill_teacher=teacher_model,
    dataset=dataset_name,
    output_dir=output_dir,
    recipe=recipe,
    num_train_epochs=num_train_epochs,
    overwrite_output_dir=overwrite_output_dir,
    concatenate_data=concatenate_data,
    splits=splits,
)
```

Finetuning arguments are split into four groups:
- ModelArguments: `src/llmcompressor/transformers/utils/arg_parser/model_arguments.py`
- TrainingArguments: `src/llmcompressor/transformers/utils/arg_parser/training_arguments.py`
- DatasetArguments: `src/llmcompressor/transformers/utils/arg_parser/data_arguments.py`
- RecipeArguments: `src/llmcompressor/transformers/utils/arg_parser/recipe_arguments.py`
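Fields from any of these groups can be mixed as keyword arguments in a single Python call, since all four groups are parsed from the same keyword dictionary. A brief sketch, assuming standard Hugging Face `TrainingArguments` fields such as `learning_rate` and `per_device_train_batch_size` pass through unchanged:

```python
from llmcompressor.transformers import train

train(
    model="Xenova/llama2.c-stories15M",   # ModelArguments
    dataset="open_platypus",              # DatasetArguments
    recipe="test_trainer_recipe.yaml",    # RecipeArguments
    output_dir="./output_finetune",       # TrainingArguments
    learning_rate=5e-5,                   # TrainingArguments (assumed pass-through)
    per_device_train_batch_size=4,        # TrainingArguments (assumed pass-through)
)
```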
One-shot calibration can also be launched under FSDP:

```bash
accelerate launch \
    --config_file example_fsdp_config.yaml \
    --no_python llmcompressor.transformers.text_generation.oneshot \
    --model PATH_TO_MODEL \
    --num_calibration_samples 512 \
    --dataset DATASET_NAME \
    --dataset_config_name OPTIONAL \
    --max_seq_len OPTIONAL \
    --concatenate_data OPTIONAL \
    --recipe PATH_TO_RECIPE \
    --output_dir PATH_TO_OUTPUT \
    --splits "train" \
    --pad_to_max_length False
```

The same can be run from Python:

```python
from llmcompressor.transformers import oneshot
model ="Xenova/llama2.c-stories15M"
dataset_name = "open_platypus"
concatenate_data = False
pad_to_max_length = False
output_dir = "./output_oneshot"
recipe = "test_oneshot_recipe.yaml"
overwrite_output_dir = True
splits = {
    "calibration": "train[:20%]",
}
oneshot(
    model=model,
    dataset=dataset_name,
    concatenate_data=concatenate_data,
    output_dir=output_dir,
    recipe=recipe,
    overwrite_output_dir=overwrite_output_dir,
    pad_to_max_length=pad_to_max_length,
    splits=splits,
)
```
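For reference, a hypothetical `test_oneshot_recipe.yaml` could apply a single one-shot pruning modifier; the stage name, modifier, and parameters below are illustrative assumptions, not the repository's actual recipe:

```yaml
test_stage:
  obcq_modifiers:
    SparseGPTModifier:
      sparsity: 0.5
      block_size: 128
```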
A recipe can be run stage-by-stage by setting `run_stages` to `True` or by calling the `llmcompressor.transformers.apply`/`compress` pathways. Each stage in the recipe should have a `run_type` attribute set to either `oneshot` or `train` when running in sequential mode. See `example_alternating_recipe.yaml` for an example of a staged recipe for Llama. (This can also be run with FSDP by launching the script as `accelerate launch --config_file example_fsdp_config.yaml test_multi.py`.)
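As a rough sketch of the staged layout (stage names, modifiers, and parameters here are illustrative assumptions; see `example_alternating_recipe.yaml` for the real file), each stage pairs a `run_type` with its modifiers:

```yaml
initial_sparsity_stage:
  run_type: oneshot
  obcq_modifiers:
    SparseGPTModifier:
      sparsity: 0.5
      block_size: 128
next_finetuning_stage:
  run_type: train
  pruning_modifiers:
    ConstantPruningModifier:
      targets: '__ALL_PRUNABLE__'
      start: 0
```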
test_multi.py:

```python
from llmcompressor.transformers import apply
model = "../ml-experiments/nlg-text_generation/llama_pretrain-llama_7b-base/dense/training"
dataset_name = "open_platypus"
concatenate_data = False
run_stages = True
output_dir = "./output_finetune_multi"
recipe = "example_alternating_recipe.yaml"
num_train_epochs = 1
overwrite_output_dir = True
splits = {
    "train": "train[:95%]",
    "calibration": "train[95%:100%]",
}
apply(
    model=model,
    dataset=dataset_name,
    run_stages=run_stages,
    output_dir=output_dir,
    recipe=recipe,
    num_train_epochs=num_train_epochs,
    overwrite_output_dir=overwrite_output_dir,
    concatenate_data=concatenate_data,
    remove_unused_columns=False,
    splits=splits,
)
```