[Callbacks] Remove pre_initialize_structure#1160
Merged
Conversation
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review. Note: this is required to complete the testing suite; please only add the label once the PR is code complete and local testing has been performed.
dsikka
added a commit
that referenced
this pull request
Feb 25, 2025
## Purpose ##
* Simplify all methods of saving into one point, namely the wrapped `save_pretrained` function
* Precursor to #1160
* Needed for having a single point for saving on top of existing recipes

## Background ##
All the things needed to be done during saving:
1. Save the model weights, potentially compressed
2. Save the processor
3. Update the recipe checkpoint
4. Copy any necessary python files from the model cache
5. Only save on the main process

After these changes, (1, 2, 3, 4) will be done within the `save_pretrained` function, and (5) will be the responsibility of the caller. (3) will be implemented by #1160 so as not to conflict with existing logic in pre_init.

All of the places where a model is saved are:
* If an output dir is specified, at the end of the main function
* Between stages of the stage runner
* Between epochs of the HF Trainer
* By the user after oneshot/training completes

After these changes, all of these will be replaced by a single `save_checkpoint` function which calls `save_pretrained` to do all the necessary things.

## Changes ##
* Remove `save_model_and_recipe`
  * Saving recipes is now done by the `save_pretrained` function
* Implement `save_checkpoint`
  * Single entrypoint for saving a model and its processor
  * Performs actions (1, 2, 4)
* Replace all locations where a model is saved with `save_checkpoint`
  * All applicable callers now only save on the main process (5)
* Remove support for `modify_fsdp_model_save_pretrained` and `unwrap_and_export_model`, to be added back in a future release

---------

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
Co-authored-by: Dipika Sikka <dipikasikka1@gmail.com>
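The consolidation described above can be sketched roughly as follows. This is a hypothetical illustration of the shape of the change, not llm-compressor's real implementation: the stand-in classes, `out/checkpoint` path, and `is_main_process` helper are all assumptions; only the `save_checkpoint` / `save_pretrained` names come from the PR text.

```python
# Illustrative sketch: one save entrypoint, with the main-process check (step 5)
# left to the caller. The classes below are stand-ins, not real library types.

class DummyModel:
    def __init__(self):
        self.saved_to = None

    def save_pretrained(self, save_directory, **kwargs):
        # In the real wrapper this would also compress weights, update the
        # recipe checkpoint, and copy cached python files (steps 1, 3, 4)
        self.saved_to = save_directory


class DummyProcessor:
    def __init__(self):
        self.saved_to = None

    def save_pretrained(self, save_directory):
        # Step 2: save the processor alongside the model
        self.saved_to = save_directory


def save_checkpoint(save_path, model, processor):
    """Single entrypoint: save the model (and recipe/files), then the processor."""
    model.save_pretrained(save_path)
    processor.save_pretrained(save_path)


def is_main_process():
    # Placeholder; real code would query the distributed runtime
    return True


model, processor = DummyModel(), DummyProcessor()
if is_main_process():  # step 5 stays with the caller
    save_checkpoint("out/checkpoint", model, processor)
print(model.saved_to, processor.saved_to)
```

Every former call site (end of main, between stages, between epochs, after oneshot) would then reduce to this single guarded `save_checkpoint` call.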
brian-dellabetta
previously approved these changes
Feb 25, 2025
Collaborator
brian-dellabetta
left a comment
Looking good, but I'm going off the assumption that none of this deleted code is actually needed.
Collaborator
Author
@brian-dellabetta The PR description lists all of the functionality that was needed by preinitialize and where that functionality now lives.
horheynm
approved these changes
Feb 26, 2025
dsikka
approved these changes
Feb 26, 2025
src/llmcompressor/transformers/sparsification/compressed_tensors_utils.py
Show resolved
Hide resolved
brian-dellabetta
approved these changes
Feb 26, 2025
kylesayrs
added a commit
that referenced
this pull request
Mar 11, 2025
## Purpose ##
* Fix staged 2of4 example

## Background ##
* When #1160 landed, it introduced a bug in the recipe container which meant that the recipe was not recompiled after `append`ing. This caused sgpt to initialize twice and gptq to never initialize, leading to a sparsity-only quantization config
* At some point, a change was introduced which causes previous stages to be reconstructed after recipe recompilation. This means that without resetting the session between stages, previous stages will initialize twice
* In order to avoid this issue, this PR introduces `session.reset()` between stages
* This change has the consequence of creating `recipe.yaml` files which do not have the full recipe history. However, I believe this is acceptable for the time being, as the stage runner and this workflow will be removed in the next release

---------

Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
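The stage-reset fix described above can be sketched as follows. The `Session` class here is a minimal stand-in (the stage names and methods besides `reset()`/`initialize()` are assumptions), meant only to show why resetting between stages prevents double initialization.

```python
# Illustrative sketch: without reset(), recompiling the recipe replays prior
# stages, initializing them a second time. Stand-in class, not the real API.

class Session:
    def __init__(self):
        self.initialized_stages = []

    def initialize(self, stage):
        # Recipe recompilation reconstructs previous stages; initializing
        # without a reset would append earlier stages again
        self.initialized_stages.append(stage)

    def reset(self):
        self.initialized_stages.clear()


session = Session()
for stage in ["sparsity_stage", "quantization_stage"]:
    session.reset()            # avoid double-initializing previous stages
    session.initialize(stage)
    print(session.initialized_stages)
```

The trade-off noted above is visible here: after the reset, the session no longer carries the full history of earlier stages, which is why the emitted `recipe.yaml` loses prior-stage entries.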
kylesayrs
added a commit
that referenced
this pull request
Mar 15, 2025
## Purpose ##
* Simplify code related to callbacks
* Remove event lifecycle
  * Callback event lifecycle
  * Optimizer event lifecycle
* Remove the concept of a "start_event", which was originally used because initialization sometimes requires triggering on_start, and on_start requires an event in order to get an index and index-related info
  * For now, we create a dummy event on initialization which has an index of zero
  * In the future, all start events will be triggered by events (never by initialize) and all end events will be triggered by events or by finalization
## Prerequisites ##
* #1160
## Changes ##
* Instead of using event lifecycle as a proxy, pass events to modifiers
directly
```diff
- self._check_setup_event_lifecycle(event_type)
+ event = Event(event_type=event_type)
- for event in self.event_lifecycle.events_from_type(event_type):
+ for mod in self.modifiers:
data = mod.update_event(state=self.state, event=event, **kwargs)
```
* This was originally needed for optimizer event lifecycle, that is now
removed
* Make silent no-ops into errors
```diff
if not self.initialized_:
- return
+ raise RuntimeError("Cannot update an uninitialized modifier")
```
* Remove `check_initialized`, which was used to allow for checking in
situations which required double initialization
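Taken together, the two diffs above amount to constructing events directly, dispatching them to each modifier, and raising on uninitialized use. A minimal runnable sketch, assuming simplified class shapes (the real llm-compressor interfaces carry `state` and `**kwargs` as shown in the diffs):

```python
# Illustrative sketch of the simplified callback path: no event-lifecycle
# proxy, and updating an uninitialized modifier is an error, not a no-op.
from dataclasses import dataclass


@dataclass
class Event:
    event_type: str


class Modifier:
    def __init__(self):
        self.initialized_ = False
        self.seen = []

    def initialize(self):
        self.initialized_ = True

    def update_event(self, event):
        if not self.initialized_:
            # Previously a silent `return`; now a loud failure
            raise RuntimeError("Cannot update an uninitialized modifier")
        self.seen.append(event.event_type)


class Lifecycle:
    def __init__(self, modifiers):
        self.modifiers = modifiers

    def event(self, event_type):
        event = Event(event_type=event_type)  # built directly, no proxy
        for mod in self.modifiers:
            mod.update_event(event)


mods = [Modifier(), Modifier()]
lifecycle = Lifecycle(mods)
for mod in mods:
    mod.initialize()
lifecycle.event("batch_start")
print(mods[0].seen)
```

Calling `lifecycle.event(...)` before `mod.initialize()` raises `RuntimeError`, matching the "silent no-ops into errors" change.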
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
kylesayrs
added a commit
that referenced
this pull request
Apr 16, 2025
## Purpose ## * Code documentation ## Prerequisites ## * #1160 --------- Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
Purpose
Prerequisites
Follow-ups
Changes
The preinitialization step used to fulfill a few purposes:
* … before `initialize` is called. In these cases, we can pass the model directly
* Provide a way for modifiers to influence the model after they have already been applied
  * Create quantization modifier on GPTQ `on_initialize` function
* Remove `EventType.order()` method, which is unused
* Extend the `Recipe.simplify_recipe` class method to support strings

Lifecycle:
* `create_session()` (doesn't do much and can be hidden behind `initialize`)
* `initialize(model=..., recipe=...)`: start modifiers
* `LifecycleCallback.event(...)`: start/end modifiers
* `finalize()`

Regression Evaluation:
* Main
* This branch
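The lifecycle listed above can be sketched end to end. The function names echo the PR text, but the bodies below are stand-ins that only record the call order; the real session objects do considerably more.

```python
# Hypothetical walkthrough of the session lifecycle; stand-in functions that
# record ordering only. create_session() is folded into initialize(), per the
# PR note that it "doesn't do much and can be hidden behind initialize".
calls = []


def initialize(model=None, recipe=None):
    calls.append("initialize")  # start modifiers


def event(event_type):
    calls.append(f"event:{event_type}")  # start/end modifiers fire here


def finalize():
    calls.append("finalize")


initialize(model="model", recipe="recipe")
event("batch_start")
event("batch_end")
finalize()
print(calls)
```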