[Oneshot refactor] Refactor `initialize_model_from_path` #1109
Conversation
👋 Hi! Thank you for contributing to llm-compressor. Please add the ready label when the PR is ready for review. Note: This is required to complete the testing suite, please only add the label once the PR is code complete and local testing has been performed.
kylesayrs
left a comment
Please move `set_seed` outside of the `initialize_model_from_path` function. Otherwise lgtm.
It's part of training args (https://github.com/huggingface/transformers/blob/main/src/transformers/training_args.py#L1044) and was in the original code. If there is an argument for moving it outside, we can do it in a follow-up PR.
…ame_from_model (#1108)

ORDER OF REVIEWS:
1. #1108 <- current PR
2. #1103
3. #1109
4. #1110

SUMMARY:
* Rename `get_shared_processor_src` to `get_processor_from_model`
* Fix the signature of `initialize_processor_from_path`, where `teacher` should be optional

TEST PLAN:
* Pass all existing tests
* Search for `get_shared_processor_src` using `pygrep`:

```bash
function pygrep() {
    local search_term="$1"
    shift
    local search_dirs="${*:-src examples tests}"
    grep -rn --include="*.py" -E "$search_term" $search_dirs
}
```

---------
Co-authored-by: Dipika Sikka <dipikasikka1@gmail.com>
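For reference, a quick usage sketch of the `pygrep` helper above (the invocations are illustrative, not from the PR):

```bash
# Search the default directories (src, examples, tests) for the old name
pygrep "get_shared_processor_src"

# Restrict the search to a single directory
pygrep "get_processor_from_model" src
```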
ORDER OF REVIEWS:
1. #1108
2. #1103 <- current PR
3. #1109
4. #1110

SUMMARY:
Refactor the dataclasses used for the llm-compressor entrypoints (oneshot, train, apply) to decouple non-relevant attributes from the existing dataclasses. For example, `recipe` lived in `training_args`, but a recipe is contained in a session, not in the trainer that `training_args` governs. Dataclass refactor details are in https://docs.google.com/document/d/1YbR1dTQmCzqhGk74m5msBzqoPHQgB6dVxDtf6cTmetc/edit?usp=sharing

Note: #1110 takes care of using a new entrypoint that will prohibit the post_train / oneshot call from using training_args. The current entrypoint still needs training_args for oneshot to function - this PR is just for refactoring the dataclasses.

Before:
* ModelArguments: https://github.com/vllm-project/llm-compressor/blob/6fa5a5eecc7d363ec73474d011d40135b6374179/src/llmcompressor/transformers/finetune/model_args.py#L6
* DataTrainingArguments: https://github.com/vllm-project/llm-compressor/blob/6fa5a5eecc7d363ec73474d011d40135b6374179/src/llmcompressor/transformers/finetune/data/data_args.py#L70
* TrainingArguments: https://github.com/vllm-project/llm-compressor/blob/6fa5a5eecc7d363ec73474d011d40135b6374179/src/llmcompressor/transformers/finetune/training_args.py#L10

After:
* ModelArguments: https://github.com/vllm-project/llm-compressor/pull/1103/files#diff-58fd0f7ae4564317960ae0d4d4b2cdb97b9588c1915f062915e74ecf51b5502cR6
* DatasetArguments: https://github.com/vllm-project/llm-compressor/pull/1103/files#diff-5e43f74ba5d8327b937adada3c7f30a7efb13f9a44cb3fdb5e1a2a12b8b8ea27R70
* RecipeArguments: https://github.com/vllm-project/llm-compressor/pull/1103/files#diff-0ff9c048a4deb55e5459054bdc61a5d8c81da9c94588ec2355e6b2c2ec8675d1R6
* TrainingArguments: https://github.com/vllm-project/llm-compressor/pull/1103/files#diff-249ee96763dd50956a7309f898eda4bcaa91c6af653474568fbda10b5a39c817R12

TEST PLAN:
* Pass all existing tests
* Search dataclass arguments using the `pygrep` helper shown above

---------
Co-authored-by: Rahul Tuli <rahul@neuralmagic.com>
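A minimal sketch of the decoupling described above, assuming illustrative field names rather than the PR's exact ones (see the diff links above for the actual definitions):

```python
from dataclasses import dataclass, field
from typing import Optional

@dataclass
class RecipeArguments:
    """Session-level recipe options, split out of TrainingArguments.

    Illustrative: the recipe configures the compression session,
    not the trainer, so it no longer rides along in training_args.
    """
    recipe: Optional[str] = field(
        default=None,
        metadata={"help": "Path to (or string of) the compression recipe"},
    )

@dataclass
class DatasetArguments:
    """Dataset options, renamed from DataTrainingArguments (illustrative)."""
    dataset: Optional[str] = field(
        default=None,
        metadata={"help": "Name or path of the calibration/training dataset"},
    )
```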
brian-dellabetta
left a comment
Things look good to me; I just had breaking-change concerns, but it looks like you addressed those in the Slack thread with @robertgshaw2-redhat
ORDER OF REVIEWS:
1. #1108
2. #1103
3. #1109
4. #1110 <- current PR

SUMMARY:
* Create a class to decouple the dependency on `main`. Class `Oneshot` consists of pre-processing, carrying out the oneshot logic, and post-processing
* Move the oneshot class and method under `llmcompressor/entrypoints/oneshot.py`
* Add a README in `/llmcompressor/entrypoints` with info on oneshot
* Delete the oneshot logic from the `/finetune` directory and add a deprecation warning
* Remove `apply`, used only for the stage-runner oneshot pathway, from session.py and session_function.py
* Add oneshot-only calibration dataloader logic
* Add a return value of `model: PretrainedModel` for `def oneshot`
* Make the oneshot logic independent of `TrainingArguments`
* Remove `overwrite_output_dir` as a oneshot input arg -> only used for `TrainingArguments`
* Update the README on the `/finetune` path. Remove the `oneshot` and `oneshot with fsdp` sections
* Update the `wrap_save_pretrained` logic to run only if not already wrapped -> used by the stage runner to avoid double wrapping

Entrypoints:
```python3
from llmcompressor import oneshot
oneshot(**kwargs)  # calls Oneshot
```
or
```python3
from llmcompressor import Oneshot
oneshot = Oneshot(**kwargs)
oneshot()  # preprocesses, carries out the oneshot logic, and postprocesses
```

TEST PLAN:
Pass all tests and examples. Verified https://github.com/vllm-project/llm-compressor/blob/main/examples/quantization_2of4_sparse_w4a16/llama7b_sparse_w4a16.py works as expected.

FOLLOW UPS:
* Stage runner removal
* Update the entrypoints folder with train, eval, predict, etc.

---------
Signed-off-by: George Ohashi <george@neuralmagic.com>
Signed-off-by: George <george@neuralmagic.com>
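A rough skeleton of the `Oneshot` flow described above; the module path and the pre-process -> oneshot logic -> post-process flow are from the PR summary, while the individual method names are assumptions:

```python
class Oneshot:
    """Illustrative skeleton of llmcompressor/entrypoints/oneshot.py.

    Method names below are assumptions; only the overall flow is
    taken from the PR summary.
    """

    def __init__(self, **kwargs):
        # In the real class, kwargs resolve to the refactored dataclasses
        # (ModelArguments, DatasetArguments, RecipeArguments) from #1103;
        # no TrainingArguments are required.
        self.kwargs = kwargs
        self.model = None

    def __call__(self):
        self.pre_process()    # load model/processor, build the oneshot-only calibration dataloader
        self.apply_oneshot()  # run the recipe in a compression session (hypothetical name)
        self.post_process()   # save the model; wrap save_pretrained only if not already wrapped
        return self.model     # def oneshot now returns model: PretrainedModel

    def pre_process(self): ...
    def apply_oneshot(self): ...
    def post_process(self): ...
```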
ORDER OF REVIEWS:
1. #1108
2. #1103
3. #1109 <- current PR
4. #1110

SUMMARY:
Refactor `initialize_model_from_path` to decouple `training_args`-dependent logic from oneshot (non-`training_args`) logic.

TEST PLAN:
* Search `initialize_model_from_path` using `grep`
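A minimal sketch of the split this PR describes, assuming illustrative parameter and field names (only `initialize_model_from_path` itself is from the PR):

```python
from typing import Optional
from transformers import AutoModelForCausalLM, TrainingArguments, set_seed

def initialize_model_from_path(
    model_args,                                         # refactored ModelArguments
    training_args: Optional[TrainingArguments] = None,  # None on the oneshot path
):
    # Oneshot (non-training_args) logic: loading the model needs no
    # TrainingArguments at all
    model = AutoModelForCausalLM.from_pretrained(model_args.model_name_or_path)

    # training_args-dependent logic runs only on the training pathway;
    # per the review above, set_seed stays here for now since it is part
    # of training args in transformers
    if training_args is not None:
        set_seed(training_args.seed)

    return model
```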