Skip to content

Commit 81ec39c

Browse files
kylesayrsdsikka
andauthored
[Offload] Convert model back to CT offloading for testing (vllm-project#2403)
## Purpose ## * Fix testing which requires access to the model after the model has been saved ## Prerequisites ## * vllm-project/compressed-tensors#601 ## Changes ## * Convert back to CT offloading after converting to accelerate offloading for saving * Previously we just "removed dispatch", but this is bad practice as it won't work for disk offloading Signed-off-by: Kyle Sayers <kylesayrs@gmail.com> Co-authored-by: Dipika Sikka <dipikasikka1@gmail.com>
1 parent c6e4d38 commit 81ec39c

File tree

1 file changed

+7
-4
lines changed

1 file changed

+7
-4
lines changed

src/llmcompressor/transformers/compression/compressed_tensors_utils.py

Lines changed: 7 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@
99
SparsityCompressionConfig,
1010
)
1111
from compressed_tensors.config import CompressionFormat
12-
from compressed_tensors.offload import is_rank0, to_accelerate
12+
from compressed_tensors.offload import from_accelerate, is_rank0, to_accelerate
1313
from loguru import logger
1414
from transformers import PreTrainedModel
1515

@@ -89,10 +89,10 @@ def save_pretrained_wrapper(
8989
if compressor is not None:
9090
compressor.compress_model(model)
9191

92-
if is_rank0():
93-
# convert to accelerate offloaded for optimal saving with transformers
94-
to_accelerate(model)
92+
# convert to accelerate offloaded for optimal saving with transformers
93+
to_accelerate(model)
9594

95+
if is_rank0():
9696
# save (compressed) model structure
9797
original_save_pretrained.__get__(model, model_class)(
9898
save_directory,
@@ -110,6 +110,9 @@ def save_pretrained_wrapper(
110110
# copy python files from cache dir to save_path if any
111111
copy_python_files_from_model_cache(model, save_directory)
112112

113+
# convert back from accelerate to restore model to original form
114+
from_accelerate(model)
115+
113116
save_pretrained_wrapper._overridden = True
114117
return save_pretrained_wrapper
115118

0 commit comments

Comments
 (0)