Fix CPU Offloading (#1159)

dsikka · web-flow · commit 2053ee974423 · 2025-02-17T18:34:30.000Z
SUMMARY:
- When updating the training args, `place_model_on_device` was missed
and as a result, when creating the trainer (which we really should not
be doing during oneshot...) the default value is left as True and the
trainer tries to move the model to a gpu, if it is available.
- We want this argument to be False as we handle the device map and
model initialization based on the calibration needs

TEST PLAN:
- `cpu_offloading_fp8.py` ran to completion without issue
- `mult_gpus_int8_device_map` made it past the error and is running
diff --git a/src/llmcompressor/args/training_arguments.py b/src/llmcompressor/args/training_arguments.py
@@ -30,3 +30,7 @@ class TrainingArguments(HFTrainingArgs):
             "checkpoints will be written."
         },
     )
+
+    @property
+    def place_model_on_device(self):
+        return False

Original file line number	Diff line number	Diff line change
`@@ -30,3 +30,7 @@ class TrainingArguments(HFTrainingArgs):`
`30`	`30`	`"checkpoints will be written."`
`31`	`31`	`},`
`32`	`32`	`)`
	`33`	`+`
	`34`	`+ @property`
	`35`	`+ def place_model_on_device(self):`
	`36`	`+ return False`