Fix variable names to align with the new transformers version (#1792)

shanjiaz · web-flow · commit cf9ccef7098e · 2025-09-02T10:49:54.000-04:00
SUMMARY:
Rename the past_key_value kwarg set by calibrate_kv_cache_input_hook to past_key_values, the name used by the new transformers version.
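
For reviewers, a minimal sketch of why the kwarg name matters. The hook body mirrors the changed lines, but `attention_forward` is a hypothetical stand-in that only accepts the plural keyword; it is not the actual transformers signature, and the `SimpleNamespace` module is a placeholder for a real attention layer.

```python
from types import SimpleNamespace


def calibrate_kv_cache_input_hook(module, args, kwargs):
    """Simplified copy of the hook touched by this change."""
    kv_cache = getattr(module, "kv_cache")
    # Plural name, so the cache reaches a forward that expects past_key_values.
    kwargs["past_key_values"] = kv_cache
    kwargs["use_cache"] = False
    return args, kwargs


def attention_forward(hidden_states, past_key_values=None, use_cache=True):
    """Hypothetical forward (assumption): accepts only the plural keyword."""
    return past_key_values


# Placeholder module carrying a dummy cache object.
module = SimpleNamespace(kv_cache=object())

args, kwargs = calibrate_kv_cache_input_hook(module, ("hidden",), {})
result = attention_forward(*args, **kwargs)
```

With the old singular key, a forward with this strict signature would reject the call with a TypeError; a forward that swallows extra keyword arguments would instead ignore the cache silently, which is harder to spot.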


TEST PLAN:
This should resolve the first three KV cache failures. I'm looking into
the last one, which is related to the meta device.

Signed-off-by: shanjiaz <zsjwpianpian@gmail.com>
diff --git a/src/llmcompressor/modifiers/quantization/calibration.py b/src/llmcompressor/modifiers/quantization/calibration.py
@@ -247,7 +247,7 @@ def calibrate_kv_cache_input_hook(
     kv_cache to singleton QuantizedKVParameterCache.
     """
     kv_cache = getattr(module, "kv_cache")
-    kwargs["past_key_value"] = kv_cache
+    kwargs["past_key_values"] = kv_cache
     kwargs["use_cache"] = False
     return args, kwargs