Commit 80a2639

fix kv cache tests
Signed-off-by: Kyle Sayers <kylesayrs@gmail.com>
1 parent 80d22a1 commit 80a2639

3 files changed: 9 additions, 11 deletions

.github/workflows/test-check-transformers.yaml

Lines changed: 1 addition & 1 deletion
@@ -98,4 +98,4 @@ jobs:
       - name: Running KV Cache Tests
         if: (success() || failure()) && steps.install.outcome == 'success'
         run: |
-          pytest -v tests/llmcompressor/transformers/kv_cache -k "not test_kv_cache_gptq_model_state_dict_attr"
+          pytest -v tests/llmcompressor/transformers/kv_cache

tests/e2e/vLLM/recipes/kv_cache/gptq.yaml

Lines changed: 2 additions & 3 deletions
@@ -1,8 +1,5 @@
 quant_stage:
   quant_modifiers:
-    QuantizationModifier:
-      kv_cache_scheme:
-        {num_bits: 8, type: float, symmetric: true, strategy: tensor}
     GPTQModifier:
       sequential_update: false
       ignore: ["lm_head"]
@@ -15,3 +12,5 @@ quant_stage:
             strategy: "channel"
             actorder: False
           targets: ["Linear"]
+      kv_cache_scheme:
+        {num_bits: 8, type: float, symmetric: true, strategy: tensor}

tests/llmcompressor/transformers/kv_cache/test_kv_cache.py

Lines changed: 6 additions & 7 deletions
@@ -214,13 +214,6 @@ def test_kv_cache_gptq_model_state_dict_attr(kv_cache_fixture, tmp_path):
     recipe = """
     quant_stage:
         quant_modifiers:
-            QuantizationModifier:
-                kv_cache_scheme:
-                    num_bits: {num_bits}
-                    type: {_type}
-                    strategy: {strategy}
-                    dynamic: {dynamic}
-                    symmetric: {symmetric}
             GPTQModifier:
                 ignore: ["lm_head"]
                 config_groups:
@@ -232,6 +225,12 @@ def test_kv_cache_gptq_model_state_dict_attr(kv_cache_fixture, tmp_path):
                             strategy: "channel"
                             actorder: False
                         targets: ["Linear"]
+                kv_cache_scheme:
+                    num_bits: {num_bits}
+                    type: {_type}
+                    strategy: {strategy}
+                    dynamic: {dynamic}
+                    symmetric: {symmetric}
     """
 
     output_dir, _ = next(kv_cache_fixture(recipe, tmp_path))
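
Note: the net effect of the two recipe changes is that `kv_cache_scheme` now sits under `GPTQModifier` instead of a separate `QuantizationModifier`. Below is a minimal Python sketch of that layout. It assumes the test fills the `{num_bits}`-style placeholders with `str.format`, which the diff does not show. `RECIPE_TEMPLATE` and `render_recipe` are hypothetical names, the `config_groups` block is trimmed for brevity, and the indentation is a guess because the scraped diff lost it.

```python
# Hypothetical template mirroring the updated recipe layout from this commit.
# Only the modifier nesting is taken from the diff; config_groups is omitted.
RECIPE_TEMPLATE = """
quant_stage:
    quant_modifiers:
        GPTQModifier:
            ignore: ["lm_head"]
            kv_cache_scheme:
                num_bits: {num_bits}
                type: {_type}
                strategy: {strategy}
                dynamic: {dynamic}
                symmetric: {symmetric}
"""


def render_recipe(num_bits, _type, strategy, dynamic, symmetric):
    """Fill the placeholders with one set of KV cache quantization arguments."""
    return RECIPE_TEMPLATE.format(
        num_bits=num_bits,
        _type=_type,
        strategy=strategy,
        dynamic=dynamic,
        symmetric=symmetric,
    )


# Values follow the e2e recipe (8-bit float, symmetric, per-tensor);
# dynamic=False is an assumed value that does not appear in that recipe.
recipe = render_recipe(8, "float", "tensor", False, True)
print(recipe)
```

With this layout a single `GPTQModifier` entry carries both the weight quantization groups and the KV cache scheme, so the recipe no longer needs a second modifier just for the cache.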
