Commit d1b9d75

[quantization] Remove redundant args (#552)
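This commit removes redundant positional arguments from the example inputs used when exporting. Only `past_key_value` and `position_embeddings` are still passed, now as keyword arguments.

TICO-DCO-1.0-Signed-off-by: seongwoo <mhs4670go@naver.com>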
1 parent b610f6e commit d1b9d75

1 file changed: +4 -6 lines changed

tico/quantization/wrapq/examples/llama/quantize_decoder_layer_decode.py

Lines changed: 4 additions & 6 deletions
@@ -183,13 +183,11 @@ def make_random_decode_batch():
     (
         x_ex,  # hidden_states
         mask_ex,  # attention_mask
-        None,  # position_ids (unused)
-        (past_k_ex, past_v_ex),  # past_key_value
-        False,  # output_attentions
-        True,  # use_cache
-        None,  # cache_position (unused)
-        (cos_ex, sin_ex),  # position_embeddings
     ),
+    {
+        "past_key_value": (past_k_ex, past_v_ex),
+        "position_embeddings": (cos_ex, sin_ex),
+    },
 )
 cm.save(save_path)
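
The effect of the change can be checked without running the export. The following is a minimal sketch: `forward` is a hypothetical stand-in whose parameter order follows the inline comments in the diff, its default values are assumptions (not taken from the TICO wrapper), and the strings are placeholders for the example tensors. It uses `inspect.signature` to compare how the old positional call and the new args-plus-kwargs call bind to the parameters.

```python
import inspect


# Hypothetical stand-in for the wrapped decoder layer's forward signature.
# Parameter order follows the comments in the diff; the defaults are assumed.
def forward(
    hidden_states,
    attention_mask=None,
    position_ids=None,
    past_key_value=None,
    output_attentions=False,
    use_cache=False,
    cache_position=None,
    position_embeddings=None,
):
    return None


def bound_arguments(args, kwargs):
    """Map every parameter to the value a call would give it, defaults included."""
    bound = inspect.signature(forward).bind(*args, **kwargs)
    bound.apply_defaults()
    return dict(bound.arguments)


# Placeholder values standing in for the example tensors.
x_ex, mask_ex = "x", "mask"
past_kv = ("past_k", "past_v")
pos_emb = ("cos", "sin")

# Old style: every slot filled positionally, including the unused ones.
old = bound_arguments(
    (x_ex, mask_ex, None, past_kv, False, True, None, pos_emb), {}
)

# New style: two positional inputs plus a kwargs dict.
new = bound_arguments(
    (x_ex, mask_ex),
    {"past_key_value": past_kv, "position_embeddings": pos_emb},
)

# Parameters whose bound value differs between the two call styles.
differing = {name for name in old if old[name] != new[name]}
print(differing)
```

Under these assumed defaults the two calls differ only in `use_cache`, which the new call leaves at its default. Whether that matters depends on how the actual wrapped layer treats `use_cache`.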
