Skip to content

Commit f87e4fa

Browse files
talumbaucopybara-github
authored andcommitted
GPU model prefill sequence length minimum is 32
PiperOrigin-RevId: 721395570
1 parent 46aa456 commit f87e4fa

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

ai_edge_torch/generative/examples/experimental/gemma/convert_gemma2_gpu_to_tflite.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@
4343
)
4444
_PREFILL_SEQ_LENS = flags.DEFINE_multi_integer(
4545
'prefill_seq_lens',
46-
(8, 64, 128, 256, 512, 1024),
46+
(32, 64, 128, 256, 512, 1024),
4747
'List of the maximum sizes of prefill input tensors.',
4848
)
4949
_KV_CACHE_MAX_LEN = flags.DEFINE_integer(

0 commit comments

Comments
 (0)