Commit c39572a

jtuyls authored and pstarkcdpr committed
[CI][TorchModels] Update llama 8b fp16 golden time (iree-org#22426)
The prefill latency of llama 8b f16 at sequence length 128 should have improved with iree-org#22393, so this commit lowers the golden time from 42.0 ms to 29.5 ms.

Signed-off-by: Jorn Tuyls <[email protected]>
1 parent ae3044a commit c39572a

1 file changed: +1 -1 lines changed


tests/external/iree-test-suites/torch_models/llama_8b_fp16/prefill_benchmark_seq128_mi325.json

Lines changed: 1 addition & 1 deletion
@@ -36,5 +36,5 @@
       "value": "34x2097152xf16"
     }
   ],
-  "golden_time_ms": 42.0
+  "golden_time_ms": 29.5
 }
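
To illustrate how a golden time like this could be consumed, here is a minimal sketch of a latency check. It assumes the benchmark harness compares a measured time against `golden_time_ms` with a relative tolerance. The helper names `check_against_golden` and `relative_change` and the 10% tolerance are hypothetical and are not taken from the IREE test suite. Only the `golden_time_ms` key and the two values come from the diff.

```python
import json

# Hypothetical excerpt of the benchmark JSON; only "golden_time_ms" is from the diff.
CONFIG_TEXT = '{"golden_time_ms": 29.5}'


def check_against_golden(measured_ms, golden_ms, tolerance=0.10):
    """Return True if measured_ms is at most (1 + tolerance) * golden_ms.

    The 10% default tolerance is an assumption for illustration only.
    """
    return measured_ms <= golden_ms * (1.0 + tolerance)


def relative_change(old_ms, new_ms):
    """Fractional change from old_ms to new_ms (negative means faster)."""
    return (new_ms - old_ms) / old_ms


golden_ms = json.loads(CONFIG_TEXT)["golden_time_ms"]

if __name__ == "__main__":
    # A run at the old golden time would fail against the new one under this check.
    print(check_against_golden(42.0, golden_ms))
    # Relative change of the golden time itself, as a percentage.
    print(f"{relative_change(42.0, golden_ms):.1%}")
```

Under these assumptions, the new golden time is roughly 30% lower than the old one, so a run that still takes 42.0 ms would be flagged as a regression.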
