Commit 43b22de

saienduri authored
Add punet benchmarking to the regression suite (iree-org#19088)
This commit adds support for benchmarking punet fp16/fp8 performance at TOM. This concludes adding all the necessary testing for the SDXL model. It also switches punet compilation to use the spec file, which is necessary for tuning at the current state of the project.

I've also updated the artifacts in Azure, this time including the date in the Azure link so everyone knows when the artifacts were generated. Nithin is working on implementing the spec file optimizations in the compiler itself, so the use of such files can be removed in the future.

All tests now have timeouts, and the existing timeouts were updated because the CLI flag appears to set a per-test timeout rather than a timeout for the whole pytest command. This avoids hangs such as https://github.com/iree-org/iree/actions/runs/11748746984/job/32734141414

---------

Signed-off-by: saienduri <[email protected]>
Signed-off-by: saienduri <[email protected]>
Co-authored-by: saienduri <[email protected]>
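Since the message says the timeout flag applies to each test rather than to the whole pytest command, the overall bound on a job grows with the number of tests. A minimal sketch of that arithmetic, assuming tests run sequentially and ignoring retries; the function name and the test count of 10 are hypothetical and not taken from the suite:

```python
def worst_case_runtime_s(num_tests: int, per_test_timeout_s: int) -> int:
    """Upper bound on wall-clock time if every test hits its own timeout.

    Assumes sequential execution; setup, teardown, and retries are ignored.
    """
    if num_tests < 0 or per_test_timeout_s < 0:
        raise ValueError("inputs must be non-negative")
    return num_tests * per_test_timeout_s


# Hypothetical suite of 10 tests under the old and new per-test limits.
old_bound = worst_case_runtime_s(10, 1200)
new_bound = worst_case_runtime_s(10, 240)
print(old_bound, new_bound)
```

Under these assumptions, lowering the per-test limit from 1200 to 240 seconds shrinks the worst-case bound by the same factor of five.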
1 parent cb5d1ab commit 43b22de

5 files changed: +593 -10 lines changed

.github/workflows/pkgci_regression_test.yml

Lines changed: 11 additions & 3 deletions
@@ -112,7 +112,7 @@ jobs:
  --no-skip-tests-missing-files \
  --capture=no \
  --log-cli-level=info \
- --timeout=1200 \
+ --timeout=240 \
  --durations=0 \
  --config-files=${MODELS_CONFIG_FILE_PATH}
@@ -189,7 +189,7 @@ jobs:
  -rpfE \
  --capture=no \
  --log-cli-level=info \
- --timeout=1200 \
+ --timeout=240 \
  --durations=0
  env:
    ROCM_CHIP: ${{ matrix.rocm-chip }}
@@ -203,7 +203,7 @@ jobs:
  -rpfE \
  --capture=no \
  --log-cli-level=info \
- --timeout=1200 \
+ --timeout=240 \
  --durations=0
  env:
    ROCM_CHIP: ${{ matrix.rocm-chip }}
@@ -227,6 +227,7 @@ jobs:
  --goldensize-rocm-clip-bytes 860000 \
  --goldensize-rocm-vae-bytes 840000 \
  --rocm-chip gfx90a \
+ --timeout=240 \
  --log-cli-level=info \
  --retries 7
  echo "$(<job_summary.md )" >> $GITHUB_STEP_SUMMARY
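The hunk above passes golden values and the chip name as command-line flags. The suite's real option registration is not part of this diff (pytest suites usually declare custom flags through a `pytest_addoption` hook in a conftest file), so the sketch below uses the standard-library `argparse` purely as a stand-in to show how flags of this shape become typed values. `build_parser` is a hypothetical helper.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    # Stand-in for the suite's own option registration, which this diff does not show.
    parser = argparse.ArgumentParser()
    parser.add_argument("--goldensize-rocm-clip-bytes", type=int)
    parser.add_argument("--goldensize-rocm-vae-bytes", type=int)
    parser.add_argument("--rocm-chip", type=str)
    parser.add_argument("--timeout", type=int)
    return parser


# Flag values copied from the gfx90a hunk above.
args = build_parser().parse_args([
    "--goldensize-rocm-clip-bytes", "860000",
    "--goldensize-rocm-vae-bytes", "840000",
    "--rocm-chip", "gfx90a",
    "--timeout=240",
])

# argparse maps dashes in option names to underscores in attribute names.
print(args.goldensize_rocm_clip_bytes, args.rocm_chip, args.timeout)
```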
@@ -247,7 +248,14 @@ jobs:
  --goldensize-rocm-unet-bytes 2270000 \
  --goldensize-rocm-clip-bytes 860000 \
  --goldensize-rocm-vae-bytes 840000 \
+ --goldentime-rocm-punet-int8-fp16-ms 55 \
+ --goldendispatch-rocm-punet-int8-fp16 1284 \
+ --goldensize-rocm-punet-int8-fp16-bytes 2560000 \
+ --goldentime-rocm-punet-int8-fp8-ms 59 \
+ --goldendispatch-rocm-punet-int8-fp8 1564 \
+ --goldensize-rocm-punet-int8-fp8-bytes 2800000 \
  --rocm-chip gfx942 \
  --log-cli-level=info \
+ --timeout=240 \
  --retries 7
  echo "$(<job_summary.md )" >> $GITHUB_STEP_SUMMARY
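The new flags give golden values for punet latency, dispatch count, and binary size on gfx942. The benchmark code that consumes them is in other files of this commit and is not shown here. The sketch below is one plausible way such a check could work, assuming a measurement passes when it is at most the golden value. The `Golden` class, `check_against_golden`, and the measured numbers are all hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Golden:
    time_ms: float
    dispatch_count: int
    size_bytes: int


def check_against_golden(time_ms: float, dispatch_count: int,
                         size_bytes: int, golden: Golden) -> List[str]:
    """Return a message for each metric that exceeds its golden value.

    Assumption: "at most the golden value" is the pass criterion.
    """
    failures = []
    if time_ms > golden.time_ms:
        failures.append(f"time {time_ms} ms > golden {golden.time_ms} ms")
    if dispatch_count > golden.dispatch_count:
        failures.append(f"dispatches {dispatch_count} > golden {golden.dispatch_count}")
    if size_bytes > golden.size_bytes:
        failures.append(f"size {size_bytes} B > golden {golden.size_bytes} B")
    return failures


# Golden values copied from the flags in the hunk above.
PUNET_INT8_FP16 = Golden(time_ms=55, dispatch_count=1284, size_bytes=2_560_000)
PUNET_INT8_FP8 = Golden(time_ms=59, dispatch_count=1564, size_bytes=2_800_000)

# Hypothetical measurements: the first passes, the second regresses on time only.
fp16_failures = check_against_golden(53.2, 1284, 2_500_000, PUNET_INT8_FP16)
fp8_failures = check_against_golden(61.0, 1564, 2_700_000, PUNET_INT8_FP8)
print(fp16_failures)
print(fp8_failures)
```

Returning a list of failures rather than raising on the first one lets a single run report every regressed metric at once.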
