Commit e38db3a
[CI] Tune nightly benchmarking job for better reliability (#17122)
This PR tunes the nightly benchmarking job to produce more consistent
results:
- Lowers the tolerance threshold of benchmarking results accepted from
50% to 8%
- Nightly was flaking before even with a 50% tolerance threshold
- Raises the iterations to 5000
- Using 10,000 iterations did not result in significantly more stable
performance, although this may change as we obtain more data
- However, the PVC benchmarking job in the overall nightly workflow now
takes about ~47 minutes, whereas before the PVC benchmarking job took
~14 minutes
- This should not have major impact on execution time however,
considering the E2E tests take ~42 minutes: Since both these jobs run in
parallel on different machines, the theoretical effect on the overall
workflow should only be about 5 minutes, although this would depend on
whether or not machines are able to be scheduled in time.
- Changes the benchmarking workflows in sycl-nightly.yml to use the
tuned PERF_PVC runner
- Untuned machines are exhibiting large variations when running
compute-benchmarks (20-25%, up to 50% in the worst case scenario): These
are unacceptable variations and not particularly useful.
- Disables nightly benchmarking on gen12:
- Gen12 machines are currently untuned. Similar to PVC machines, these
results are not accurate and not worth serious nightly benchmarking.
- Adds guards for benchmarking jobs to prevent benchmark runs in forks
#14454 (comment)
---------
Co-authored-by: Nick Sarnie <[email protected]>1 parent e0eda7e commit e38db3a
File tree
4 files changed
+32
-18
lines changed- .github/workflows
- devops
- actions/run-tests/benchmark
- benchmarking
4 files changed
+32
-18
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
126 | 126 | | |
127 | 127 | | |
128 | 128 | | |
| 129 | + | |
129 | 130 | | |
130 | 131 | | |
131 | 132 | | |
| |||
170 | 171 | | |
171 | 172 | | |
172 | 173 | | |
173 | | - | |
| 174 | + | |
| 175 | + | |
| 176 | + | |
174 | 177 | | |
175 | 178 | | |
176 | 179 | | |
177 | 180 | | |
178 | 181 | | |
179 | | - | |
180 | | - | |
181 | | - | |
182 | | - | |
183 | | - | |
184 | 182 | | |
185 | 183 | | |
186 | 184 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
247 | 247 | | |
248 | 248 | | |
249 | 249 | | |
250 | | - | |
| 250 | + | |
251 | 251 | | |
252 | 252 | | |
253 | 253 | | |
| |||
262 | 262 | | |
263 | 263 | | |
264 | 264 | | |
265 | | - | |
266 | | - | |
267 | | - | |
268 | | - | |
269 | | - | |
270 | 265 | | |
271 | | - | |
| 266 | + | |
272 | 267 | | |
273 | 268 | | |
274 | 269 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
46 | 46 | | |
47 | 47 | | |
48 | 48 | | |
| 49 | + | |
| 50 | + | |
| 51 | + | |
| 52 | + | |
| 53 | + | |
| 54 | + | |
| 55 | + | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
| 59 | + | |
| 60 | + | |
| 61 | + | |
| 62 | + | |
| 63 | + | |
| 64 | + | |
| 65 | + | |
| 66 | + | |
| 67 | + | |
| 68 | + | |
| 69 | + | |
49 | 70 | | |
50 | 71 | | |
51 | 72 | | |
| |||
69 | 90 | | |
70 | 91 | | |
71 | 92 | | |
72 | | - | |
| 93 | + | |
73 | 94 | | |
74 | 95 | | |
75 | 96 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
10 | 10 | | |
11 | 11 | | |
12 | 12 | | |
13 | | - | |
| 13 | + | |
14 | 14 | | |
15 | | - | |
| 15 | + | |
16 | 16 | | |
17 | 17 | | |
18 | 18 | | |
| |||
23 | 23 | | |
24 | 24 | | |
25 | 25 | | |
26 | | - | |
| 26 | + | |
27 | 27 | | |
28 | 28 | | |
29 | 29 | | |
30 | 30 | | |
31 | 31 | | |
32 | 32 | | |
33 | 33 | | |
34 | | - | |
| 34 | + | |
35 | 35 | | |
36 | 36 | | |
37 | 37 | | |
| |||
0 commit comments