Commit ab07bfd
authored
Workaround for OOM error on
## Summary
The benchmark currently requires a lot of memory with current
configuration (69GBs), it's the heaviest of them all based on current
results from `all_benchmark_data.csv`, so I added a workaround for GPUs
with not enough memory.
## Details
Alternative implementation could be to replace all benchmark function
calls with another function that would process OOM errors, but that
would require changing all benchmarks. We would need to replace all
`triton.testing.do_bench` and with local function that would handle OOM
errors and change `_test_memory` as well. And we would probably need to
start saving some `inf` results in the csv.
## Testing Done
Tested specifically changed benchmark.
- Hardware Type: Intel GPU Max 1550
- [x] run `make test` to ensure correctness
- [x] run `make checkstyle` to ensure code style
- [x] run `make test-convergence` to ensure convergencebenchmark_jsd (#1037)1 parent 7b51e56 commit ab07bfd
2 files changed
+23
-1
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
9 | 9 | | |
10 | 10 | | |
11 | 11 | | |
| 12 | + | |
12 | 13 | | |
13 | 14 | | |
14 | 15 | | |
| |||
123 | 124 | | |
124 | 125 | | |
125 | 126 | | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
126 | 135 | | |
127 | 136 | | |
128 | 137 | | |
129 | 138 | | |
130 | | - | |
| 139 | + | |
131 | 140 | | |
132 | 141 | | |
133 | 142 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
121 | 121 | | |
122 | 122 | | |
123 | 123 | | |
| 124 | + | |
| 125 | + | |
| 126 | + | |
| 127 | + | |
| 128 | + | |
| 129 | + | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
| 133 | + | |
| 134 | + | |
| 135 | + | |
| 136 | + | |
0 commit comments