Commit 8bbfd21
authored
[PROTON] Implement max bps method for XPU (#5617)
This PR implements theoretical memory bandwidth calculation for XPU GPUs
in proton.
Remarks:
- Derived formula was computed and compared against the published xpu
bandwidths.
- The multipliers in `arch_to_mem_type_multiplier` are related to memory
types that the architectures implement (gddr6, hbm2e)
- Only 3 arch mappings are included. To my knowledge the rest of intel
gpus are integrated, thus bandwidth is system dependent. Perhaps some
exception catching should be implemented, however that is not the case
for other branches.
- The result is in mega bytes, aligned with cuda, but the docstring as
well as hip case point to the fact that the returned value should be in
bytes. I think this should be aligned upstream.1 parent 98da880 commit 8bbfd21
1 file changed
+7
-3
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
18 | 18 | | |
19 | 19 | | |
20 | 20 | | |
| 21 | + | |
| 22 | + | |
| 23 | + | |
| 24 | + | |
| 25 | + | |
| 26 | + | |
21 | 27 | | |
22 | 28 | | |
23 | 29 | | |
| |||
68 | 74 | | |
69 | 75 | | |
70 | 76 | | |
71 | | - | |
72 | | - | |
73 | | - | |
| 77 | + | |
0 commit comments