Commit 6feef10
authored
[AMD] Extended FP conversion for gfx1250 (triton-lang#8821)
GFX1250 arch comes with new FP conversion instruction which can convert
8x FP32/FP16/Bf16 to 8x FP8. This PR extends to the AMDGPU backend with
the support of the new instructions1 parent 03a0321 commit 6feef10
File tree
2 files changed
+178
-56
lines changed- python/test/unit/language
- third_party/amd/lib/TritonAMDGPUToLLVM
2 files changed
+178
-56
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
373 | 373 | | |
374 | 374 | | |
375 | 375 | | |
| 376 | + | |
| 377 | + | |
| 378 | + | |
376 | 379 | | |
377 | 380 | | |
378 | 381 | | |
| |||
0 commit comments