You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
[TUTORIAL] Adjust rand number range for matmul tutorial (#7505)
This PR fixes the "Unit Test" in `03-matrix-multiplication.py` on MI300x
and MI350x GPUs. `torch.randn((512, 512), device=DEVICE,
dtype=torch.float16)` can generate relatively large absolute number in
the input which may lead to larger sums having a larger absolute
roundoff error as the exponent grows. The unit test passes on AMD MI250
and Nvidia H100 as well.
# CDNA2 devices use reduced precision fp16 and bf16 and flush input and
375
-
# output denormal values to zero. Detailed info is at: https://pytorch.org/docs/stable/notes/numerical_accuracy.html#reduced-precision-fp16-and-bf16-gemms-and-convolutions-on-amd-instinct-mi200-devices
0 commit comments