Commit 1f4d184
Temporarily allow bfloat16 dot algorithms on CPU.
XLA:CPU doesn't (yet!) support explicit algorithms for controlling the precision of dot products, so JAX has a check that fails when a non-trivial algorithm is specified on CPU. To support downstream use cases, this change allows some bfloat16 algorithms to pass through that check. XLA:CPU "emulates" these algorithms using `F32_F32_F32` with the appropriate casting, so CPU numerics will differ from those on platforms with explicit algorithm support. Even so, it is useful to be able to use these algorithms with the correct input and output casting without requiring platform-dependent logic in user code.
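As a rough illustration of what "emulating with `F32_F32_F32` and the appropriate casting" could mean numerically, here is a minimal sketch written with plain `jax.numpy` casts. The helper name `emulated_bf16_dot` is hypothetical and not part of this change. The sketch assumes a bfloat16-input, float32-output algorithm, and it does not claim to reproduce the code path XLA:CPU actually takes.

```python
import jax.numpy as jnp


def emulated_bf16_dot(lhs, rhs):
    """Hypothetical helper: bfloat16 inputs, float32 multiply/accumulate."""
    # Round both operands to bfloat16, as a bfloat16-input algorithm would.
    lhs_bf16 = lhs.astype(jnp.bfloat16)
    rhs_bf16 = rhs.astype(jnp.bfloat16)
    # Cast back up and run the dot product itself in float32.
    return jnp.dot(lhs_bf16.astype(jnp.float32), rhs_bf16.astype(jnp.float32))


# 1.001 is not representable in bfloat16 and rounds to 1.0 on the input cast.
x = jnp.array([[1.001, 2.0]], dtype=jnp.float32)
y = jnp.array([[1.0], [3.0]], dtype=jnp.float32)
out = emulated_bf16_dot(x, y)
```

The input rounding is applied before the float32 dot, so user code sees the same input and output dtypes it would get on a platform with explicit algorithm support, even though the accumulation details may differ.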
PiperOrigin-RevId: 7038348891
Parent 861115a, commit 1f4d184
2 files changed: 5 additions, 0 deletions.

File 1 (name not shown): 2 lines added, at new lines 3776-3777.
File 2 (name not shown): 3 lines added, at new lines 1085-1087.