Commit 1655ca0
[6.4][Backport] Backport some attention bugfixes + causal attention (#1811)
Support for causal attention and more strict checks for KV-Cache
* Fix attention bugs (swap thread and iter when Q LDS is bypassed and bf16 tests) (#1797)
* Fix some attention bugs:
- do now swap thread and iter subdims for Q if we are bypassing LDS
- use f32 attention in CPU code
- fix bug in maskKVCacheTosa for bf16
---------
Co-authored-by: Daniel Hernandez-Juarez <[email protected]>
Co-authored-by: Daniel Hernandez-Juarez <[email protected]>1 parent be36966 commit 1655ca0
File tree
11 files changed
+543
-108
lines changed- mlir
- include/mlir/Dialect/Rock/IR
- lib
- Conversion/TosaToRock
- Dialect/Rock/Transforms
- test
- Conversion/TosaToRock
- e2e
- fusion
- nightly-misc-e2e/mixr-attention/f16
- pr-e2e/attention
- tools/rocmlir-gen
11 files changed
+543
-108
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
205 | 205 | | |
206 | 206 | | |
207 | 207 | | |
208 | | - | |
209 | | - | |
210 | | - | |
211 | | - | |
212 | | - | |
213 | | - | |
214 | | - | |
215 | | - | |
216 | | - | |
217 | | - | |
218 | | - | |
219 | | - | |
220 | | - | |
221 | | - | |
222 | | - | |
223 | | - | |
224 | | - | |
225 | | - | |
226 | | - | |
227 | | - | |
228 | | - | |
| 208 | + | |
| 209 | + | |
| 210 | + | |
| 211 | + | |
| 212 | + | |
| 213 | + | |
| 214 | + | |
| 215 | + | |
| 216 | + | |
| 217 | + | |
| 218 | + | |
| 219 | + | |
| 220 | + | |
| 221 | + | |
| 222 | + | |
| 223 | + | |
229 | 224 | | |
230 | 225 | | |
231 | 226 | | |
| |||
432 | 427 | | |
433 | 428 | | |
434 | 429 | | |
435 | | - | |
436 | | - | |
437 | | - | |
438 | | - | |
439 | | - | |
440 | | - | |
441 | | - | |
442 | | - | |
443 | | - | |
444 | | - | |
445 | | - | |
446 | | - | |
447 | | - | |
448 | | - | |
449 | | - | |
450 | | - | |
451 | | - | |
452 | | - | |
| 430 | + | |
| 431 | + | |
| 432 | + | |
| 433 | + | |
| 434 | + | |
| 435 | + | |
| 436 | + | |
| 437 | + | |
| 438 | + | |
| 439 | + | |
| 440 | + | |
| 441 | + | |
| 442 | + | |
| 443 | + | |
| 444 | + | |
| 445 | + | |
453 | 446 | | |
454 | 447 | | |
455 | 448 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
905 | 905 | | |
906 | 906 | | |
907 | 907 | | |
| 908 | + | |
| 909 | + | |
| 910 | + | |
| 911 | + | |
| 912 | + | |
| 913 | + | |
| 914 | + | |
| 915 | + | |
| 916 | + | |
| 917 | + | |
| 918 | + | |
| 919 | + | |
| 920 | + | |
| 921 | + | |
| 922 | + | |
| 923 | + | |
| 924 | + | |
908 | 925 | | |
909 | 926 | | |
910 | 927 | | |
| |||
1004 | 1021 | | |
1005 | 1022 | | |
1006 | 1023 | | |
| 1024 | + | |
| 1025 | + | |
| 1026 | + | |
| 1027 | + | |
| 1028 | + | |
| 1029 | + | |
| 1030 | + | |
| 1031 | + | |
| 1032 | + | |
| 1033 | + | |
| 1034 | + | |
| 1035 | + | |
1007 | 1036 | | |
1008 | 1037 | | |
| 1038 | + | |
| 1039 | + | |
| 1040 | + | |
| 1041 | + | |
| 1042 | + | |
| 1043 | + | |
| 1044 | + | |
| 1045 | + | |
| 1046 | + | |
| 1047 | + | |
| 1048 | + | |
| 1049 | + | |
| 1050 | + | |
| 1051 | + | |
| 1052 | + | |
1009 | 1053 | | |
1010 | 1054 | | |
1011 | 1055 | | |
| |||
1216 | 1260 | | |
1217 | 1261 | | |
1218 | 1262 | | |
1219 | | - | |
| 1263 | + | |
1220 | 1264 | | |
1221 | | - | |
1222 | 1265 | | |
1223 | 1266 | | |
1224 | 1267 | | |
| |||
1245 | 1288 | | |
1246 | 1289 | | |
1247 | 1290 | | |
1248 | | - | |
| 1291 | + | |
1249 | 1292 | | |
1250 | | - | |
1251 | | - | |
| 1293 | + | |
1252 | 1294 | | |
1253 | | - | |
1254 | 1295 | | |
1255 | 1296 | | |
1256 | 1297 | | |
| |||
Lines changed: 16 additions & 2 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
868 | 868 | | |
869 | 869 | | |
870 | 870 | | |
| 871 | + | |
| 872 | + | |
| 873 | + | |
871 | 874 | | |
872 | 875 | | |
873 | 876 | | |
| |||
1754 | 1757 | | |
1755 | 1758 | | |
1756 | 1759 | | |
| 1760 | + | |
| 1761 | + | |
| 1762 | + | |
| 1763 | + | |
| 1764 | + | |
| 1765 | + | |
| 1766 | + | |
| 1767 | + | |
| 1768 | + | |
| 1769 | + | |
1757 | 1770 | | |
1758 | 1771 | | |
1759 | 1772 | | |
| |||
1828 | 1841 | | |
1829 | 1842 | | |
1830 | 1843 | | |
1831 | | - | |
| 1844 | + | |
1832 | 1845 | | |
1833 | 1846 | | |
1834 | 1847 | | |
| |||
1979 | 1992 | | |
1980 | 1993 | | |
1981 | 1994 | | |
| 1995 | + | |
1982 | 1996 | | |
1983 | 1997 | | |
1984 | 1998 | | |
1985 | 1999 | | |
1986 | 2000 | | |
1987 | | - | |
| 2001 | + | |
1988 | 2002 | | |
1989 | 2003 | | |
1990 | 2004 | | |
| |||
0 commit comments