Commit 2550f14
llama3.2 + cross attn test (#220)
* llama3.2 + cross attn test
* lint issues fix
* mypy errors
* making yapf happy
* cut off WA for tunned gemms
* try and catch for non continuous tensor
---------
Co-authored-by: Aleksandr Malyshev <[email protected]>1 parent 4075b35 commit 2550f14
File tree
5 files changed
+280
-94
lines changed- tests/kernels
- vllm
- attention/backends
- model_executor/layers
- worker
5 files changed
+280
-94
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
21 | 21 | | |
22 | 22 | | |
23 | 23 | | |
24 | | - | |
| 24 | + | |
| 25 | + | |
25 | 26 | | |
26 | 27 | | |
27 | 28 | | |
| |||
807 | 808 | | |
808 | 809 | | |
809 | 810 | | |
810 | | - | |
811 | 811 | | |
812 | 812 | | |
813 | 813 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
15 | | - | |
16 | | - | |
| 15 | + | |
| 16 | + | |
17 | 17 | | |
18 | 18 | | |
19 | 19 | | |
| |||
524 | 524 | | |
525 | 525 | | |
526 | 526 | | |
527 | | - | |
528 | 527 | | |
| 528 | + | |
| 529 | + | |
| 530 | + | |
| 531 | + | |
| 532 | + | |
| 533 | + | |
529 | 534 | | |
530 | 535 | | |
531 | 536 | | |
| |||
0 commit comments