Commit 9edcf5e
refactor the SparseAttnIndexer as CustomOp
Signed-off-by: ganyi <[email protected]>
1 parent 3b45a44 commit 9edcf5e
File tree
8 files changed, +622 -339 lines changed
- vllm
  - attention/ops
  - config
  - model_executor
    - layers
    - models
  - platforms
  - v1/attention/backends/mla