Commit d0bb140
[NPU] bugfix for model Qwen3-Coder-Next at weight shape transpose for npu. (sgl-project#18700)
Co-authored-by: McZyWu <zhuoyun.wu.23@ucl.ac.uk>
1 parent a1b39c1 commit d0bb140
File tree
2 files changed: +3 -3 lines changed
- python/sglang/srt
  - hardware_backend/npu/quantization
  - layers/attention
Lines changed: 2 additions & 2 deletions
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 118-120 | 118-120 | unchanged context |
| 121 | | - |
| | 121 | + |
| 122-124 | 122-124 | unchanged context |
| 129-131 | 129-131 | unchanged context |
| 132 | | - |
| | 132 | + |
| 133-135 | 133-135 | unchanged context |
Lines changed: 1 addition & 1 deletion
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| 46-48 | 46-48 | unchanged context |
| 49 | | - |
| | 49 | + |
| 50-52 | 50-52 | unchanged context |