File tree
5 files changed
+58
-15
lines changed- tests
- kernels
- quantization
- vllm
- model_executor/layers/quantization
- compressed_tensors/schemes
- utils
5 files changed
+58
-15
lines changedOriginal file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
86 | 86 |
| |
87 | 87 |
| |
88 | 88 |
| |
89 |
| - | |
90 |
| - | |
91 |
| - | |
92 |
| - | |
| 89 | + | |
93 | 90 |
| |
94 | 91 |
| |
95 | 92 |
| |
| |||
119 | 116 |
| |
120 | 117 |
| |
121 | 118 |
| |
122 |
| - | |
| 119 | + | |
| 120 | + | |
123 | 121 |
| |
124 | 122 |
| |
125 | 123 |
| |
| |||
145 | 143 |
| |
146 | 144 |
| |
147 | 145 |
| |
148 |
| - | |
149 | 146 |
| |
150 | 147 |
| |
151 | 148 |
| |
152 |
| - | |
| 149 | + | |
| 150 | + | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
153 | 155 |
| |
154 | 156 |
| |
155 | 157 |
| |
| |||
184 | 186 |
| |
185 | 187 |
| |
186 | 188 |
| |
187 |
| - | |
188 |
| - | |
| 189 | + | |
189 | 190 |
|
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
8 | 8 |
| |
9 | 9 |
| |
10 | 10 |
| |
| 11 | + | |
11 | 12 |
| |
12 | 13 |
| |
13 | 14 |
| |
| |||
74 | 75 |
| |
75 | 76 |
| |
76 | 77 |
| |
| 78 | + | |
| 79 | + | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
| 92 | + | |
| 93 | + | |
| 94 | + | |
| 95 | + | |
| 96 | + | |
| 97 | + | |
| 98 | + | |
| 99 | + | |
| 100 | + | |
| 101 | + | |
| 102 | + | |
| 103 | + | |
| 104 | + | |
| 105 | + | |
| 106 | + | |
77 | 107 |
| |
78 | 108 |
| |
79 | 109 |
| |
|
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
510 | 510 |
| |
511 | 511 |
| |
512 | 512 |
| |
| 513 | + | |
| 514 | + | |
| 515 | + | |
| 516 | + | |
| 517 | + | |
513 | 518 |
| |
514 | 519 |
| |
515 | 520 |
| |
516 | 521 |
| |
| 522 | + | |
517 | 523 |
| |
518 | 524 |
| |
519 | 525 |
| |
| |||
735 | 741 |
| |
736 | 742 |
| |
737 | 743 |
| |
738 |
| - | |
| 744 | + | |
739 | 745 |
| |
740 | 746 |
| |
741 | 747 |
| |
|
vllm/model_executor/layers/quantization/compressed_tensors/schemes/compressed_tensors_w8a8_int8.py
Lines changed: 7 additions & 4 deletions
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
82 | 82 |
| |
83 | 83 |
| |
84 | 84 |
| |
85 |
| - | |
86 |
| - | |
87 |
| - | |
| 85 | + | |
| 86 | + | |
| 87 | + | |
| 88 | + | |
| 89 | + | |
| 90 | + | |
| 91 | + | |
88 | 92 |
| |
89 | 93 |
| |
90 | 94 |
| |
| |||
138 | 142 |
| |
139 | 143 |
| |
140 | 144 |
| |
141 |
| - | |
142 | 145 |
| |
143 | 146 |
| |
144 | 147 |
| |
|
Lines changed: 4 additions & 1 deletion
Original file line number | Diff line number | Diff line change | |
---|---|---|---|
| |||
211 | 211 |
| |
212 | 212 |
| |
213 | 213 |
| |
| 214 | + | |
| 215 | + | |
| 216 | + | |
214 | 217 |
| |
215 | 218 |
| |
216 | 219 |
| |
217 | 220 |
| |
218 | 221 |
| |
219 | 222 |
| |
220 |
| - | |
| 223 | + | |
221 | 224 |
| |
222 | 225 |
| |
223 | 226 |
| |
|
0 commit comments