Commit bb69e9a
ONNX: Fix FP8 quantization for the second MLP in LayernormMLP
1 parent 08dc786 commit bb69e9a
1 file changed
+12
-3
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
2243 | 2243 | | |
2244 | 2244 | | |
2245 | 2245 | | |
| 2246 | + | |
| 2247 | + | |
2246 | 2248 | | |
2247 | 2249 | | |
2248 | 2250 | | |
| 2251 | + | |
| 2252 | + | |
| 2253 | + | |
| 2254 | + | |
2249 | 2255 | | |
2250 | 2256 | | |
2251 | | - | |
2252 | | - | |
| 2257 | + | |
| 2258 | + | |
| 2259 | + | |
| 2260 | + | |
2253 | 2261 | | |
| 2262 | + | |
2254 | 2263 | | |
2255 | 2264 | | |
2256 | 2265 | | |
| |||
2324 | 2333 | | |
2325 | 2334 | | |
2326 | 2335 | | |
2327 | | - | |
| 2336 | + | |
2328 | 2337 | | |
2329 | 2338 | | |
2330 | 2339 | | |
| |||
0 commit comments