Commit 6fe89ea
authored
[TRTLLM-9819][perf] Reuse alltoall workspace for CuteDSL MoE output (#9840)
Signed-off-by: Enwei Zhu <[email protected]>1 parent 0b279f4 commit 6fe89ea
File tree
7 files changed
+98
-54
lines changed- cpp/tensorrt_llm/thop
- tensorrt_llm/_torch
- compilation
- modules/fused_moe
7 files changed
+98
-54
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
205 | 205 | | |
206 | 206 | | |
207 | 207 | | |
208 | | - | |
209 | | - | |
| 208 | + | |
| 209 | + | |
210 | 210 | | |
211 | 211 | | |
212 | 212 | | |
213 | 213 | | |
| 214 | + | |
| 215 | + | |
| 216 | + | |
| 217 | + | |
214 | 218 | | |
215 | | - | |
| 219 | + | |
| 220 | + | |
216 | 221 | | |
217 | 222 | | |
218 | 223 | | |
219 | 224 | | |
220 | | - | |
221 | 225 | | |
222 | 226 | | |
223 | 227 | | |
224 | | - | |
225 | | - | |
226 | 228 | | |
227 | 229 | | |
228 | 230 | | |
| |||
253 | 255 | | |
254 | 256 | | |
255 | 257 | | |
| 258 | + | |
256 | 259 | | |
| 260 | + | |
| 261 | + | |
| 262 | + | |
| 263 | + | |
| 264 | + | |
| 265 | + | |
| 266 | + | |
| 267 | + | |
| 268 | + | |
| 269 | + | |
| 270 | + | |
257 | 271 | | |
258 | 272 | | |
259 | 273 | | |
| |||
489 | 503 | | |
490 | 504 | | |
491 | 505 | | |
| 506 | + | |
| 507 | + | |
| 508 | + | |
492 | 509 | | |
493 | 510 | | |
494 | 511 | | |
| |||
510 | 527 | | |
511 | 528 | | |
512 | 529 | | |
| 530 | + | |
513 | 531 | | |
514 | 532 | | |
515 | 533 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
77 | 77 | | |
78 | 78 | | |
79 | 79 | | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
80 | 83 | | |
81 | 84 | | |
82 | 85 | | |
| |||
Lines changed: 12 additions & 14 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
991 | 991 | | |
992 | 992 | | |
993 | 993 | | |
| 994 | + | |
| 995 | + | |
| 996 | + | |
| 997 | + | |
| 998 | + | |
994 | 999 | | |
995 | 1000 | | |
996 | | - | |
997 | | - | |
998 | 1001 | | |
999 | 1002 | | |
1000 | 1003 | | |
1001 | 1004 | | |
1002 | | - | |
1003 | | - | |
1004 | | - | |
1005 | | - | |
1006 | | - | |
1007 | | - | |
1008 | | - | |
1009 | | - | |
1010 | | - | |
1011 | 1005 | | |
1012 | 1006 | | |
1013 | 1007 | | |
| |||
1022 | 1016 | | |
1023 | 1017 | | |
1024 | 1018 | | |
1025 | | - | |
1026 | 1019 | | |
1027 | 1020 | | |
1028 | 1021 | | |
| |||
1096 | 1089 | | |
1097 | 1090 | | |
1098 | 1091 | | |
1099 | | - | |
| 1092 | + | |
1100 | 1093 | | |
1101 | 1094 | | |
1102 | 1095 | | |
1103 | 1096 | | |
1104 | 1097 | | |
1105 | 1098 | | |
| 1099 | + | |
| 1100 | + | |
| 1101 | + | |
| 1102 | + | |
| 1103 | + | |
1106 | 1104 | | |
1107 | 1105 | | |
1108 | 1106 | | |
| |||
1123 | 1121 | | |
1124 | 1122 | | |
1125 | 1123 | | |
1126 | | - | |
| 1124 | + | |
1127 | 1125 | | |
1128 | 1126 | | |
1129 | 1127 | | |
| |||
Lines changed: 48 additions & 34 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
210 | 210 | | |
211 | 211 | | |
212 | 212 | | |
| 213 | + | |
| 214 | + | |
| 215 | + | |
213 | 216 | | |
214 | 217 | | |
215 | 218 | | |
| |||
258 | 261 | | |
259 | 262 | | |
260 | 263 | | |
| 264 | + | |
261 | 265 | | |
262 | 266 | | |
263 | 267 | | |
| |||
274 | 278 | | |
275 | 279 | | |
276 | 280 | | |
| 281 | + | |
| 282 | + | |
| 283 | + | |
| 284 | + | |
| 285 | + | |
| 286 | + | |
| 287 | + | |
| 288 | + | |
| 289 | + | |
| 290 | + | |
277 | 291 | | |
278 | 292 | | |
279 | 293 | | |
| |||
291 | 305 | | |
292 | 306 | | |
293 | 307 | | |
| 308 | + | |
294 | 309 | | |
295 | | - | |
296 | | - | |
297 | | - | |
298 | 310 | | |
299 | | - | |
| 311 | + | |
300 | 312 | | |
301 | 313 | | |
302 | 314 | | |
| |||
313 | 325 | | |
314 | 326 | | |
315 | 327 | | |
316 | | - | |
| 328 | + | |
317 | 329 | | |
318 | 330 | | |
319 | 331 | | |
| |||
326 | 338 | | |
327 | 339 | | |
328 | 340 | | |
329 | | - | |
330 | 341 | | |
331 | 342 | | |
332 | 343 | | |
| |||
344 | 355 | | |
345 | 356 | | |
346 | 357 | | |
347 | | - | |
| 358 | + | |
348 | 359 | | |
| 360 | + | |
349 | 361 | | |
350 | 362 | | |
351 | 363 | | |
352 | | - | |
| 364 | + | |
353 | 365 | | |
354 | 366 | | |
355 | 367 | | |
| |||
364 | 376 | | |
365 | 377 | | |
366 | 378 | | |
367 | | - | |
368 | | - | |
369 | | - | |
370 | | - | |
371 | | - | |
372 | | - | |
| 379 | + | |
| 380 | + | |
| 381 | + | |
| 382 | + | |
| 383 | + | |
| 384 | + | |
373 | 385 | | |
374 | 386 | | |
375 | 387 | | |
| |||
388 | 400 | | |
389 | 401 | | |
390 | 402 | | |
391 | | - | |
392 | | - | |
393 | | - | |
394 | | - | |
| 403 | + | |
| 404 | + | |
| 405 | + | |
395 | 406 | | |
396 | | - | |
| 407 | + | |
397 | 408 | | |
398 | | - | |
| 409 | + | |
399 | 410 | | |
400 | | - | |
401 | | - | |
402 | | - | |
403 | | - | |
| 411 | + | |
| 412 | + | |
| 413 | + | |
| 414 | + | |
404 | 415 | | |
405 | | - | |
| 416 | + | |
406 | 417 | | |
407 | | - | |
| 418 | + | |
408 | 419 | | |
409 | | - | |
410 | | - | |
| 420 | + | |
| 421 | + | |
411 | 422 | | |
412 | 423 | | |
413 | | - | |
414 | | - | |
| 424 | + | |
| 425 | + | |
415 | 426 | | |
416 | | - | |
| 427 | + | |
417 | 428 | | |
418 | | - | |
419 | | - | |
| 429 | + | |
| 430 | + | |
420 | 431 | | |
421 | 432 | | |
422 | 433 | | |
| |||
425 | 436 | | |
426 | 437 | | |
427 | 438 | | |
428 | | - | |
| 439 | + | |
429 | 440 | | |
430 | 441 | | |
431 | 442 | | |
432 | 443 | | |
433 | 444 | | |
434 | 445 | | |
435 | 446 | | |
| 447 | + | |
436 | 448 | | |
437 | 449 | | |
438 | 450 | | |
| |||
448 | 460 | | |
449 | 461 | | |
450 | 462 | | |
| 463 | + | |
451 | 464 | | |
452 | 465 | | |
453 | 466 | | |
| |||
459 | 472 | | |
460 | 473 | | |
461 | 474 | | |
| 475 | + | |
462 | 476 | | |
463 | 477 | | |
464 | 478 | | |
| |||
Lines changed: 3 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
389 | 389 | | |
390 | 390 | | |
391 | 391 | | |
| 392 | + | |
| 393 | + | |
| 394 | + | |
392 | 395 | | |
393 | 396 | | |
394 | 397 | | |
| |||
Lines changed: 3 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
354 | 354 | | |
355 | 355 | | |
356 | 356 | | |
| 357 | + | |
| 358 | + | |
| 359 | + | |
357 | 360 | | |
358 | 361 | | |
359 | 362 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
723 | 723 | | |
724 | 724 | | |
725 | 725 | | |
| 726 | + | |
| 727 | + | |
| 728 | + | |
| 729 | + | |
| 730 | + | |
726 | 731 | | |
727 | 732 | | |
728 | 733 | | |
| |||
0 commit comments