Commit 5a5427f

kaiyux, qiaoxj07, dongxuy04, syuoni, and juney-nvidia authored
blog: Scaling Expert Parallelism in TensorRT-LLM (Part 1: Design and Implementation of Large-scale EP) (NVIDIA#4958)
Signed-off-by: juney-nvidia <[email protected]>
Co-authored-by: Xianjie <[email protected]>
Co-authored-by: Dongxu Yang <[email protected]>
Co-authored-by: Enwei Zhu <[email protected]>
Co-authored-by: Jun Yang <[email protected]>
1 parent 180b91f commit 5a5427f

29 files changed, 772 additions and 51 deletions
