Commit 7eee9a9
authored
doc: Update doc for Deepseek min latency (NVIDIA#3717)
* Tidy code
Signed-off-by: Zongfei Jing <[email protected]>
* Update doc for min latency deepseek
Signed-off-by: Zongfei Jing <[email protected]>
* Throw exception for RouterKernel when not running on sm90+
Signed-off-by: Zongfei Jing <[email protected]>
---------
Signed-off-by: Zongfei Jing <[email protected]>1 parent 0ae7017 commit 7eee9a9
File tree
3 files changed
+20
-10
lines changed- cpp/tensorrt_llm/kernels/trtllmGenKernels/blockScaleMoe/trtllmGenSrc
- docs/source/blogs
- tensorrt_llm/_torch/models
3 files changed
+20
-10
lines changedLines changed: 7 additions & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
698 | 698 | | |
699 | 699 | | |
700 | 700 | | |
| 701 | + | |
701 | 702 | | |
702 | 703 | | |
703 | 704 | | |
| |||
886 | 887 | | |
887 | 888 | | |
888 | 889 | | |
| 890 | + | |
| 891 | + | |
889 | 892 | | |
890 | 893 | | |
891 | 894 | | |
| |||
973 | 976 | | |
974 | 977 | | |
975 | 978 | | |
| 979 | + | |
| 980 | + | |
976 | 981 | | |
977 | 982 | | |
978 | 983 | | |
| |||
1204 | 1209 | | |
1205 | 1210 | | |
1206 | 1211 | | |
| 1212 | + | |
| 1213 | + | |
1207 | 1214 | | |
1208 | 1215 | | |
1209 | 1216 | | |
| |||
Lines changed: 9 additions & 7 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
110 | 110 | | |
111 | 111 | | |
112 | 112 | | |
| 113 | + | |
113 | 114 | | |
114 | 115 | | |
115 | 116 | | |
| |||
125 | 126 | | |
126 | 127 | | |
127 | 128 | | |
128 | | - | |
| 129 | + | |
129 | 130 | | |
130 | 131 | | |
131 | 132 | | |
| |||
147 | 148 | | |
148 | 149 | | |
149 | 150 | | |
150 | | - | |
151 | | - | |
152 | | - | |
153 | | - | |
154 | | - | |
155 | | - | |
| 151 | + | |
| 152 | + | |
| 153 | + | |
| 154 | + | |
| 155 | + | |
| 156 | + | |
| 157 | + | |
156 | 158 | | |
157 | 159 | | |
158 | 160 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
624 | 624 | | |
625 | 625 | | |
626 | 626 | | |
| 627 | + | |
| 628 | + | |
| 629 | + | |
627 | 630 | | |
628 | 631 | | |
629 | 632 | | |
| |||
650 | 653 | | |
651 | 654 | | |
652 | 655 | | |
653 | | - | |
654 | | - | |
655 | | - | |
| 656 | + | |
656 | 657 | | |
657 | 658 | | |
658 | 659 | | |
| |||
0 commit comments