Commit 8614cd3
[None][fix] fix: resolve GPU memory imbalance in concurrent weight loading (#6472)
Signed-off-by: Necofish <[email protected]>
Signed-off-by: Nekofish-L <[email protected]>
Signed-off-by: Jie Li <[email protected]>
Co-authored-by: Jie Li <[email protected]>1 parent e2891a6 commit 8614cd3
File tree
3 files changed
+22
-3
lines changed- examples/llm-api
- tensorrt_llm
- _torch/models
3 files changed
+22
-3
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
77 | 77 | | |
78 | 78 | | |
79 | 79 | | |
| 80 | + | |
| 81 | + | |
| 82 | + | |
| 83 | + | |
| 84 | + | |
80 | 85 | | |
81 | 86 | | |
82 | 87 | | |
| |||
288 | 293 | | |
289 | 294 | | |
290 | 295 | | |
| 296 | + | |
291 | 297 | | |
292 | 298 | | |
293 | 299 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
12 | 12 | | |
13 | 13 | | |
14 | 14 | | |
| 15 | + | |
15 | 16 | | |
16 | 17 | | |
17 | 18 | | |
| |||
852 | 853 | | |
853 | 854 | | |
854 | 855 | | |
| 856 | + | |
855 | 857 | | |
856 | 858 | | |
| 859 | + | |
857 | 860 | | |
858 | 861 | | |
859 | 862 | | |
| |||
931 | 934 | | |
932 | 935 | | |
933 | 936 | | |
934 | | - | |
| 937 | + | |
935 | 938 | | |
936 | 939 | | |
937 | 940 | | |
| |||
977 | 980 | | |
978 | 981 | | |
979 | 982 | | |
| 983 | + | |
980 | 984 | | |
981 | 985 | | |
| 986 | + | |
982 | 987 | | |
983 | 988 | | |
984 | 989 | | |
| |||
1034 | 1039 | | |
1035 | 1040 | | |
1036 | 1041 | | |
1037 | | - | |
| 1042 | + | |
1038 | 1043 | | |
1039 | 1044 | | |
1040 | 1045 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
561 | 561 | | |
562 | 562 | | |
563 | 563 | | |
564 | | - | |
| 564 | + | |
| 565 | + | |
| 566 | + | |
| 567 | + | |
| 568 | + | |
| 569 | + | |
| 570 | + | |
| 571 | + | |
| 572 | + | |
565 | 573 | | |
566 | 574 | | |
567 | 575 | | |
| |||
0 commit comments