Commit fe76c17
deepseek && qwen tp performance tuning (#934)
Co-authored-by: wangzaijun <wzjhelloworld@qq.com>
Co-authored-by: sufubao <47234901+sufubao@users.noreply.github.com>
Co-authored-by: sufubao <sufubao@sensetime.com>
Co-authored-by: none <none>
Co-authored-by: hiworldwzj <30762946+hiworldwzj@users.noreply.github.com>1 parent 94ce9fe commit fe76c17
File tree
76 files changed
+995
-357
lines changed- lightllm
- common
- all_kernel_configs
- deepseek_v3_rotary_emb_kernel
- grouped_moe_gemm_kernel
- moe_silu_and_mul_kernel
- basemodel
- layer_infer
- layer_weights/meta_weights
- fused_moe
- quantization
- triton_quant
- fp8
- models
- deepseek2
- layer_infer
- layer_weights
- triton_kernel
- llama/layer_infer
- qwen3_moe/layer_weights
- server
- utils
- test/kernel
- unit_tests/common/fused_moe
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
76 files changed
+995
-357
lines changedLines changed: 1 addition & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
Lines changed: 1 addition & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
Lines changed: 1 addition & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
Lines changed: 1 addition & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
Lines changed: 1 addition & 0 deletions
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
0 commit comments