Commit 674fec3
[main] feat(moe): Support gated delta net for Qwen3-Next (1/4) (NVIDIA#1989)
Signed-off-by: oliver könig <[email protected]>
Co-authored-by: oliver könig <[email protected]>
1 parent 71bb0fd commit 674fec3
File tree
21 files changed, +2463 -54 lines changed
- megatron
  - core
    - models/gpt
    - ssm
    - transformer
  - training
- tests
  - functional_tests
    - shell_test_utils
    - test_cases/gpt/gpt3_mcore_te_tp2_pp1_gdn
  - test_utils/recipes
  - unit_tests
    - models
    - post_training
    - ssm