Commit 97de56d
committed
fix: Fix the input to the shared experts
I had misread that the shared experts take the inputs _before_ the standard
MoE layer and was feeding the output of the MoE to the shared experts.
Branch: GraniteMoEShared
Signed-off-by: Gabe Goodhart <[email protected]>1 parent 97df181 commit 97de56d
1 file changed
+5
-3
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
4648 | 4648 | | |
4649 | 4649 | | |
4650 | 4650 | | |
4651 | | - | |
| 4651 | + | |
4652 | 4652 | | |
4653 | 4653 | | |
4654 | 4654 | | |
| |||
4659 | 4659 | | |
4660 | 4660 | | |
4661 | 4661 | | |
4662 | | - | |
| 4662 | + | |
4663 | 4663 | | |
4664 | 4664 | | |
4665 | 4665 | | |
| |||
4671 | 4671 | | |
4672 | 4672 | | |
4673 | 4673 | | |
4674 | | - | |
| 4674 | + | |
4675 | 4675 | | |
| 4676 | + | |
| 4677 | + | |
4676 | 4678 | | |
4677 | 4679 | | |
4678 | 4680 | | |
| |||
0 commit comments