Commit 5b35eae
authored
Add ExpertParallel Mixture-of-Experts Plugin (#99)
* initial commit
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* include prepare_scattermoe
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* fixes and add scenarios-moe. Allow gradient_accum=null mode
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* missed out on CONTENTS.yaml
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* update readme, code cleanup, add comments and initial bench
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* more cleanup and update pf bench
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* add more comments and minor refactoring
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* finish up comments
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* add padding free to granite moe
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* fmt and lint.
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* install workflow + more fmt + fix test
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* go back to dtensors for sharded checkpoints
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* add scattermoe checkpoint restorer utility
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* fmt + lint
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* more cleanup
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* improved documention on state dict inferernce
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* add more test on inferring checkpoint metadat
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* update configs for mixtral
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* update granite configs
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* fix readme and update GraniteMoE to FOAK
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
* commit benches
Signed-off-by: Yu Chin Fabian Lim <[email protected]>
---------
Signed-off-by: Yu Chin Fabian Lim <[email protected]>1 parent d767e33 commit 5b35eae
File tree
52 files changed
+4658
-12
lines changed- .github/workflows
- plugins
- accelerated-moe
- configs
- src/fms_acceleration_moe
- utils
- scattermoe_utils
- megablocks
- kernels
- tests
- framework/src/fms_acceleration
- fused-ops-and-kernels/src/fms_acceleration_foak
- models
- sample-configurations
- scripts
- benchmarks
- refs
Some content is hidden
Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
52 files changed
+4658
-12
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
15 | 15 | | |
16 | 16 | | |
17 | 17 | | |
| 18 | + | |
18 | 19 | | |
19 | 20 | | |
20 | 21 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
30 | 30 | | |
31 | 31 | | |
32 | 32 | | |
| 33 | + | |
33 | 34 | | |
34 | 35 | | |
35 | 36 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
34 | 34 | | |
35 | 35 | | |
36 | 36 | | |
37 | | - | |
| 37 | + | |
38 | 38 | | |
39 | 39 | | |
40 | 40 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
| 1 | + | |
| 2 | + | |
| 3 | + | |
| 4 | + | |
| 5 | + | |
| 6 | + | |
| 7 | + | |
| 8 | + | |
| 9 | + | |
| 10 | + | |
0 commit comments