-
Notifications
You must be signed in to change notification settings - Fork 3.5k
[Dev] fix(moe): Support HybridEP and reduce memory overhead for 1F1B A2A overlap #2201
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
47 commits
Select commit
Hold shift + click to select a range
12afb8b
release unused memory
lhb8125 ab40d7b
format
lhb8125 0e641d3
Merge branch 'main_tot' into hongbinl/1f1b_overlap_memory_issue
lhb8125 1219a26
renaming golden values
lhb8125 ce6e661
fix bug: accuracy issu because of recomputing and offloading same module
lhb8125 d04d741
Merge branch 'dev' into hongbinl/activation_offloading_fix
lhb8125 2fe4aeb
format
lhb8125 fb3f7c3
update golden values
lhb8125 5001e2b
Merge branch 'dev' into hongbinl/activation_offloading_fix
lhb8125 9937890
update golden values
lhb8125 6c83118
update model_config and golden values
lhb8125 33a38f5
format
lhb8125 6c76b07
update golden values
lhb8125 e8c0eb0
support hybridep+a2a overlap
lhb8125 b207de3
Merge branch 'dev' into hongbinl/1f1b_hybridep
465f497
Merge branch 'hongbinl/1f1b_overlap_memory_issue' into hongbinl/1f1b_…
lhb8125 299df02
minor fix
lhb8125 dc0cb6c
assert PP>1 for a2a overlap with MTP layers
lhb8125 32fc988
Merge branch 'dev' into hongbinl/1f1b_hybridep
lhb8125 6102cc5
Merge branch 'dev' into hongbinl/1f1b_hybridep
yanring d29b634
Merge branch 'dev' into hongbinl/1f1b_hybridep
yanring 5518940
revert the changes about memory overhead optimization
lhb8125 9dca28b
Merge branch 'hongbinl/1f1b_hybridep' of https://github.com/lhb8125/M…
lhb8125 e0e6da1
minor fix
lhb8125 04199ce
Merge branch 'dev' into hongbinl/1f1b_hybridep
lhb8125 448f035
fix back compatibility
lhb8125 e9e662b
Merge branch 'hongbinl/1f1b_hybridep' of https://github.com/lhb8125/M…
lhb8125 fe05568
format
lhb8125 b83deee
support early attn mem replease
Wohox f441acf
fix mem opt ut
Wohox 8c2e9ad
format
Wohox f1c886c
Merge pull request #48 from Wohox/pingtian/add_ep_overlap_switch_orde…
lhb8125 bc209bb
fix bugs when enabling hybridep
lhb8125 060f53d
format
lhb8125 776d224
Merge branch 'dev' into hongbinl/1f1b_hybridep
lhb8125 487eea9
remove unused try-except clause
lhb8125 c568c37
format
lhb8125 36648e3
Merge branch 'dev' into hongbinl/1f1b_hybridep
lhb8125 33d4d9c
fix comments
Wohox 3ee932b
more explanation
Wohox 61a75d2
Merge branch 'dev' into hongbinl/1f1b_hybridep
lhb8125 0708cc1
replace __del__ with explicit destructor
lhb8125 2cfaec1
format
lhb8125 0e299f6
Merge branch 'dev' into hongbinl/1f1b_hybridep
lhb8125 0f8663b
fix ut
lhb8125 97de523
Merge pull request #50 from Wohox/pingtian/fix_comments_2201
lhb8125 12a2a22
fix ut
lhb8125 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Some comments aren't visible on the classic Files Changed page.
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you add some explanation here?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Fixed in lhb8125#50, @lhb8125 can you help take a look~
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Merged, thanks!