-
-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Fix: try aligning dtype of matrixes when training with deepspeed and mixed-precision is set to bf16 or fp16 #2060
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 6 commits
Commits
Show all changes
10 commits
Select commit
Hold shift + click to select a range
7c61c0d
Add autocast warpper for forward functions in deepspeed_utils.py to t…
sharlynxy d33d5ec
#
sharlynxy 7f984f4
#
sharlynxy c8af252
refactor
f501209
Merge branch 'dev/xy/align_dtype_using_mixed_precision' of github.com…
0d9da0e
Merge pull request #1 from saibit-tech/dev/xy/align_dtype_using_mixed…
sharlynxy adb775c
Update: requirement diffusers[torch]==0.25.0
sharlynxy abf2c44
Dynamically set device in deepspeed wrapper (#2)
sharlynxy 46ad3be
update deepspeed wrapper
sharlynxy 1684aba
remove deepspeed from requirements.txt
sharlynxy File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.