[update] Updated RoPE Configuration for HF Models (transformers) w. backward-compatible support for vLLM #690 #703
Closed
devpatelio wants to merge 39 commits into NovaSky-AI:main from devpatelio:devpatel/skyrl-rope-support
Commits
e7d9d44 done (devpatelio)
38312c2 rename params to parameters for rope for parity (devpatelio)
f4ec82d Update skyrl-train/skyrl_train/utils/trainer_utils.py (devpatelio)
45034fb support old param use (devpatelio)
ac6d2fc doc updates and remove excess comments (devpatelio)
5dc16a0 remove comments (devpatelio)
8954231 remove comments (devpatelio)
a7105cf Apply suggestions from code review (devpatelio)
3f6512a done (devpatelio)
4c39a7e some changes (devpatelio)
270e0f7 merge changes (devpatelio)
8c1dd19 fixes for rope config (devpatelio)
8623973 Pass generator, not trainer rope configuration (they're the same by d… (devpatelio)
9f8b08b better user logging for clear rope behaviour (devpatelio)
3c5884e linter (devpatelio)
210609d update gitignore (devpatelio)
ee0259e Apply suggestions from code review (SumanthRH)
eb1c0f2 Update skyrl-train/skyrl_train/entrypoints/main_base.py (devpatelio)
536e5ef Update skyrl-train/skyrl_train/utils/trainer_utils.py (devpatelio)
fd18186 Update skyrl-train/skyrl_train/utils/trainer_utils.py (devpatelio)
32250c6 Update skyrl-train/skyrl_train/utils/trainer_utils.py (devpatelio)
22bed1c Update skyrl-train/skyrl_train/utils/trainer_utils.py (devpatelio)
62228b9 add test to model wrapper (devpatelio)
eaedbf6 linter (devpatelio)
15ab029 goarugh (devpatelio)
cbd75bf done (devpatelio)
a34c01d return empty dict (devpatelio)
0b8a1b9 done (devpatelio)
b62f97a Merge branch 'main' into devpatel/skyrl-rope-support (devpatelio)
63aeb06 some changes (devpatelio)
82dd877 rm stepweise training (devpatelio)
bb9774e revert gsm8k (devpatelio)
a3963a3 revert gsm8k (devpatelio)
52a8959 piped rope config to critic model calls (devpatelio)
7ad5745 Update skyrl-train/skyrl_train/utils/trainer_utils.py (devpatelio)
585c1fd change base config (devpatelio)
0ccb00a updated docs (devpatelio)
f418a44 Merge branch 'main' of https://github.com/erictang000/SkyRL into dev_… (erictang000)
0d4131c x (erictang000)
This conditional expression to create `rope_scaling_dict` can be simplified. `OmegaConf.to_container` handles non-`DictConfig` inputs correctly by returning them as-is. You can simplify this to an unconditional call, which also makes it more consistent with how `rope_parameters_new` is handled later in the function.
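
For illustration, here is a minimal sketch of the normalization step this comment is about. The helper name `rope_config_to_dict` and the `None` to empty-dict convention are assumptions, not code from the PR. The sketch deliberately does not rely on `OmegaConf.to_container` passing plain dicts through: it checks `OmegaConf.is_dict` first, because OmegaConf releases I am aware of raise an error for inputs that are not config objects. It may be worth verifying that behaviour against the installed version before dropping the conditional.

```python
from collections.abc import Mapping
from typing import Any, Optional

try:
    from omegaconf import OmegaConf
except ImportError:  # the sketch still runs on plain dicts without omegaconf
    OmegaConf = None


def rope_config_to_dict(rope_cfg: Optional[Any]) -> dict:
    """Hypothetical helper: turn a RoPE config value into a plain dict.

    Accepts None, an OmegaConf DictConfig, or a regular mapping.
    """
    if rope_cfg is None:
        # Assumed convention: no RoPE override means an empty dict.
        return {}
    if OmegaConf is not None and OmegaConf.is_dict(rope_cfg):
        # DictConfig -> plain Python dict, resolving any interpolations.
        return dict(OmegaConf.to_container(rope_cfg, resolve=True))
    if isinstance(rope_cfg, Mapping):
        # Already a plain mapping: copy so the caller's object is not mutated.
        return dict(rope_cfg)
    raise TypeError(f"Unsupported RoPE config type: {type(rope_cfg).__name__}")
```

Under these assumptions, both `rope_config_to_dict({"rope_type": "linear", "factor": 2.0})` and the same call on an equivalent `DictConfig` return a plain dict, so downstream code does not need its own type checks.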