Conversation

@devpatelio (Collaborator)

HuggingFace's transformers library has an updated RoPE configuration scheme which removes rope_scaling and rope_theta and replaces them with a single rope_parameters configuration.

We updated the RoPE config by first updating the YAML config to support the new rope_parameters template, and then updating all trainer utils and call sites to use the new config. For the vLLM endpoint, we temporarily translate the updated YAML config back into the format vLLM expects (separate rope_scaling and rope_theta), since vLLM hasn't adopted the new scheme yet. This shim will be removed once vLLM is updated. See the vLLM docs for more info.
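For illustration, that translation back to vLLM's legacy format could look roughly like the sketch below. This is not the PR's actual code: the function name and the assumption that rope_theta is stored inside the rope_parameters mapping are mine.

    from typing import Optional, Tuple

    def split_rope_parameters_for_vllm(
        rope_parameters: Optional[dict],
    ) -> Tuple[Optional[dict], Optional[float]]:
        # Fold the unified rope_parameters mapping back into the legacy
        # (rope_scaling, rope_theta) pair that vLLM still expects.
        if not rope_parameters:
            return None, None
        rope_theta = rope_parameters.get("rope_theta")
        rope_scaling = {k: v for k, v in rope_parameters.items() if k != "rope_theta"}
        return (rope_scaling or None), rope_theta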

@devpatelio (Collaborator, Author)

/gemini review

gemini-code-assist[bot]

This comment was marked as resolved.

…efault but allows for override)

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
gemini-code-assist[bot]

This comment was marked as resolved.

@devpatelio (Collaborator, Author)

/gemini review

@gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request effectively updates the RoPE configuration to use the new rope_parameters scheme from the HuggingFace transformers library, replacing the deprecated rope_scaling and rope_theta. The changes are consistently applied across documentation, configuration files, and model loading logic. I appreciate the backward-compatible support for vLLM, which is a thoughtful addition.

I've identified a potential issue where a DictConfig object might be passed to vLLM instead of a standard Python dict, which could lead to runtime errors. I've also suggested a couple of minor improvements to make the configuration handling more robust by ensuring OmegaConf interpolations are resolved. Overall, this is a solid update.
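The DictConfig concern boils down to making sure vLLM only ever receives a resolved, plain Python dict. A minimal sketch of that conversion with OmegaConf (an assumed helper, not the PR's code):

    from omegaconf import DictConfig, OmegaConf

    def to_plain_dict(cfg):
        # Resolve interpolations and convert to a built-in dict before handing
        # the RoPE settings to vLLM; plain dicts pass through untouched.
        if isinstance(cfg, DictConfig):
            return OmegaConf.to_container(cfg, resolve=True)
        return cfg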

@devpatelio requested a review from SumanthRH (November 25, 2025, 03:25)
@SumanthRH (Member) left a comment

Can you add a test for this?

You can add it to skyrl-train/tests/gpu/gpu_ci/test_model_wrapper.py to test that the RoPE parameters are being set properly.

Also, have you done any E2E test for this? A good way would be to override the skyrl config in a script and confirm that the parameters are propagated all the way to the AutoModel init.
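For reference, the requested test could look something like the sketch below. The get_actor_model helper, the model name, and the rope_parameters attribute on the resulting HF config are all assumptions standing in for the repo's actual model wrapper API:

    def test_rope_parameters_are_applied():
        rope_parameters = {"rope_type": "yarn", "factor": 2.0, "rope_theta": 1_000_000.0}
        # get_actor_model is a hypothetical stand-in for the repo's model wrapper.
        model = get_actor_model(
            "Qwen/Qwen2.5-0.5B-Instruct",
            rope_parameters=rope_parameters,
        )
        hf_config = model.config
        assert hf_config.rope_parameters["rope_type"] == "yarn"
        assert hf_config.rope_parameters["factor"] == 2.0
        assert hf_config.rope_parameters["rope_theta"] == 1_000_000.0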

SumanthRH and others added 7 commits November 24, 2025 19:55
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
@devpatelio (Collaborator, Author)

/gemini review

gemini-code-assist[bot]

This comment was marked as outdated.

@erictang000 (Collaborator) left a comment

Assuming you've run something like the basic gsm8k example end-to-end and everything still works?

Other than that, LGTM apart from one nit.

# factor: 1.0
# original_max_position_embeddings: 32768

step_wise_training: false
Collaborator

Can remove this for now, since the step-wise training PR was reverted.

Collaborator Author

Fixed, and E2E is working (I tried it on a bunch of different RoPE configuration runs).

@devpatelio (Collaborator, Author)

/gemini review

@gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request effectively updates the RoPE configuration to align with the new rope_parameters scheme from HuggingFace's transformers, while maintaining backward compatibility for vLLM and older configurations. The updates to documentation, YAML files, and the addition of a dedicated test case are well-executed.

However, there is a critical oversight: the RoPE configuration is not applied to the critic model. The get_llm_for_sequence_regression function, used to create the critic, was not updated to handle rope_parameters. This means the critic will be created without the specified RoPE settings, which could negatively impact training. To resolve this, get_llm_for_sequence_regression should be updated to accept and apply rope_parameters to the model's configuration, and the critic worker initializations in deepspeed_worker.py and fsdp_worker.py must be updated to pass this configuration.
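For concreteness, the fix being asked for might look roughly like the following. The helper name is mine, the exact signature of get_llm_for_sequence_regression in this repo is assumed, and the rope_parameters attribute follows the new transformers scheme:

    from typing import Optional
    from transformers import PretrainedConfig

    def apply_rope_parameters(config: PretrainedConfig, rope_parameters: Optional[dict]) -> PretrainedConfig:
        # Mirror the actor-side behavior so the critic sees the same RoPE settings.
        if rope_parameters:
            config.rope_parameters = dict(rope_parameters)
        return config

    # Inside get_llm_for_sequence_regression (and the critic inits in
    # deepspeed_worker.py / fsdp_worker.py) the call might look like:
    #     config = AutoConfig.from_pretrained(model_name_or_path, trust_remote_code=True)
    #     config = apply_rope_parameters(config, rope_parameters)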

@erictang000 (Collaborator) left a comment

Gemini makes a good point: plumb it through to the critic. Should not be too much more code?

@devpatelio (Collaborator, Author)

/gemini review

@gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request effectively modernizes the RoPE configuration by adopting the new rope_parameters standard from the transformers library, while thoughtfully maintaining backward compatibility for vLLM. The changes are well-organized, touching documentation, configuration files, and application logic consistently. A new utility function centralizes the migration logic from the deprecated rope_scaling and rope_theta parameters, which is a great approach. The addition of unit tests for the new model wrapper logic is also a valuable contribution. I've identified one potential issue in the new configuration handling logic that could lead to a runtime error with a malformed config. My suggestion addresses this to make the implementation more robust.
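As a rough illustration (not the PR's implementation), a migration helper of that kind could fold the deprecated fields into the unified mapping like so:

    from typing import Optional

    def migrate_legacy_rope(
        rope_scaling: Optional[dict] = None,
        rope_theta: Optional[float] = None,
    ) -> Optional[dict]:
        # Fold the deprecated rope_scaling/rope_theta pair into a single
        # rope_parameters-style mapping; return None when nothing was set.
        rope_parameters = dict(rope_scaling or {})
        if rope_theta is not None:
            rope_parameters["rope_theta"] = rope_theta
        return rope_parameters or None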

Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
@devpatelio (Collaborator, Author)

/gemini review

@gemini-code-assist bot (Contributor) left a comment

Code Review

This pull request effectively updates the RoPE configuration to align with the newer rope_parameters scheme from HuggingFace's transformers library, deprecating the old rope_scaling and rope_theta fields. The changes are consistently applied across documentation, configuration files, model wrappers, and worker initializations. I appreciate the backward-compatible support for vLLM, which is handled cleanly by reconstructing the old configuration format. The addition of unit tests to verify the new configuration logic is also a great touch. I have a couple of minor suggestions in trainer_utils.py to enhance code clarity and maintainability. Overall, this is a well-executed and thorough update.

Comment on lines +680 to +684
rope_scaling_dict = (
    OmegaConf.to_container(rope_scaling, resolve=True)
    if isinstance(rope_scaling, DictConfig)
    else rope_scaling
)
Contributor

medium

This conditional expression to create rope_scaling_dict can be simplified. OmegaConf.to_container handles non-DictConfig inputs correctly by returning them as-is. You can simplify this to an unconditional call, which also makes it more consistent with how rope_parameters_new is handled later in the function.

            rope_scaling_dict = OmegaConf.to_container(rope_scaling, resolve=True)

Comment on lines +697 to +699
if new_params is not None:
    logger.warning(f"Ignoring 'rope_parameters' as it is not a dictionary. Found: {new_params}")
    return {}
Contributor

medium

The if new_params is not None: check is redundant. The has_new_config check on line 669 already ensures that rope_parameters_new is not None, and OmegaConf.to_container will not return None for a non-None input. This block can be simplified by removing the conditional check.

        logger.warning(f"Ignoring 'rope_parameters' as it is not a dictionary. Found: {new_params}")
        return {}

@SumanthRH (Member)

@devpatelio can you resolve conflicts with main?
