center_rewards_coefficient always null #3099
Replies: 2 comments 1 reply
-
Hello, could you share where you found that parameter Edit: Seems to be a Reward Modeling config https://huggingface.co/docs/trl/v0.10.1/en/reward_trainer#centering-rewards , we didn't pass this config through before, that was why wandb shows it as None. |
Beta Was this translation helpful? Give feedback.
-
@NanoCode012 I've created the PR as requested PR: Center rewards coefficient #3124 This implements all the requirements you mentioned: The PR includes documentation, example configs, and unit tests. Ready for review! |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
When training a reward model in the past, I used the center_rewards_coefficient parameter in my config.
However, now when I add it to my config file, I see in the wandb run config that it’s always set to null.
Example snippet from my config:
center_rewards_coefficient: 0.01
Expected behavior:
center_rewards_coefficient should be respected and appear with the correct value in the wandb config.
Actual behavior:
It is always logged as null in wandb, and I’m not sure if it’s being used at all during training.
Axolotl version:
0.12.2
Command used:
accelerate launch -m axolotl.cli.train /tmp/config.yaml
Beta Was this translation helpful? Give feedback.
All reactions