Scores are always negative when using EditRewardInferencer with reward_dim="overall_detail" #19

@tls0523

Hi, thank you for releasing this great project 🙏
I’m trying to use EditRewardInferencer for single-image inference, but I noticed that the returned reward scores are always negative, and I’m not sure whether this is expected behavior or a potential issue in my usage.

My setup
I initialize the inferencer as follows:
def __init__(
    self,
    device="cuda",
    config_path="EditReward-Qwen2.5-7B-VL.yaml",
    checkpoint_path="EditReward-Qwen2.5-VL-7B",
    reward_dim="overall_detail",
    rm_head_type="ranknet_multi_head",
):
    self.inferencer = EditRewardInferencer(
        config_path=config_path,
        checkpoint_path=checkpoint_path,
        device=device,
        reward_dim=reward_dim,
        rm_head_type=rm_head_type,
    )

During inference, I obtain the score via:
score = rewards[0][0].item()

Issue
No matter what inputs I use, the resulting score is always negative.
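
To make the "always negative" observation concrete, here is a minimal sketch of how scores collected over several inputs could be summarized. The helper name `summarize_scores` and the example values are hypothetical placeholders, not real outputs of the model; in practice each value would be the `rewards[0][0].item()` result for one input.

```python
from statistics import mean


def summarize_scores(scores):
    """Return basic statistics for a list of float reward scores."""
    if not scores:
        raise ValueError("scores must not be empty")
    return {
        "min": min(scores),
        "max": max(scores),
        "mean": mean(scores),
        # True only if every collected score is below zero.
        "all_negative": all(s < 0 for s in scores),
        # Range of the scores; a non-zero spread means inputs are still ranked.
        "spread": max(scores) - min(scores),
    }


# Hypothetical values standing in for rewards[0][0].item() over several inputs.
example_scores = [-2.5, -0.5, -1.5]
summary = summarize_scores(example_scores)
print(summary)
```

If the spread is clearly non-zero, the model is still separating inputs even though every score is below zero.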

Questions
Is it expected that the output reward score for reward_dim="overall_detail" is always negative?
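
For context on why the sign may not matter: assuming (this is not confirmed by the project) that `rm_head_type="ranknet_multi_head"` means the head was trained with a RankNet-style pairwise loss, the preference probability depends only on the difference between two scores. The sketch below uses hypothetical score values and a hypothetical helper, `preference_probability`, to show that adding a constant to both scores leaves the preference unchanged, so an all-negative range could still rank edits correctly.

```python
import math


def preference_probability(score_a, score_b):
    """P(a preferred over b) under a RankNet / Bradley-Terry style model.

    Only the difference score_a - score_b enters the formula.
    """
    return 1.0 / (1.0 + math.exp(-(score_a - score_b)))


# Hypothetical scores for two edited images; both are negative.
raw = preference_probability(-0.5, -2.5)

# Shift both scores by the same constant; the difference is unchanged.
shifted = preference_probability(-0.5 + 10.0, -2.5 + 10.0)

print(raw, shifted)
```

Under that assumption, only the relative ordering of scores is meaningful, not their absolute sign.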
