Is the newly implemented GRPO supposed to be slower than PPO? #2610

@yxchng

It seems to be much slower on my side with G=8, though it uses less memory (which is expected).

Labels: ❓ question (Seeking clarification or more information), 🏋 GRPO (Related to GRPO)
