Skip to content

[Question] A question about the cost function of the p3o algorithm #358

@Liqinyan821

Description

@Liqinyan821

Required prerequisites

Questions

Hello Omnisafe team, thank you very much for your contribution.
When I was Learning the p3o algorithm, I found that the def _loss_pi_cost function was not clip, and loss_pi_cost in the P3O Optimization for Safe Reinforcement Learning used clip.
87bf7a541d27ee53fc4f1bcdfa47bd81
324c76d3af56db011f799976ca22c297

Metadata

Metadata

Assignees

No one assigned

    Labels

    questionFurther information is requested

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions