Skip to content

SAC with PER #82

@xwen24

Description

@xwen24

Hello!

Thanks so much for sharing the codes!

I just have a question about sac_discrete_per.py: why the importance-sampling weights are not used to update the Q-function and the policy function?

weight_update = [min(l1.item(), l2.item()) for l1, l2 in zip(q_value_loss1, q_value_loss2)]

I have seen other SAC-PER repositories that used the weights to calculate the q_value_loss and policy_loss.

Thanks.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions