Open
Description
Thanks for sharing. I am confused about the `update_weights` function in `agents/base.py`, around line 480:

`value_prefix_loss += kl_loss(value_prefixes, target_value_prefixes_support[:, step_i]) * mask`

`policy_loss_i = kl_loss(policies, target_policies[:, step_i + 1]) * mask`

The value-prefix target is indexed by `step_i`, but the policy target is indexed by `step_i + 1`. Why do the two targets use different indices? Thanks very much.
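To make the question concrete, here is a minimal sketch of the indexing pattern as I understand it. The shapes are my assumption, not taken from the repository: I am guessing that the policy targets hold one entry per state (`unroll_steps + 1` entries, with index 0 for the initial state) while the value-prefix targets hold one entry per transition (`unroll_steps` entries). The names `unroll_steps`, `batch_size`, `num_actions`, and `target_index_pairs` are hypothetical. If that layout is right, it might explain the offset, but I have not confirmed it.

```python
import numpy as np

# Hypothetical sizes, chosen only for illustration.
unroll_steps = 3
batch_size = 2
num_actions = 4

# Assumed layout (not confirmed from the repository):
# one policy target per state s_0..s_K, one value-prefix target per transition.
target_policies = np.zeros((batch_size, unroll_steps + 1, num_actions))
target_value_prefixes = np.zeros((batch_size, unroll_steps))

# Mark each policy target with the index of the state it belongs to.
for state_idx in range(unroll_steps + 1):
    target_policies[:, state_idx, :] = state_idx


def target_index_pairs(num_steps):
    """Return (value_prefix_index, policy_index) used at each unroll step."""
    return [(step_i, step_i + 1) for step_i in range(num_steps)]


pairs = target_index_pairs(unroll_steps)
for prefix_idx, policy_idx in pairs:
    # Both slices must stay inside their (assumed) target arrays.
    prefix_target = target_value_prefixes[:, prefix_idx]
    policy_target = target_policies[:, policy_idx]
    print(f"step {prefix_idx}: value-prefix target {prefix_idx}, "
          f"policy target of state {int(policy_target[0, 0])}")
```

Under this assumed layout, the policy target at index 0 would never be used inside the unroll loop, which suggests it is handled separately for the initial state. Is that the intended reading?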