-
Notifications
You must be signed in to change notification settings - Fork 2.1k
Open
Labels
questionFurther information is requestedFurther information is requested
Description
❓ Question
Hi, I was looking into the code inside the SAC algorithm, and I encountered an odd line here.
When comparing to make the update of the target, should not be using self._n_updates, or something similar instead gradient_step? such variable is always zero, hence, is always entering in the condition?.
There is maybe any specific reason to do it this way?
Thanks!
Checklist
- I have checked that there is no similar issue in the repo
- I have read the documentation
- If code there is, it is minimal and working
- If code there is, it is formatted using the markdown code blocks for both code and stack traces.
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
questionFurther information is requestedFurther information is requested