1 file changed: +3 -1 lines changed
reinforced_lib/agents/neuro
@@ -50,7 +50,9 @@ class Evosax(BaseAgent):
 on backpropagation through the value or policy network. Instead, the network parameters are evolved using
 black-box optimization. This agent is suitable for environments with both discrete and continuous action spaces.
 The user is responsible for providing appropriate network output in the correct format (e.g., discrete actions
-should be sampled from logits with ``jax.random.categorical`` inside the network definition).
+should be sampled from logits with ``jax.random.categorical`` inside the network definition). Note that
+this agent does not discount future rewards, therefore, the fitness is computed as a simple sum of rewards
+obtained during the evaluation phase.
 
 Parameters
 ----------
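
The two points made in the docstring above (sampling discrete actions from logits, and fitness as an undiscounted sum of rewards) can be illustrated with a minimal sketch. This is not the agent's actual implementation: the helper names `sample_discrete_action`, `undiscounted_fitness`, and `discounted_return` are hypothetical and exist only for this example, and the reward values are made up. Only `jax.random.categorical`, `jax.random.PRNGKey`, and `jax.numpy` are real library calls.

```python
import jax
import jax.numpy as jnp


def sample_discrete_action(key, logits):
    # Hypothetical helper: draw one action index from unnormalized logits,
    # as the docstring asks the user to do inside the network definition.
    return jax.random.categorical(key, logits)


def undiscounted_fitness(rewards):
    # Fitness as described in the docstring: a plain sum of the rewards
    # collected during evaluation, with no discount factor applied.
    return jnp.sum(rewards)


def discounted_return(rewards, gamma):
    # Shown only for contrast: the discounted return that this agent,
    # according to the docstring, does NOT compute.
    discounts = gamma ** jnp.arange(rewards.shape[0])
    return jnp.sum(discounts * rewards)


# Illustrative (made-up) inputs.
key = jax.random.PRNGKey(0)
logits = jnp.array([0.1, 2.0, -1.0])
action = sample_discrete_action(key, logits)

rewards = jnp.array([1.0, 0.0, 2.0, 1.0])
fitness = undiscounted_fitness(rewards)
contrast = discounted_return(rewards, 0.5)
```

With an undiscounted sum, a reward received late in the evaluation phase counts exactly as much as one received early, which is the behaviour the added note documents.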