Skip to content

Commit f4b699a

Browse files
committed
Comment about no discount factor
1 parent 093a141 commit f4b699a

File tree

1 file changed

+3
-1
lines changed

1 file changed

+3
-1
lines changed

reinforced_lib/agents/neuro/evosax.py

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -50,7 +50,9 @@ class Evosax(BaseAgent):
5050
on backpropagation through the value or policy network. Instead, the network parameters are evolved using
5151
black-box optimization. This agent is suitable for environments with both discrete and continuous action spaces.
5252
The user is responsible for providing appropriate network output in the correct format (e.g., discrete actions
53-
should be sampled from logits with ``jax.random.categorical`` inside the network definition).
53+
should be sampled from logits with ``jax.random.categorical`` inside the network definition). Note that
54+
this agent does not discount future rewards, therefore, the fitness is computed as a simple sum of rewards
55+
obtained during the evaluation phase.
5456
5557
Parameters
5658
----------

0 commit comments

Comments
 (0)