Skip to content

Commit 812648e

Browse files
authored
Rename QRDQN logger key (#67)
1 parent 9985326 commit 812648e

File tree

2 files changed

+2
-1
lines changed

2 files changed

+2
-1
lines changed

docs/misc/changelog.rst

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -11,6 +11,7 @@ Breaking Changes:
1111
- Upgraded to Stable-Baselines3 >= 1.5.1a1
1212
- Changed the way policy "aliases" are handled ("MlpPolicy", "CnnPolicy", ...), removing the former
1313
``register_policy`` helper, ``policy_base`` parameter and using ``policy_aliases`` static attributes instead (@Gregwar)
14+
- Renamed ``rollout/exploration rate`` key to ``rollout/exploration_rate`` for QRDQN (to be consistent with SB3 DQN)
1415

1516
New Features:
1617
^^^^^^^^^^^^^

sb3_contrib/qrdqn/qrdqn.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -159,7 +159,7 @@ def _on_step(self) -> None:
159159
polyak_update(self.quantile_net.parameters(), self.quantile_net_target.parameters(), self.tau)
160160

161161
self.exploration_rate = self.exploration_schedule(self._current_progress_remaining)
162-
self.logger.record("rollout/exploration rate", self.exploration_rate)
162+
self.logger.record("rollout/exploration_rate", self.exploration_rate)
163163

164164
def train(self, gradient_steps: int, batch_size: int = 100) -> None:
165165
# Switch to train mode (this affects batch norm / dropout)

0 commit comments

Comments
 (0)