Skip to content

Commit e46a3a8

Browse files
committed
add comment, GLIE
1 parent 9a95269 commit e46a3a8

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

example/tutorial_frozenlake_dqn.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@
55
from tensorlayer.layers import *
66
import matplotlib.pyplot as plt
77

8-
""" Q-Network Q(a, s) - TD Learning, Off-Policy, e-Greedy Exploration
8+
""" Q-Network Q(a, s) - TD Learning, Off-Policy, e-Greedy Exploration (GLIE)
99
1010
Q(S, A) <- Q(S, A) + alpha * (R + lambda * Q(newS, newA) - Q(S, A))
1111
delta_w = R + lambda * Q(newS, newA)
@@ -90,7 +90,7 @@ def to_one_hot(i, n_classes=None):
9090
s = s1
9191
## Reduce chance of random action if an episode is done.
9292
if d == True:
93-
e = 1./((i/50) + 10) # reduce e
93+
e = 1./((i/50) + 10) # reduce e, GLIE: Greey in the limit with infinite Exploration
9494
break
9595

9696
## Note that, the rewards here with random action

0 commit comments

Comments
 (0)