Skip to content

Commit 8a39150

Browse files
authored
Update tutorial_atari_pong.py
1 parent d325ab6 commit 8a39150

File tree

1 file changed

+4
-3
lines changed

1 file changed

+4
-3
lines changed

examples/reinforcement_learning/tutorial_atari_pong.py

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -7,14 +7,15 @@
77
Pixels” which is a minimalistic implementation of deep reinforcement learning by
88
using python-numpy and OpenAI gym environment.
99
The code here is the reimplementation of Karpathy's Blog by using TensorLayer.
10-
Compare with Karpathy's code, we store observation for a batch, he store
11-
observation for a episode only, they store gradients instead. (so we will use
10+
Compare with Karpathy's code, we store observation for a batch, but he store
11+
observation for only one episode and gradients. (so we will use
1212
more memory if the observation is very large.)
13-
FEEL FREE TO JOIN US !
13+
1414
TODO
1515
-----
1616
- update grads every step rather than storing all observation!
1717
18+
1819
References
1920
------------
2021
- http://karpathy.github.io/2016/05/31/rl/

0 commit comments

Comments
 (0)