Skip to content

Commit 0f82b69

Browse files
authored
Merge pull request #6 from cehnegaitne/patch-1
Removed unnecessary words
2 parents 7ab97c2 + de46ac7 commit 0f82b69

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

README.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -21,13 +21,13 @@ The deep reinforcement learning community has made several independent improveme
2121
# Environments
2222
First, I recommend to use small test problems to run experiments quickly. Then, you can continue on environments with large observation space.
2323

24-
- **CartPole** - classic RL environment that can be solved even on a single cpu
25-
- **Atari Pong** - the easiest atari environment, it takes ~ 1 million frames to converge, comparing with other atari games that take > 40 millions
24+
- **CartPole** - classic RL environment can be solved on a single cpu
25+
- **Atari Pong** - the easiest atari environment, only takes ~ 1 million frames to converge, comparing with other atari games that take > 40 millions
2626
- **Atari others** - change hyperparameters, target network update frequency=10K, replay buffer size=1M
2727

2828
# If you get stuck…
29-
- First, remember that you are not stuck unless you have spent more than a week on a single algorithm. It is perfectly normal if you do not have all the required knowledge of mathematics and CS. For example, you will need knowledge of the fundamentals of measure theory and statistics, especially the [Wasserstein metric](https://en.wikipedia.org/wiki/Wasserstein_metric) and [quantile regression](https://en.wikipedia.org/wiki/Quantile_regression). Statistical inference: [ importance sampling](https://en.wikipedia.org/wiki/Importance_sampling). Data structures: [Segment Tree](https://leetcode.com/tag/segment-tree/) and [K-dimensional Tree](https://en.wikipedia.org/wiki/K-d_tree).
30-
- Carefully go through the paper. Try to see what is the problem that authors are solving. First you should understand a high-level idea of the approach, then you can read the code skipping the proofs, and after that go over the mathematical details and proofs.
29+
- Remember you are not stuck unless you have spent more than a week on a single algorithm. It is perfectly normal if you do not have all the required knowledge of mathematics and CS. For example, you will need knowledge of the fundamentals of measure theory and statistics, especially the [Wasserstein metric](https://en.wikipedia.org/wiki/Wasserstein_metric) and [quantile regression](https://en.wikipedia.org/wiki/Quantile_regression). Statistical inference: [importance sampling](https://en.wikipedia.org/wiki/Importance_sampling). Data structures: [Segment Tree](https://leetcode.com/tag/segment-tree/) and [K-dimensional Tree](https://en.wikipedia.org/wiki/K-d_tree).
30+
- Carefully go through the paper. Try to see what is the problem the authors are solving. Understand a high-level idea of the approach, then read the code (skipping the proofs), and after go over the mathematical details and proofs.
3131

3232
# Best RL courses
3333
- David Silver's course [link](http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching.html)

0 commit comments

Comments
 (0)