You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+9-8Lines changed: 9 additions & 8 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,18 +1,19 @@
1
1
# RLCodebase
2
2
RLCodebase is a modularized codebase for deep reinforcement learning algorithms based on PyTorch. This repo aims to provide an user-friendly reinforcement learning codebase for beginners to get started and for researchers to try their ideas quickly and efficiently.
3
3
4
-
For now, it has implemented DQN(PER), A2C, PPO, DDPG, TD3 and SAC algorithms, and tested on OpenAI Gym, Procgen, PyBullet and DMControl Suite environments.
4
+
For now, it has implemented DQN(PER), A2C, PPO, DDPG, TD3 and SAC algorithms, and has been tested on Atari, Procgen, Mujoco, PyBullet and DMControl Suite environments.
5
5
6
6
## Introduction
7
7
The design of RLCodebase is shown as below.
8
8
9
9
10
10

11
-
* Config: Config is a class that contains parameters for reinforcement learning algorithms such as discount factor, learning rate, etc. and general configurations such as random seed, saving path, etc.
12
-
* Agent: Agent is a wrapped class that controls the workflow of reinforcement learning algorithms like a manager. It's responsible for the interactions among submodules (policy, environment, memory).
13
-
* Policy: Policy tells us what action to taken given a state. It also implements a function that defines how to update the model given a batch of data.
14
-
* Environment: Environment is designed to be a vectorized gym environment. Here we use gym wrappers from OpenAI baselines for convenient implementations.
15
-
* Memory: Memory stores data needed for improving our model.
11
+
***Config**: Config is a class that contains parameters for reinforcement learning algorithms such as discount factor, learning rate, etc. and general configurations such as random seed, saving path, etc.
12
+
***Trainer**: Trainer is a wrapped class that controls the workflow of reinforcement learning training. It manages the interactions between submodules (Agent, Env, memory).
13
+
***Agent**: Agent chooses actions to take given states. It also defines how to update the model given a batch of data.
14
+
***Model**: Model gathers all neural networks to train.
15
+
***Env**: Env is a vectorized gym environment.
16
+
***Memory**: Memory stores experiences utilized for RL training.
16
17
17
18
## Installtion
18
19
All required packages have been included in setup.py and requirements.txt. Mujoco is needed for mujoco_py and dm control suite. To support mujoco_py and dm control, please refer to https://github.com/openai/mujoco-py and https://github.com/deepmind/dm_control. For mujoco_py 2.1.2.14 and dm_control (commit fe44496), you may download mujoco like below
0 commit comments