Skip to content

Commit 51abb23

Browse files
authored
last bit of doc review from checklist (#1191)
* minor grammatical fix * fixed definition in table
1 parent 3df96a0 commit 51abb23

File tree

2 files changed

+4
-3
lines changed

2 files changed

+4
-3
lines changed

docs/Feature-Memory.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,6 @@
11
# Memory-enhanced agents using Recurrent Neural Networks
22

3-
## What are memories for
3+
## What are memories used for?
44

55
Have you ever entered a room to get something and immediately forgot what you
66
were looking for? Don't let that happen to your agents.

docs/Training-ML-Agents.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -158,7 +158,7 @@ after the GameObject containing the Brain component that should use these
158158
settings. (This GameObject will be a child of the Academy in your scene.)
159159
Sections for the example environments are included in the provided config file.
160160

161-
| **Setting** | **Description** | **Applies To Trainer**|
161+
| **Setting** | **Description** | **Applies To Trainer\***|
162162
| :-- | :-- | :-- |
163163
| batch_size | The number of experiences in each iteration of gradient descent.| PPO, BC |
164164
| batches_per_epoch | In imitation learning, the number of batches of training examples to collect before training the model.| BC |
@@ -183,7 +183,8 @@ Sections for the example environments are included in the provided config file.
183183
| trainer | The type of training to perform: "ppo" or "imitation".| PPO, BC |
184184
| use_curiosity | Train using an additional intrinsic reward signal generated from Intrinsic Curiosity Module. | PPO |
185185
| use_recurrent | Train using a recurrent neural network. See [Using Recurrent Neural Networks](Feature-Memory.md).| PPO, BC |
186-
|| PPO = Proximal Policy Optimization, BC = Behavioral Cloning (Imitation)) ||
186+
187+
\*PPO = Proximal Policy Optimization, BC = Behavioral Cloning (Imitation)
187188

188189
For specific advice on setting hyperparameters based on the type of training you
189190
are conducting, see:

0 commit comments

Comments
 (0)