last bit of doc review from checklist (#1191)

shihzy · web-flow · commit 51abb2338d8d · 2018-09-06T10:53:29.000-07:00
* minor grammatical fix

* fixed definition in table
diff --git a/docs/Feature-Memory.md b/docs/Feature-Memory.md
@@ -1,6 +1,6 @@
 # Memory-enhanced agents using Recurrent Neural Networks
 
-## What are memories for
+## What are memories used for?
 
 Have you ever entered a room to get something and immediately forgot what you
 were looking for? Don't let that happen to your agents.  
diff --git a/docs/Training-ML-Agents.md b/docs/Training-ML-Agents.md
@@ -158,7 +158,7 @@ after the GameObject containing the Brain component that should use these
 settings. (This GameObject will be a child of the Academy in your scene.)
 Sections for the example environments are included in the provided config file.
 
-| **Setting** | **Description** | **Applies To Trainer**|
+| **Setting** | **Description** | **Applies To Trainer\***|
 | :--         | :--             | :--                   |
 | batch_size | The number of experiences in each iteration of gradient descent.| PPO, BC |
 | batches_per_epoch | In imitation learning, the number of batches of training examples to collect before training the model.| BC |
@@ -183,7 +183,8 @@ Sections for the example environments are included in the provided config file.
 | trainer | The type of training to perform: "ppo" or "imitation".| PPO, BC |
 | use_curiosity | Train using an additional intrinsic reward signal generated from Intrinsic Curiosity Module. | PPO |
 | use_recurrent | Train using a recurrent neural network. See [Using Recurrent Neural Networks](Feature-Memory.md).| PPO, BC |
-|| PPO = Proximal Policy Optimization, BC = Behavioral Cloning (Imitation)) ||
+
+\*PPO = Proximal Policy Optimization, BC = Behavioral Cloning (Imitation)
 
 For specific advice on setting hyperparameters based on the type of training you
 are conducting, see: