2 files changed, +4 -3 lines changed

 # Memory-enhanced agents using Recurrent Neural Networks

-## What are memories for
+## What are memories used for?

 Have you ever entered a room to get something and immediately forgot what you
 were looking for? Don't let that happen to your agents.
@@ -158,7 +158,7 @@ after the GameObject containing the Brain component that should use these
 settings. (This GameObject will be a child of the Academy in your scene.)
 Sections for the example environments are included in the provided config file.

-| **Setting** | **Description** | **Applies To Trainer** |
+| **Setting** | **Description** | **Applies To Trainer\*** |
 | :-- | :-- | :-- |
 | batch_size | The number of experiences in each iteration of gradient descent.| PPO, BC |
 | batches_per_epoch | In imitation learning, the number of batches of training examples to collect before training the model.| BC |
@@ -183,7 +183,8 @@ Sections for the example environments are included in the provided config file.
 | trainer | The type of training to perform: "ppo" or "imitation".| PPO, BC |
 | use_curiosity | Train using an additional intrinsic reward signal generated from Intrinsic Curiosity Module. | PPO |
 | use_recurrent | Train using a recurrent neural network. See [Using Recurrent Neural Networks](Feature-Memory.md).| PPO, BC |
-|| PPO = Proximal Policy Optimization, BC = Behavioral Cloning (Imitation)) ||
+
+\* PPO = Proximal Policy Optimization, BC = Behavioral Cloning (Imitation)
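
The "Applies To Trainer" column can be read as a lookup from setting name to trainer. The sketch below is not part of this change or of the ML-Agents code base: `APPLIES_TO`, `TRAINER_ABBREVIATION`, and `unsupported_settings` are hypothetical names, the numeric values are placeholders, and only the rows visible in this excerpt are covered. It shows one way to flag settings that the table does not list for the chosen trainer.

```python
# Hypothetical lookup built from the table rows visible in this excerpt.
APPLIES_TO = {
    "batch_size": {"PPO", "BC"},
    "batches_per_epoch": {"BC"},
    "trainer": {"PPO", "BC"},
    "use_curiosity": {"PPO"},
    "use_recurrent": {"PPO", "BC"},
}

# The `trainer` setting takes "ppo" or "imitation"; the table footnote
# abbreviates Behavioral Cloning (Imitation) as BC.
TRAINER_ABBREVIATION = {"ppo": "PPO", "imitation": "BC"}


def unsupported_settings(config):
    """Return sorted names of settings the table does not list for config's trainer."""
    abbreviation = TRAINER_ABBREVIATION[config["trainer"]]
    return sorted(
        name
        for name in config
        if name in APPLIES_TO and abbreviation not in APPLIES_TO[name]
    )


# Placeholder values, chosen only for illustration.
ppo_config = {"trainer": "ppo", "batch_size": 1024, "batches_per_epoch": 10}
print(unsupported_settings(ppo_config))
```

Settings that are absent from the lookup are ignored rather than rejected, since the full table contains more rows than this excerpt shows.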

 For specific advice on setting hyperparameters based on the type of training you
 are conducting, see: