|
7 | 7 | - **Agent** - Unity Component which produces observations and takes actions in |
8 | 8 | the environment. Agents actions are determined by decisions produced by a |
9 | 9 | Policy. |
10 | | -- **Policy** - The decision making mechanism, typically a neural network model. |
11 | 10 | - **Decision** - The specification produced by a Policy for an action to be |
12 | 11 | carried out given an observation. |
13 | 12 | - **Editor** - The Unity Editor, which may include any pane (e.g. Hierarchy, |
14 | 13 | Scene, Inspector). |
15 | 14 | - **Environment** - The Unity scene which contains Agents. |
| 15 | +- **Experience** - Corresponds to a tuple of [Agent observations, actions, |
| 16 | + rewards] of a single Agent obtained after a Step. |
| 17 | +- **External Coordinator** - ML-Agents class responsible for communication with |
| 18 | + outside processes (in this case, the Python API). |
16 | 19 | - **FixedUpdate** - Unity method called each time the game engine is stepped. |
17 | 20 | ML-Agents logic should be placed here. |
18 | 21 | - **Frame** - An instance of rendering the main camera for the display. |
19 | 22 | Corresponds to each `Update` call of the game engine. |
20 | 23 | - **Observation** - Partial information describing the state of the environment |
21 | 24 | available to a given agent. (e.g. Vector, Visual) |
22 | | -- **Policy** - Function for producing decisions from observations. |
| 25 | +- **Policy** - The decision making mechanism for producing decisions from |
| 26 | + observations, typically a neural network model. |
23 | 27 | - **Reward** - Signal provided at every step used to indicate desirability of an |
24 | 28 | agent’s action within the current state of the environment. |
25 | 29 | - **State** - The underlying properties of the environment (including all agents |
26 | 30 | within it) at a given time. |
27 | 31 | - **Step** - Corresponds to an atomic change of the engine that happens between |
28 | 32 | Agent decisions. |
29 | | -- **Experience** - Corresponds to a tuple of [Agent observations, actions, |
30 | | - rewards] of a single Agent obtained after a Step. |
31 | | -- **Update** - Unity function called each time a frame is rendered. ML-Agents |
32 | | - logic should not be placed here. |
33 | | -- **External Coordinator** - ML-Agents class responsible for communication with |
34 | | - outside processes (in this case, the Python API). |
35 | 33 | - **Trainer** - Python class which is responsible for training a given group of |
36 | 34 | Agents. |
| 35 | +- **Update** - Unity function called each time a frame is rendered. ML-Agents |
| 36 | + logic should not be placed here. |
0 commit comments