Commit cf1f6be

Ruo-Ping Dong authored
Remove duplicated term in glossary (#5379)

* remove duplicated Policy in glossary
* sort alphabetically
1 parent 05425b4 commit cf1f6be

1 file changed: +8 -8 lines changed

docs/Glossary.md

Lines changed: 8 additions & 8 deletions
@@ -7,30 +7,30 @@
 - **Agent** - Unity Component which produces observations and takes actions in
 the environment. Agents actions are determined by decisions produced by a
 Policy.
-- **Policy** - The decision making mechanism, typically a neural network model.
 - **Decision** - The specification produced by a Policy for an action to be
 carried out given an observation.
 - **Editor** - The Unity Editor, which may include any pane (e.g. Hierarchy,
 Scene, Inspector).
 - **Environment** - The Unity scene which contains Agents.
+- **Experience** - Corresponds to a tuple of [Agent observations, actions,
+rewards] of a single Agent obtained after a Step.
+- **External Coordinator** - ML-Agents class responsible for communication with
+outside processes (in this case, the Python API).
 - **FixedUpdate** - Unity method called each time the game engine is stepped.
 ML-Agents logic should be placed here.
 - **Frame** - An instance of rendering the main camera for the display.
 Corresponds to each `Update` call of the game engine.
 - **Observation** - Partial information describing the state of the environment
 available to a given agent. (e.g. Vector, Visual)
-- **Policy** - Function for producing decisions from observations.
+- **Policy** - The decision making mechanism for producing decisions from
+observations, typically a neural network model.
 - **Reward** - Signal provided at every step used to indicate desirability of an
 agent’s action within the current state of the environment.
 - **State** - The underlying properties of the environment (including all agents
 within it) at a given time.
 - **Step** - Corresponds to an atomic change of the engine that happens between
 Agent decisions.
-- **Experience** - Corresponds to a tuple of [Agent observations, actions,
-rewards] of a single Agent obtained after a Step.
-- **Update** - Unity function called each time a frame is rendered. ML-Agents
-logic should not be placed here.
-- **External Coordinator** - ML-Agents class responsible for communication with
-outside processes (in this case, the Python API).
 - **Trainer** - Python class which is responsible for training a given group of
 Agents.
+- **Update** - Unity function called each time a frame is rendered. ML-Agents
+logic should not be placed here.