Skip to content

Commit 811ff90

Browse files
authored
[Docs] stacking observations (#479)
1 parent 4387060 commit 811ff90

File tree

2 files changed

+2
-1
lines changed

2 files changed

+2
-1
lines changed

docs/Learning-Environment-Best-Practices.md

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -15,6 +15,7 @@ complexity over time. This can either be done manually, or via Curriculum Learni
1515

1616
## Vector Observations
1717
* Vector Observations should include all variables relevant to allowing the agent to take the optimally informed decision.
18+
* In cases where Vector Observations need to be remembered or compared over time, increase the `Stacked Vectors` value to allow the agent to keep track of multiple observations into the past.
1819
* Categorical variables such as type of object (Sword, Shield, Bow) should be encoded in one-hot fashion (i.e. `3` -> `0, 0, 1`).
1920
* Besides encoding non-numeric values, all inputs should be normalized to be in the range 0 to +1 (or -1 to 1). For example, the `x` position information of an agent where the maximum possible value is `maxValue` should be recorded as `AddVectorObs(transform.position.x / maxValue);` rather than `AddVectorObs(transform.position.x);`. See the equation below for one approach of normalization.
2021
* Positional information of relevant GameObjects should be encoded in relative coordinates wherever possible. This is often relative to the agent position.

docs/Learning-Environment-Design-Brains.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -23,7 +23,7 @@ The Brain Inspector window in the Unity Editor displays the properties assigned
2323
* `Vector Observation`
2424
* `Space Type` - Corresponds to whether the observation vector contains a single integer (Discrete) or a series of real-valued floats (Continuous).
2525
* `Space Size` - Length of vector observation for brain (In _Continuous_ space type). Or number of possible values (in _Discrete_ space type).
26-
* `Stacked Vectors` - The number of previous vector observations that will be stacked before being sent to the brain.
26+
* `Stacked Vectors` - The number of previous vector observations that will be stacked and used collectively for decision making. This results in the effective size of the vector observation being passed to the brain being: _Space Size_ x _Stacked Vectors_.
2727
* `Visual Observations` - Describes height, width, and whether to grayscale visual observations for the Brain.
2828
* `Vector Action`
2929
* `Space Type` - Corresponds to whether action vector contains a single integer (Discrete) or a series of real-valued floats (Continuous).

0 commit comments

Comments
 (0)