Pre-run Julia-based Guard policies before environment reset #26

In current versions of the Julia-based guard bot policies (e.g. LBG1_LG3, LBG1_LG4, LBG1_LG5), the Julia-based iLQGames solver is only run after the environment is initialized. This can cause very long delays before the first execution of the guard policy, especially if the computer running the scenario is also running many other processes (e.g. a computationally expensive bandit policy). In turn, this causes the effective behavior of the guard to vary widely between systems.

Instead, the guard policy should be pre-run before the environment reset that starts the actual episode. This may look like:
1. Initialize and reset the environment
2. Wait until the first execution of the guard policy completes
3. Reset the environment again
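
As a rough illustration of the three steps above, here is a minimal sketch using a stand-in environment. Everything in it is hypothetical: `StubGuardEnv`, its `guard_ready` event, and `warm_reset` are names invented for this example and are not part of `ksp_interface.py` or any existing environment. The real guard would need to expose some comparable signal that its first solver call has finished.

```python
import threading
import time


class StubGuardEnv:
    """Hypothetical stand-in for an environment with a slow-starting guard.

    The first guard execution is simulated by a sleep in a background
    thread; it does not call Julia or iLQGames.
    """

    def __init__(self, first_call_delay=0.05):
        self.first_call_delay = first_call_delay
        self.guard_ready = threading.Event()  # set once the first guard call ends
        self.reset_count = 0
        self._warmed = False

    def reset(self):
        self.reset_count += 1
        if not self._warmed:
            # Simulate the guard policy starting its slow first execution.
            threading.Thread(target=self._first_guard_call, daemon=True).start()
        return {"reset_count": self.reset_count}

    def _first_guard_call(self):
        time.sleep(self.first_call_delay)
        self._warmed = True
        self.guard_ready.set()


def warm_reset(env, timeout=60.0):
    """Reset, wait for the guard's first execution, then reset again."""
    env.reset()  # step 1: initialize and reset the environment
    if not env.guard_ready.wait(timeout=timeout):  # step 2: wait for the guard
        raise TimeoutError("guard policy did not finish its first execution")
    return env.reset()  # step 3: reset again so the episode starts warm


if __name__ == "__main__":
    env = StubGuardEnv()
    observation = warm_reset(env)
    print(env.reset_count, observation)
```

The timeout is an assumption added for safety so that a stalled solver does not block the caller indefinitely; a suitable value would depend on the machine and on how long the first solver call actually takes.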

This will likely require changes to `ksp_interface.py`, which calls the `env.reset()` function. If the changes are made at this fundamental level, they would affect all environments, so every environment would need a version increment (e.g. V1 -> V2).
