Pre-run Julia-based Guard policies before environment reset #26

@rallen10

Description

In current versions of the Julia-based guard bot policies (e.g. LBG1_LG3, LBG1_LG4, LBG1_LG5), the Julia-based iLQGames solver is first run only after the environment is initialized. This can cause very long delays before the guard policy executes for the first time, especially if the computer running the scenario is also running many other processes (e.g. a computationally expensive bandit policy). In turn, the effective behavior of the guard varies widely between systems.

Instead, the guard policy should be pre-run before the environment reset that starts the scenario. This may look like:

  1. initialize and reset environment
  2. wait until first execution of guard policy completes
  3. reset environment again
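
The three steps above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the project's actual code: `StubEnv`, `StubGuardPolicy`, `run_once`, `first_run_done`, and `reset_with_guard_warmup` are all hypothetical names, and the `time.sleep` call merely stands in for the slow first run of the solver.

```python
import threading
import time


class StubEnv:
    """Hypothetical stand-in for an environment; it only counts resets."""

    def __init__(self):
        self.reset_count = 0

    def reset(self):
        self.reset_count += 1
        return {"t": 0.0}  # placeholder observation


class StubGuardPolicy:
    """Hypothetical stand-in for a guard policy whose first call is slow."""

    def __init__(self, first_call_delay=0.05):
        self.first_call_delay = first_call_delay
        self.first_run_done = threading.Event()

    def run_once(self):
        if not self.first_run_done.is_set():
            # Stands in for the slow first execution of the solver.
            time.sleep(self.first_call_delay)
            self.first_run_done.set()
        return "guard_action"


def reset_with_guard_warmup(env, guard, timeout=60.0):
    """Reset, wait for the guard's first execution, then reset again."""
    env.reset()  # step 1: initialize and reset environment

    worker = threading.Thread(target=guard.run_once, daemon=True)
    worker.start()

    # Step 2: block until the first guard execution completes.
    if not guard.first_run_done.wait(timeout):
        raise TimeoutError("guard policy did not finish its first execution")
    worker.join()

    return env.reset()  # step 3: reset again so the scenario starts clean


env = StubEnv()
guard = StubGuardPolicy()
obs = reset_with_guard_warmup(env, guard)
```

The timeout is an assumption added here so that a guard policy that never completes does not hang the scenario indefinitely.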

This will likely require changes to ksp_interface.py, which calls the env.reset() function. If the changes are made at this fundamental level, they would affect all environments, and all environments would then need a version increment (e.g. V1 -> V2).
