How to configure IPPO #1843

JulienHansen · 2025-02-12T08:31:17Z

JulienHansen
Feb 12, 2025

Hi,

I have a question regarding the YAML configuration for IPPO in SKRL. I'm working on a two-drone environment where one drone is tasked with reaching a target while the other must follow it. The first drone is behaving as expected. However, when I tested IPPO before implementing the follower logic, I expected the follower drone to essentially do nothing—but instead, it learns to fly.

I believe the issue is that the actor is shared between them and thus I need to configure the YAML file so that each agent has a different network architecture. Could someone please clarify how to set up the YAML configuration for IPPO in SKRL when I want my agents to use distinct model architectures? Here are my default IPPO config and the new one :

seed: 42


# Models are instantiated using skrl's model instantiator utility
# https://skrl.readthedocs.io/en/latest/api/utils/model_instantiators.html
models:
  separate: True # Separate models for policy and value
  policy:  # see gaussian_model parameters
    class: GaussianMixin
    clip_actions: False
    clip_log_std: True
    min_log_std: -20.0
    max_log_std: 2.0
    initial_log_std: 0.0
    network:
      - name: net
        input: STATES
        layers: [512, 256, 128]
        activations: elu
    output: ACTIONS
  value:  # see deterministic_model parameters
    class: DeterministicMixin
    clip_actions: False
    network:
      - name: net
        input: STATES
        layers: [512, 256, 128]
        activations: elu
    output: ONE

models:
  drone_attack:
    policy:
      class: GaussianMixin
      clip_actions: False
      clip_log_std: True
      min_log_std: -20.0
      max_log_std: 2.0
      initial_log_std: 0.0
      network:
        - name: net
          input: STATES
          layers: [512, 256, 128]
          activations: elu
      output: ACTIONS
    value:
      class: DeterministicMixin
      clip_actions: False
      network:
        - name: net
          input: STATES
          layers: [512, 256, 128]
          activations: elu
      output: ONE
  drone_defense:
    policy:
      class: GaussianMixin
      clip_actions: True
      clip_log_std: False
      min_log_std: -10.0
      max_log_std: 3.0
      initial_log_std: 0.5
      network:
        - name: net
          input: STATES
          layers: [256, 256]
          activations: relu
      output: ACTIONS
    value:
      class: DeterministicMixin
      clip_actions: False
      network:
        - name: net
          input: STATES
          layers: [256, 256]
          activations: relu
      output: ONE

of course the second one is not correct with skrl format it's just to express that both agent could have different model architecture

Answered by JulienHansen

Feb 19, 2025

I found my answer in the skrl repository. Currently, in the .yaml file, it is not possible to directly specify different architectures (see: GitHub discussion)

View full answer

RandomOakForest · 2025-02-14T13:35:15Z

RandomOakForest
Feb 14, 2025
Maintainer

Thanks for posting this. @Toni-SM for vis.

1 reply

JulienHansen Feb 19, 2025
Author

I found my answer in the skrl repository. Currently, in the .yaml file, it is not possible to directly specify different architectures (see: GitHub discussion)

Answer selected by JulienHansen

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How to configure IPPO #1843

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 1 comment 1 reply

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

How to configure IPPO #1843

Uh oh!

Uh oh!

JulienHansen Feb 12, 2025

Replies: 1 comment · 1 reply

Uh oh!

RandomOakForest Feb 14, 2025 Maintainer

Uh oh!

JulienHansen Feb 19, 2025 Author

JulienHansen
Feb 12, 2025

Replies: 1 comment 1 reply

RandomOakForest
Feb 14, 2025
Maintainer

JulienHansen Feb 19, 2025
Author