Skip to content

Conversation

@pascal-roth
Copy link
Contributor

Adds a new perceptive actor-critic class, that can define CNN layers for every 2D observation term.

Example usage shown in IsaacLab isaac-sim/IsaacLab#3467

Copy link
Collaborator

@ClemensSchwarke ClemensSchwarke left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hey @pascal-roth, very cool, thanks a lot!

@ClemensSchwarke ClemensSchwarke changed the base branch from main to fix/recurrent_symmetry October 22, 2025 15:39
@ClemensSchwarke ClemensSchwarke force-pushed the feature/perceptive-nav-rl branch 2 times, most recently from 22cd1ec to 8a0a959 Compare October 22, 2025 15:57
Comment on lines 98 to 103
if self.flatten:
x = x.flatten(start_dim=1)
elif self.avgpool is not None:
x = self.avgpool(x)
x = x.flatten(start_dim=1)
return x
Copy link
Collaborator

@ClemensSchwarke ClemensSchwarke Oct 23, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't get this logic. Why do we always flatten when we do avgpool and why do we not do avgpool when we also set the flatten flag?

@ClemensSchwarke ClemensSchwarke force-pushed the feature/perceptive-nav-rl branch from 42e406d to df9e1ad Compare October 24, 2025 12:03
Base automatically changed from fix/recurrent_symmetry to main October 24, 2025 12:40
@ClemensSchwarke ClemensSchwarke force-pushed the feature/perceptive-nav-rl branch from df9e1ad to 0a1756d Compare October 24, 2025 12:44
@ClemensSchwarke
Copy link
Collaborator

ClemensSchwarke commented Oct 29, 2025

@pascal-roth I modified the CNN quite a bit to have more flexibility. Could you quickly check if this version works for you and maybe also give it a quick check? I also modified your perceptive environment configs to work with the changes here, it would be awesome if you could run the env to see if it still trains the same :)

@lvjonok
Copy link

lvjonok commented Oct 30, 2025

Hello!

I was following the progress of this PR and I had a question whether it makes sense to extend this ActorCriticPerceptive to have a shared CNN backbone for both critic and actor? Do you think it should be an option or should it be needed at all?

I thought it can greatly simplify the complexity of neural network and speed up the training process.

@ClemensSchwarke
Copy link
Collaborator

Hi @lvjonok, I would assume the speedup is negligible. But if you want to test this please share the results :)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants