[Feature Request] Store next observations and dones in RolloutBuffer #1273

### 🚀 Feature

Add `next_observations` and `dones` fields to the `RolloutBuffer` and the `DictRolloutBuffer` classes, similar to how it is done in the `ReplayBuffer` class.
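A minimal sketch of what the requested fields could look like. It is a standalone NumPy illustration, not the stable-baselines3 implementation: the class name `SketchRolloutBuffer`, its constructor arguments, and the `add` signature are all hypothetical. It also ignores how vectorized environments handle resets at episode end.

```python
import numpy as np


class SketchRolloutBuffer:
    """Hypothetical, standalone sketch; not the stable-baselines3 RolloutBuffer."""

    def __init__(self, buffer_size: int, n_envs: int, obs_shape: tuple):
        self.buffer_size = buffer_size
        self.pos = 0
        shape = (buffer_size, n_envs)
        self.observations = np.zeros(shape + obs_shape, dtype=np.float32)
        self.rewards = np.zeros(shape, dtype=np.float32)
        # Proposed extra fields: one next observation and one done flag
        # per stored transition, as a replay buffer would keep.
        self.next_observations = np.zeros(shape + obs_shape, dtype=np.float32)
        self.dones = np.zeros(shape, dtype=np.float32)

    def add(self, obs, next_obs, reward, done) -> None:
        """Store one transition per environment at the current position."""
        if self.pos >= self.buffer_size:
            raise ValueError("buffer is full")
        self.observations[self.pos] = obs
        self.next_observations[self.pos] = next_obs
        self.rewards[self.pos] = reward
        self.dones[self.pos] = done
        self.pos += 1


# Fill the buffer with dummy transitions from two parallel environments.
buffer = SketchRolloutBuffer(buffer_size=3, n_envs=2, obs_shape=(4,))
obs = np.zeros((2, 4), dtype=np.float32)
for step in range(3):
    next_obs = obs + 1.0  # stand-in for the observation returned by env.step(...)
    done = np.array([step == 2, False])  # first env terminates on the last step
    buffer.add(obs, next_obs, reward=np.ones(2), done=done)
    obs = next_obs
```

With these fields, each stored step forms a complete `(obs, next_obs, done)` transition that downstream code can read directly from the buffer.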

### Motivation

Currently, on-policy algorithms do not store next observations or dones in their rollout buffer during `collect_rollouts`, because no algorithm in stable-baselines3 needs them. However, implementing the original variant of the AIRL algorithm in [imitation](https://github.com/HumanCompatibleAI/imitation) requires both fields to be available in the buffer.

### Pitch

_No response_

### Alternatives

_No response_

### Additional context

_No response_

### Checklist

- [X] I have checked that there is no similar [issue](https://github.com/DLR-RM/stable-baselines3/issues) in the repo
