[Feature Request] Implement PILCO (Probabilistic Inference for Learning Control)

## Motivation

Implement PILCO (Probabilistic Inference for Learning Control), as requested in #509.

## Solution

It uses Gaussian Processes to model dynamics and analytic moment matching to propagate uncertainty, allowing for direct gradient-based policy optimization.

## Alternatives

NA

## Additional context

Reference: [PILCO: A Model-Based and Data-Efficient Approach to Policy Search](https://www.google.com/search?q=https://ieeexplore.ieee.org/document/6130309)

## Checklist

- [X] I have checked that there is no similar issue in the repo (**required**)


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request] Implement PILCO (Probabilistic Inference for Learning Control) #3513

Motivation

Solution

Alternatives

Additional context

Checklist

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature Request] Implement PILCO (Probabilistic Inference for Learning Control) #3513

Description

Motivation

Solution

Alternatives

Additional context

Checklist

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions