We aim to construct a comprehensive toolkit that automates the end-to-end pipeline for sim-to-real reinforcement learning. The system will automatically train specified RL tasks within the Isaac Lab simulator, utilise an LLM-based agent for iterative performance optimisation, and subsequently migrate the trained policy to a physical environment (a sketch of this loop follows the feature list below).
Core Features:
- Interaction-friendly
- Multi-agent support
- Multi-algorithm support
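A minimal sketch of the intended orchestration loop, reflecting the feature list above. All names here (`PipelineConfig`, `train_in_isaaclab`, `llm_refine`, `deploy_to_robot`) are hypothetical placeholders for the toolkit's eventual API, not existing Isaac Lab calls:

```python
from dataclasses import dataclass


@dataclass
class PipelineConfig:
    """Hypothetical toolkit configuration; field names are illustrative only."""
    task: str = "Isaac-Cartpole-v0"  # any registered Isaac Lab task ID
    algorithm: str = "rsl_rl"        # multi-algorithm support: rsl_rl / skrl / rl_games
    num_agents: int = 1              # multi-agent support
    max_refine_iters: int = 5        # budget for LLM-driven refinement


def train_in_isaaclab(cfg: PipelineConfig) -> tuple[str, dict]:
    """Placeholder: would launch Isaac Lab training, returning (ckpt path, metrics)."""
    return "logs/model.pt", {"mean_reward": 0.0}


def llm_refine(cfg: PipelineConfig, metrics: dict) -> PipelineConfig | None:
    """Placeholder: would query the LLM agent with metrics; None means 'good enough'."""
    return None


def deploy_to_robot(ckpt: str) -> None:
    """Placeholder: would export the policy and run it on the physical robot."""
    print(f"deploying {ckpt} to hardware")


def run_pipeline(cfg: PipelineConfig) -> None:
    """End-to-end loop: simulate, refine with an LLM agent, migrate to the real world."""
    ckpt, metrics = train_in_isaaclab(cfg)
    for _ in range(cfg.max_refine_iters):
        new_cfg = llm_refine(cfg, metrics)
        if new_cfg is None:
            break
        cfg = new_cfg
        ckpt, metrics = train_in_isaaclab(cfg)
    deploy_to_robot(ckpt)
```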
In the Prototype v1 stage, we simply call a mature, off-the-shelf Isaac Lab project, either customised to fit a specific task or auto-generated via the Isaac Lab command-line tooling.
- The output: a successful simulation run is recorded, including the checkpoint (ckpt), videos, and quantitative results.
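For example, the prototype could shell out to one of Isaac Lab's bundled training workflows. The script path and flags below follow recent Isaac Lab releases and may differ in other versions:

```python
import subprocess

# Launch an off-the-shelf Isaac Lab workflow headlessly and record videos.
# The script location (here the rsl_rl workflow) varies across Isaac Lab versions.
cmd = [
    "./isaaclab.sh", "-p", "scripts/reinforcement_learning/rsl_rl/train.py",
    "--task", "Isaac-Cartpole-v0",  # any registered task ID
    "--headless",                   # run without the GUI for batch training
    "--video",                      # record rollout videos during training
]
subprocess.run(cmd, check=True)
# Checkpoints, logs, and videos are written under the experiment's log directory,
# giving the ckpt, videos, and quantitative results listed above.
```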
(Optional) Manually setting up the Isaac Lab environment is still a labour-intensive task that requires a human expert's design effort. Our further work will concentrate on automating the simulation setup, which covers the following components (a configuration skeleton follows the list):
- The physical layout, which contains the physics rules and each interactive object
- The reward
- The observation and action spaces for the task, which should align with the real-world setting
- Proper events and terminations for real-world randomisation
- Commands for defining the goal
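As a sketch, each item above maps onto one manager in an Isaac Lab manager-based task configuration. The import paths follow Isaac Lab 2.x (older releases use the `omni.isaac.lab` namespace instead), and the `...` placeholders stand for the concrete per-task configs:

```python
from isaaclab.envs import ManagerBasedRLEnvCfg
from isaaclab.scene import InteractiveSceneCfg
from isaaclab.utils import configclass


@configclass
class MyTaskSceneCfg(InteractiveSceneCfg):
    """Physical layout: ground plane, lights, robot, and interactive objects."""
    # e.g. robot: ArticulationCfg = ..., object: RigidObjectCfg = ...


@configclass
class MyTaskEnvCfg(ManagerBasedRLEnvCfg):
    """Each manager below corresponds to one item in the setup list above."""
    scene = MyTaskSceneCfg(num_envs=1024, env_spacing=2.5)  # physical layout
    observations = ...  # must align with real-world sensors
    actions = ...       # must align with the real actuator interface
    rewards = ...       # the reward terms
    events = ...        # resets / randomisation for sim-to-real robustness
    terminations = ...  # episode end conditions
    commands = ...      # goal definition
```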
This stage implements an iterative pipeline that leverages an LLM agent to autonomously refine the simulation using performance feedback. The process is divided into two phases (a simplified loop is sketched after the list):
- Baseline Implementation: First, we will integrate the foundational Eureka framework into our toolkit to establish a performance baseline.
- Advanced Optimisation: Subsequently, we will develop an enhanced methodology designed to systematically improve upon the Eureka outputs and validate the performance gains.
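A simplified sketch of the Eureka-style outer loop we intend to reproduce as the baseline. The three helpers are hypothetical stand-ins for an LLM API call, an Isaac Lab training run, and metric extraction:

```python
def query_llm(env_source: str, task_desc: str, feedback: str) -> str:
    """Placeholder: an LLM call that returns reward-function source code."""
    return "def reward(obs, action): return 0.0"


def train_policy(reward_code: str) -> str:
    """Placeholder: would inject the reward into the env and run Isaac Lab training."""
    return "logs/candidate.pt"


def evaluate(ckpt: str) -> float:
    """Placeholder: would compute a task fitness score from evaluation rollouts."""
    return 0.0


def eureka_loop(env_source: str, task_desc: str,
                n_rounds: int = 5, n_samples: int = 4) -> str:
    """Sample reward candidates, train each, and feed results back to the LLM."""
    best_code, best_score = "", float("-inf")
    feedback = ""  # training statistics from the previous round
    for _ in range(n_rounds):
        candidates = [query_llm(env_source, task_desc, feedback)
                      for _ in range(n_samples)]
        scored = [(evaluate(train_policy(code)), code) for code in candidates]
        score, code = max(scored)  # keep the fittest candidate this round
        if score > best_score:
            best_score, best_code = score, code
        feedback = f"best fitness so far: {best_score:.3f}"
    return best_code
```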
After setting up the robotic environment in the real world, the previously trained checkpoint will probably not function directly in the real-world setting, which means some sim-to-real approaches must be adopted at this stage to close the gap. There are two implementations we should try in our prototype:
- DrEureka: using LLMs to design the domain randomisation for robustness (sketched below).
- Our approach (v1): using videos recorded from both the simulation and the real world as cues to generate suggestions for improving the environment settings in the Isaac Lab simulation.
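For the DrEureka-style baseline, a minimal sketch of the randomisation step: ask an LLM for parameter ranges, then wire them into the simulator's event manager. `query_llm` and the parameter names are hypothetical, not a fixed Isaac Lab schema:

```python
import json


def query_llm(prompt: str) -> str:
    """Placeholder: an LLM call returning domain-randomisation ranges as JSON."""
    return '{"static_friction": [0.4, 1.2], "added_base_mass_kg": [-0.5, 1.5]}'


prompt = ("Propose lower/upper bounds for physics parameters to randomise "
          "so the trained policy transfers robustly to the real robot.")
dr_ranges = json.loads(query_llm(prompt))

# Each range would then be registered with the env's event manager (e.g. as an
# Isaac Lab event term that resamples the parameter on every episode reset).
for name, (lo, hi) in dr_ranges.items():
    print(f"randomising {name} uniformly in [{lo}, {hi}]")
```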