Releases: nvidia-isaac/WBC-AGILE
Releases · nvidia-isaac/WBC-AGILE
v1.2: G1 Motion Tracking, Pick-and-Place, and Sim2MuJoCo
Immutable
release. Only release title and notes can be modified.
AGILE v1.2 Release Notes
Highlights
- G1 Motion Tracking Task — Full-body motion tracking for Unitree G1 using reference trajectories, with support for AMASS retargeted motion data
- G1 Pick-and-Place Task — Loco-manipulation task combining trajectory tracking with object grasping via the Dex3-1 hand
- Sim2MuJoCo — Unified sim-to-sim transfer workflow supporting both velocity tracking and motion tracking policies in MuJoCo, with a built-in evaluation pipeline for automated sweeps
- GR00T Data Recording & Inference — Record demonstrations in Isaac Lab for GR00T fine-tuning and deploy with NVIDIA GR00T
- OSMO Cloud Training — Workflow pipeline for launching distributed training on NVIDIA OSMO
- Sphinx Documentation — Full documentation site with GitHub Pages deployment
New Features
Tasks
- G1 Motion Tracking (
G1-MotionTracking-v0): Whole-body motion imitation using reference trajectory tracking with body position, orientation, and joint-level rewards (#41) - G1 Pick-and-Place (
G1-PickPlace-Tracking-v0): Trajectory-guided pick-and-place with the Unitree G1 and Dex3-1 hands, including object interaction rewards, curriculum learning, and domain randomization for data recording - T1 Stand-Up (
T1-StandUp-v0): Get-up-from-ground task for the Booster T1 with adaptive lift curriculum (#25)
Algorithms & Training
- Good/bad termination handling with value bootstrapping for improved PPO training (#40)
- Reward normalization for more stable training across reward scales
- OSMO workflow training pipeline for cloud-based distributed training (#33)
Sim2MuJoCo
- Unified
sim2mujocomodule supporting both velocity tracking and motion tracking policies (#42, #44) - Built-in evaluation pipeline with configurable command schedules, automated sweeps (velocity, height, yaw rate), and data logging
GR00T Integration
- Data recording pipeline: record demonstrations from trained policies with visual domain randomization
convert_to_gr00t.pyfor converting recorded data to GR00T-compatible format- GR00T inference service for deploying models back into the environment (#28)
Debugging & Visualization
- Object pose GUI and reward visualizer for interactive debugging of manipulation tasks
- Debug GUI gain fix for joint position control (#25)
Documentation & CI
- Sphinx documentation site with NVIDIA theme, deployed to GitHub Pages (#46)
- GitHub Actions CI with pre-merge checks and docs deployment
- Pull request template (#34)
Bug Fixes
- Fix pick-place action scale — Upper-body action scale filter was comparing joint names against regex patterns using exact string matching, causing right arm joints to be frozen (scale=0) during training (#47)
- Fix velocity-height G1 training configuration (#29)
- Fix reward normalization in distillation script
- Fix debug GUI play mode to not require a policy checkpoint
- Fix CI timeout for stand-up task (#38)
- Misc workflow and configuration fixes (#45)
Infrastructure
- Update Isaac Lab Docker image from 2.3.0 to 2.3.1
- Add
scripts/utils/convert_retargeted_data_for_tracking.pyfor converting AMASS retargeted motion data to the tracking task format - Add evaluation framework with configurable command schedules, trajectory logging, and W&B integration
v1.1: OSMO Workflow, GR00T Pipeline & Object Interaction
Immutable
release. Only release title and notes can be modified.
Release v1.1
This release brings several major features including the new OSMO workflow training pipeline, the GR00T data recording pipeline, and a robot object interaction task. It also includes various bug fixes, visualizer tools for debugging, and significant CI/CD enhancements.
🚀 Features & Enhancements
- OSMO Integration: Added OSMO workflow training pipeline (#33).
- GR00T Data Pipeline: Added GR00T data recording, conversion, and inference pipeline (#28).
- Robot Object Interaction: Introduced a new robot object interaction task.
- Sim2Sim Module: Added the sim2sim module for sim-to-sim validation.
- Visualizers: Added object and reward visualizers for easier debugging.
- Reward Normalization: Implemented reward normalization features.
- T1 Stand-up Task: Fixed and enhanced the T1 stand-up task (#25).
- Recurrent Student Policy: Uploaded the PyTorch script for the recurrent student policy.
🐛 Bug Fixes
- Training Fixes: Fixed the velocity height for G1 training (#29).
- Debug GUI: Fixed an issue where the gain was not effective in the debug GUI, and resolved general GUI play issues.
- Debug Env: Fixed the debug environment and updated the play script to not require a policy.
- Distillation Script: Addressed an issue with missing reward normalization in the distillation script.
🛠️ Maintenance & CI/Misc
- CI/CD Improvements:
- Docker Image: Updated the Isaac Lab Docker image from
2.3.0to2.3.1. - Documentation: Added office hour FAQs and related links.