This repository provides code for working with the UFactory xArm manipulators in the Stanford MSL. It includes:

- Model training and fine-tuning support (data preparation, training scripts, etc.) for:
  - GR00T N1.5
  - ACT (Action Chunking with Transformers)
  - Diffusion Policy
- Data collection and visualization utilities using ROS, RealSense, and VRPN for demonstration recording.
- A modified version of the official xArm Python SDK, adapted for integration with MSL workflows:
  - xArm-MSL-ROS
PS: Trained model checkpoints for the MSL xArm can be found at: Groot Checkpoints
Use my GR00T repo, which differs from the original GR00T repo:
- Custom config specifying action representation and action horizon
Inside Docker:

```
dockerrunxarm
cd /xArm-MSL-ROS/msl_scripts/groot_data_prepare
python convert_data_for_groot.py
python generate_info_json.py
python generate_episode_task_json.py
python generate_modality_json.py
```

IMPORTANT NOTES:

- The gripper value is scaled to 0–1 by `convert_data_for_groot.py`; it is no longer 0–850 (the raw value from the xArm API).
- The state/action input format is: `[orientation (6D), position, gripper (0–1)]`
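As a minimal sketch of this convention (the function name and clipping behavior below are assumptions for illustration, not the actual code in `convert_data_for_groot.py`), the 10-dimensional state/action vector can be assembled like this:

```python
GRIPPER_MAX = 850.0  # raw gripper range reported by the xArm API

def to_groot_state(orientation_6d, position, gripper_raw):
    """Assemble [orientation (6D), position (3D), gripper (0-1)]."""
    gripper = min(max(gripper_raw / GRIPPER_MAX, 0.0), 1.0)
    return list(orientation_6d) + list(position) + [gripper]

state = to_groot_state([1, 0, 0, 0, 1, 0], [0.3, 0.0, 0.2], 425.0)
# len(state) == 10, state[-1] == 0.5
```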
Outside Docker:

```
conda activate gr00t
```

Start fine-tuning:

```
python scripts/gr00t_finetune.py \
    --dataset-path ~/bags/msl_bags/converted_groot_data_absolute \
    --output-dir ~/bags/msl_bags/groot_checkpoints/xarm_pick_place_absolute_pose_run6_batch_16_horizon_100 \
    --data-config xarm_dualcam_h100 \
    --embodiment-tag oxe_droid \
    --num-gpus 1 \
    --no-tune_diffusion_model \
    --max-steps 200000 \
    --batch-size 16
```
To resume fine-tuning:

- Edit the file `/home/xarm/anaconda3_outside_docker/envs/gr00t/lib/python3.10/site-packages/transformers/trainer.py` and change this line:

  ```
  checkpoint_rng_state = torch.load(rng_file, weights_only=True)
  ```

  to:

  ```
  checkpoint_rng_state = torch.load(rng_file, weights_only=False)
  ```

- Add the `--resume` flag:
```
python scripts/gr00t_finetune.py \
    --dataset-path ~/bags/msl_bags/converted_groot_data_absolute \
    --output-dir ~/bags/msl_bags/groot_checkpoints/xarm_pick_place_absolute_pose_run6_batch_16_horizon_100 \
    --data-config xarm_dualcam_h100 \
    --embodiment-tag oxe_droid \
    --num-gpus 1 \
    --no-tune_diffusion_model \
    --max-steps 200000 \
    --batch-size 16 \
    --resume
```

Inside Docker:

```
python extract_images_poses_from_bags.py
```

NOTE: State/action inputs here are in the format: `[position, orientation (6D), gripper (0–850)]`
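Note that this ordering and gripper scale differ from the GROOT training format above. A minimal sketch of the reordering/rescaling (illustrative only; the function name is an assumption, not code from the repo):

```python
def raw_to_groot(raw):
    """Reorder a raw [position (3), orientation (6D), gripper (0-850)]
    vector into the GROOT training layout
    [orientation (6D), position (3), gripper (0-1)]."""
    position, orientation, gripper = raw[0:3], raw[3:9], raw[9]
    return orientation + position + [gripper / 850.0]

raw = [0.3, 0.0, 0.2,  1, 0, 0, 0, 1, 0,  425.0]
# raw_to_groot(raw) -> [1, 0, 0, 0, 1, 0, 0.3, 0.0, 0.2, 0.5]
```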
Outside Docker:

```
python eval_groot_xarm_pick_place.py
```

3D plot:

```
python trajectory_plot_3D_rollout_groot.py
```

Similarity plot: edit `trajectory_similarity_plot_3D_rollout_groot_act.py`, set `for_act = False`, then run:

```
python trajectory_similarity_plot_3D_rollout_groot_act.py
```

Evaluate a checkpoint on the converted dataset:

```
python scripts/eval_policy.py \
    --model_path ~/bags/msl_bags/groot_checkpoints/xarm_pick_place_absolute_pose/checkpoint-25000 \
    --data_config xarm_dualcam_h100 \
    --dataset_path ~/bags/msl_bags/converted_groot_data \
    --embodiment_tag oxe_droid \
    --video_backend torchvision_av \
    --modality_keys single_arm gripper \
    --plot
```

PS: Trained model checkpoints for the MSL xArm can be found at: ACT Checkpoints
Use my ACT repo, which differs from the original ACT repo:
- Action representation = end-effector pose + gripper (10-dim) instead of joint-space (14-dim)
- Updated data loader for custom observations
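The 10-dimensional action concatenates end-effector position (3), orientation (6), and gripper (1). Assuming the standard continuous 6D rotation parameterization (the first two columns of the rotation matrix) — an assumption about this repo, not stated above — the full rotation can be recovered by Gram–Schmidt, sketched here:

```python
import math

def rotation_from_6d(r6):
    """Recover the three rotation-matrix columns from a 6D orientation
    via Gram-Schmidt (assumes the 6D vector holds the first two columns)."""
    a1, a2 = r6[0:3], r6[3:6]
    norm = lambda v: math.sqrt(sum(x * x for x in v))
    b1 = [x / norm(a1) for x in a1]                     # normalize column 1
    d = sum(x * y for x, y in zip(b1, a2))
    u2 = [x - d * y for x, y in zip(a2, b1)]            # remove b1 component
    b2 = [x / norm(u2) for x in u2]                     # normalize column 2
    b3 = [b1[1] * b2[2] - b1[2] * b2[1],                # b3 = b1 x b2
          b1[2] * b2[0] - b1[0] * b2[2],
          b1[0] * b2[1] - b1[1] * b2[0]]
    return b1, b2, b3

# Identity example: [1,0,0, 0,1,0] yields third column [0, 0, 1]
```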
Inside Docker:

```
dockerrunxarm
cd /xArm-MSL-ROS/msl_scripts/act_data_prepare
python convert_data_for_act.py
python generate_act_config.py
```

IMPORTANT NOTES:

- Gripper values are NOT scaled; they remain 0–850.
- Input format: `[position, orientation (6D), gripper (0–850)]`
Outside Docker:

```
conda activate aloha
cd act
```

Train:
```
python imitate_episodes.py \
    --task_name xarm_pick_place \
    --ckpt_dir /home/xarm/bags/msl_bags/act_checkpoints/absolute_action_run3 \
    --policy_class ACT \
    --chunk_size 100 \
    --batch_size 16 \
    --num_epochs 50000 \
    --lr 1e-5 \
    --dim_feedforward 3200 \
    --hidden_dim 512 \
    --seed 0 \
    --kl_weight 0
```

To resume from a checkpoint, add:

```
--resume_ckpt_path /home/xarm/bags/msl_bags/act_checkpoints/absolute_action/policy_epoch_30600_seed_0.ckpt
```

To train with relative actions:

- Set the relative flag in `convert_data_for_act.py`
- Update `--ckpt_dir`
- Edit `constants.py` → `REAL_DATASET_DIR` to point to the relative dataset
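A minimal sketch of what a relative-action conversion typically does (illustrative only; the actual logic lives in `convert_data_for_act.py` and also handles orientation and the gripper):

```python
def to_relative_actions(abs_positions):
    """Convert a sequence of absolute end-effector positions into
    per-step deltas, so the policy predicts motions instead of poses."""
    return [
        [c - p for c, p in zip(curr, prev)]
        for prev, curr in zip(abs_positions, abs_positions[1:])
    ]

# to_relative_actions([[0,0,0], [1,0,0], [1,2,0]]) -> [[1,0,0], [0,2,0]]
```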
Inside Docker:

```
python extract_images_poses_from_bags.py
```

Outside Docker:

```
python imitate_episodes.py \
    --eval \
    --task_name xarm_pick_place \
    --ckpt_dir /home/xarm/bags/msl_bags/act_checkpoints/absolute_action_run4 \
    --policy_class ACT \
    --chunk_size 100 \
    --batch_size 16 \
    --num_epochs 50000 \
    --lr 1e-5 \
    --dim_feedforward 3200 \
    --hidden_dim 512 \
    --seed 0 \
    --kl_weight 0 \
    --inference_dataset_dir /home/xarm/bags/msl_bags/IMPORTANT-distribution-pick-and-place-raw-bags-30/extracted_images_and_pose
```

3D plot:

```
python trajectory_plot_3D_rollout_act.py
```

Similarity plot: edit `trajectory_similarity_plot_3D_rollout_groot_act.py`, set `for_act = True`, then run:

```
python trajectory_similarity_plot_3D_rollout_groot_act.py
```

Start Docker:
```
dockerrunxarm
```

If the container already exists and fails to start:

```
dockercommitxarm && dockerpushxarm && dockerrunxarmwithremove
```

Start tmux and open multiple windows:

```
tmux
Ctrl+b c   # create a new window
Ctrl+b n   # switch to the next window
```

In separate windows:
- Start ROS:

  ```
  roscore
  ```

- Launch the cameras:

  ```
  cd xArm-MSL-ROS/
  roslaunch msl_scripts/launch_two_realsense_cameras.launch
  ```

  (Optional: use rqt to check the camera feed, then close the viewer.)

Important: one person should remain at the robot's safety switch.

Enable the robot at: http://192.168.1.219:18333
```
cd diffusion_policy/
conda-init
conda activate robodiff
python eval_xarm_msl_ros.py
```

Outside Docker:

```
dockercommitxarm
dockerpushxarm
docker rm xarm
```

```
cd ~/xArm-MSL-ROS
roslaunch msl_scripts/launch_two_realsense_cameras.launch
roslaunch vrpn_client_ros sample.launch
python ./xArm-Python-SDK/example/wrapper/xarm6/xarm_msl_ros.py
```

Check pose estimates:

```
rostopic echo /vrpn_client_node/drone1/pose
```

Visualize data:

```
rqt_image_view
```

Record demo data:
```
rosbag record -o xarm_demo \
    /robot_end_effector_pose \
    /wrist_camera/color/image_raw \
    /wrist_camera/color/camera_info \
    /fixed_camera/color/camera_info \
    /fixed_camera/color/image_raw \
    /tf /tf_static
```

See README.md in the xArm-Python-SDK subfolder for more details.
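When post-processing these bags, pose and image messages arrive at different rates and must be aligned by timestamp. A minimal nearest-timestamp matcher is sketched below (an assumption about the approach; the repo's extraction scripts may align topics differently):

```python
import bisect

def match_nearest(pose_stamps, image_stamps):
    """For each image timestamp, return the index of the nearest pose
    timestamp. pose_stamps must be sorted in ascending order."""
    matches = []
    for t in image_stamps:
        i = bisect.bisect_left(pose_stamps, t)
        # Candidate neighbors: the entry just before and at the insertion point
        candidates = [j for j in (i - 1, i) if 0 <= j < len(pose_stamps)]
        matches.append(min(candidates, key=lambda j: abs(pose_stamps[j] - t)))
    return matches

# match_nearest([0.0, 0.1, 0.2], [0.04, 0.19]) -> [0, 2]
```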