Skip to content

Official implementation of 2HandedAfforder (ICCV 2025) — a vision-language-based framework for learning precise, actionable bimanual affordances from human videos, enabling robots to reason about and execute coordinated two-handed actions.

Notifications You must be signed in to change notification settings

pearl-robot-lab/2HandedAfforder

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

61 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Generation for 2HandedAfforder

This repository extract the ground truth affordance annotation for the bimanual affordance task from the EPIC-KITCHENS dataset

Step 1: Installation

  • git clone https://github.com/MarvinHei/2HandedAfforder_DataGen.git
  • conda create -n "2handedafforder" python=3.8
  • conda activate 2handedafforder
  • conda install pytorch==1.11.0 torchvision==0.12.0 torchaudio==0.11.0 cudatoolkit=11.3 -c pytorch
  • Others: pip install -r requirements.txt

Step 2: Generate initial sparse masks

Download the VISOR-VIS annotations from https://data.bris.ac.uk/data/dataset/2v6cgv1x04ol22qp9rm9x2j6a7 and place it in VISOR-VIS_Mod/epick_visor. Within VISOR-VIS_Mod run

python demo.py

to generate the sparse masks. These will get stored in ../EPIC_DATA/segmentations

Step 3: Download the model checkpoints for MI-GAN, agent-inpainting and XMem

See README.md of submodules

Step 4: Run the data processing pipeline

./pipeline.bash PXX PXX_XX

Exchange XX for the video file of EPIC-KITCHENS you want to process. Each individual step will get stored within the EPIC_DATA folder. To delete the entire processed data of a video from the dataset, do

./delete_video.bash PXX_XX

The video itself will not be deleted

Step 5: Convert the data into a HDF5 format

./post_process.bash PXX_XX

About

Official implementation of 2HandedAfforder (ICCV 2025) — a vision-language-based framework for learning precise, actionable bimanual affordances from human videos, enabling robots to reason about and execute coordinated two-handed actions.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •