Skip to content

We introduce PROSKILL, the first benchmark dataset for action-level skill assessment in procedural tasks. PROSKILL provides absolute skill assessment annotations, along with pairwise ones.

License

Notifications You must be signed in to change notification settings

fpv-iplab/ProSKILL_WACV

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PROSKILL

Segment-Level Skill Assessment in Procedural Videos

Authors

Michele Mazzamuto¹², Daniele Di Mauro², Gianpiero Francesca³, Giovanni Maria Farinella¹, Antonino Furnari¹

Affiliations

Co-first authors
¹ IPLAB, University of Catania
² Next Vision s.r.l., Italy
³ Toyota Motor Europe, Belgium

Directory Structure

data/

Contains the annotation data and metadata for five procedural video datasets:

  • taxonomy.yaml: A YAML file mapping each dataset to its list of procedural tasks and skill categories
  • assembly101/: Assembly101 dataset containing assembly and disassembly task annotations
  • egoexo4d/: Ego-Exo4D dataset containing bike repair tasks (chain cleaning/lubrication, wheel installation/removal, flat tire fixing)
  • epic-tent/: EPIC-Tent dataset containing tent assembly annotations
  • ikea/: IKEA furniture assembly dataset with annotations for various furniture pieces (TV bench, side table, coffee table, shelf drawer)
  • meccano/: Meccano toy assembly dataset

Each dataset subdirectory contains:

  • taxonomy.json: Task-specific taxonomy and skill definitions
  • train.txt, val.txt, test.txt: Video identifiers for train/validation/test splits in the form of winner,loser,global goal->subgoal
  • rounds/: Contains YAML files (numbered 1.yaml through 6.yaml) organizing video segment pairs for comparative judgment across six annotation rounds. Each file maps task subtasks to lists of paired video segments to be compared for skill assessment.
  • rankings/: Contains skill ranking files (named rank_after_round_{n}.yml, i.e., 1.yml through 6.yml) with Elo ratings, Swiss scores, z-scores, and expert level assessments for each video segment after each annotation round.
  • transcripts_coarse/: Coarse-grained action transcripts

AMT_results/

Contains Amazon Mechanical Turk (AMT) crowdsourced comparative skill assessment results across six annotation rounds per task. Each subdirectory corresponds to a dataset:

  • assembly101/: 12 JSON files (6 rounds each for assembly and disassembly tasks)
    • Format: assembly_round_X_results.json and disassembly_round_X_results.json
  • egoexo4d/: 6 JSON files (one per round for bike repair tasks)
    • Format: bike_round_X_results.json
  • epic-tent/: Similar structure with tent assembly results
  • ikea/: Similar structure with furniture assembly results
  • meccano/: Similar structure with meccano assembly results

Each result file contains comparative judgment data with:

  • Input.video1_url and Input.video2_url: Paired video identifiers being compared
  • AnswerExperienced1 and AnswerExperienced2: Skill ratings from annotators (0-5 scale)
  • winner: The video deemed to demonstrate higher skill level
  • agreement_rate: Inter-annotator agreement on the judgment

Videos Dataset Download:

To download each video dataset follow their documentation, in particular:

  • assembly101: follow istruction to download here
  • egoexo4d: follow istruction to download here
  • epic-tent: follow istruction to download here
  • ikea: follow istruction to download here
  • meccano: follow istruction to download here

Extract clip tool

We share a script (tools/extract_clips.py) to extract the annotated clips, the current implementation expect a videos/ directory, containing the original videos, and a clips/ directory, where the output will be extracted, inside every dataset dir, for Ego-Exo4D you should copy keystep_val.json and keystep_train.json from the original dataset inside the egoexo4d directory.

About

We introduce PROSKILL, the first benchmark dataset for action-level skill assessment in procedural tasks. PROSKILL provides absolute skill assessment annotations, along with pairwise ones.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages