
feat(datasets): add training data preprocessing (GameFactory example) #47

Open
jankinf wants to merge 1 commit into Tencent-Hunyuan:main from jankinf:data


jankinf commented Feb 26, 2026

  • Add hy_preprocess/ with preprocessing scripts for training data.
  • GameFactory pipeline: preprocess_gamefactory_dataset.py converts GameFactory/Minecraft data into the format expected by CameraJsonWMemDataset (pose JSON, action JSON, VAE latents, temporal alignment).
  • Utilities: generate_neg_prompt_pt.py for negative-prompt generation; shell scripts under scripts/ to run preprocessing and neg-prompt generation.
  • Supports configurable resolution, frame count (e.g. 129 frames), and optional subsampling for quick runs.

This is provided as a reference implementation; adapt paths and options for your own datasets and environment.
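
For readers adapting this to their own data, the fixed frame count and optional subsampling described above could be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the function names `clip_windows` and `align_actions`, the non-overlapping default stride, and the `max_clips` option are all hypothetical, and the real script's options may differ.

```python
from typing import List, Optional, Sequence, Tuple


def clip_windows(
    total_frames: int,
    clip_len: int = 129,
    stride: Optional[int] = None,
    max_clips: Optional[int] = None,
) -> List[Tuple[int, int]]:
    """Return [start, end) frame ranges for fixed-length clips.

    Hypothetical helper: a trailing remainder shorter than clip_len is dropped.
    """
    if clip_len <= 0:
        raise ValueError("clip_len must be positive")
    # Assumption: non-overlapping windows unless a stride is given.
    stride = clip_len if stride is None else stride
    if stride <= 0:
        raise ValueError("stride must be positive")

    windows = [
        (start, start + clip_len)
        for start in range(0, total_frames - clip_len + 1, stride)
    ]
    # Optional subsampling for quick runs: keep only the first few clips.
    if max_clips is not None:
        windows = windows[:max_clips]
    return windows


def align_actions(actions: Sequence, start: int, end: int) -> list:
    """Slice per-frame records so they match a clip's frames one-to-one."""
    clip = list(actions[start:end])
    if len(clip) != end - start:
        raise ValueError("action records do not cover the requested frame range")
    return clip
```

Slicing the pose and action records with the same `(start, end)` range as the video frames is one simple way to keep the streams temporally aligned; any further mapping onto VAE latent frames depends on the VAE used and is not shown here.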

