Description
Contributions are welcome!
Focus
- Efficient video model finetuning (T2V and I2V)
- Distillation (DMD & PCM)
- VSA Kernel Release
- Data loader optimization
- Mixed-resolution and no-precompute support (T5, VAE, LoRA)
Model Development & Finetuning
- T2V finetuning experiments on crush-smol @SolitaryThinker @jzhang38
- I2V finetuning experiments on crush-smol @JerryZhou54 @SolitaryThinker [Feature] [Training] Add i2v training #559
- Fine-tune VSA-based T2V model @BrianChen1129 [Feature] Adding VSA inference #478
Distillation & Alignment
- V1 PCM Distillation @jzhang38 @Eigensystem PCM Distillation #471
- V1 DMD Distillation @jzhang38 @Eigensystem
Preprocessing & Dataset
- Refactor map-style parquet dataset @jzhang38 [Feat][Dataloader] 1/n Refactor parquet map-style dataloader #492
- Write iterable, resumable style parquet dataset @jzhang38 [Feat][Dataloader] 1/n Refactor parquet map-style dataloader #492
- Refactor preprocess @SolitaryThinker [Training] Refactor and improve validation datasets #539
- Latent precomputation for image-to-video training @BrianChen1129
- Benchmark dataloader speed and implement iterable dataset @SolitaryThinker
- Multi-resolution bucketing for long video input compatibility @rlsu9 @Eigensystem
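For contributors picking up the multi-resolution bucketing item, here is a minimal sketch of the general idea, not the project's implementation. It assumes each sample is described by a hypothetical `(sample_id, num_frames, height, width)` tuple; `bucket_key`, `make_batches`, and the step sizes are illustrative names and values chosen for this example. Samples are grouped by rounded-down shape so that every batch holds clips of one shape.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Hypothetical sample record: (sample_id, num_frames, height, width).
Sample = Tuple[str, int, int, int]
BucketKey = Tuple[int, int, int]


def bucket_key(sample: Sample, frame_step: int = 16, size_step: int = 64) -> BucketKey:
    """Round each dimension down to a multiple of its step (at least one step)."""
    _, frames, height, width = sample
    return (
        max(frame_step, frames // frame_step * frame_step),
        max(size_step, height // size_step * size_step),
        max(size_step, width // size_step * size_step),
    )


def make_batches(samples: List[Sample], batch_size: int) -> List[Tuple[BucketKey, List[str]]]:
    """Group sample ids by bucket, then cut each bucket into fixed-size batches."""
    buckets: Dict[BucketKey, List[str]] = defaultdict(list)
    for sample in samples:
        buckets[bucket_key(sample)].append(sample[0])

    batches: List[Tuple[BucketKey, List[str]]] = []
    for key in sorted(buckets):  # deterministic bucket order
        ids = buckets[key]
        for start in range(0, len(ids), batch_size):
            batches.append((key, ids[start:start + batch_size]))
    return batches


samples = [
    ("a", 17, 480, 832),
    ("b", 30, 500, 850),
    ("c", 81, 720, 1280),
    ("d", 16, 448, 832),
]
batches = make_batches(samples, batch_size=2)
```

In this sketch each clip would still need to be cropped or resized to its bucket shape before collation, and the last batch of a bucket may be smaller than `batch_size`; whether to drop or pad such batches is a design choice left open here.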
Training Infrastructure
- Enable checkpoint saving/resume for distributed experiments @kevin314
- LoRA Inference @Edenzzzz [LoRA] Support V1 LoRA inference #451
- Accelerate weight loading [Feature] Load weights from distributed #470
- LoRA Training @Edenzzzz [LoRA] Support v1 LoRA training #576
- Support Video-Fun Wan 1.3B I2V model @SolitaryThinker @JerryZhou54
- Activation offloading @JerryZhou54
- Various activation checkpointing methods @Eigensystem [Feat] activation checkpointing #584
- Torch compile and regional compile @jzhang38 [Feature] Optionally enable torch compile #684
- LigerKernels/SGLang CUDA kernels for norm & activation dispatch: contributions wanted!
- Enable training with no precomputed latents @rlsu9 @Eigensystem
- Reduce the number of changed layers in _param_names_mapping @jzhang38
VSA Kernel Release
- VSA H100 Kernel @jzhang38 [Feature] Adding VSA inference #478
- VSA 4090 Kernel @jzhang38
CI Infrastructure
- Migrate to BuildKite + Modal