[Feature] Development Roadmap (V1 Training & Distillation & VSA Release)


Contritbutions are welcome!

## Focus
- Efficient video model finetuning (T2V and I2V)
- Distillation (DMD & PCM)
- VSA Kernel Release
- Data loader optimization
- Mixed-resolution and no-precompute support (T5, VAE, LoRA)

## Model Development & Finetuning
- [x] T2V finetuning experiments on crush-smol @SolitaryThinker  @jzhang38 
- [x] I2V finetuning experiments on crush-smol @JerryZhou54 @SolitaryThinker  #559 
- [x] Fine-tune VSA-based T2V model @BrianChen1129   https://github.com/hao-ai-lab/FastVideo/pull/478

## Distillation & Alignment
- [ ] ~V1 PCM Distillation @jzhang38  @Eigensystem    https://github.com/hao-ai-lab/FastVideo/pull/471~
- [x] V1 DMD Distillation @jzhang38 @Eigensystem 

## Preprocessing & Dataset
- [x] Refactor map-style parquet dataset @jzhang38  https://github.com/hao-ai-lab/FastVideo/pull/492
- [x] Write iterable, resumable style parquet dataset @jzhang38  https://github.com/hao-ai-lab/FastVideo/pull/492
- [x] Refactor preprocess @SolitaryThinker #539  
- [x] Latent precomputation for image-to-video training @BrianChen1129  
- [x] benchmark dataloader speed, implement iterable dataset and @SolitaryThinker   
- [ ] Multi-resolution bucketing for long video input compatibility @rlsu9  @Eigensystem   

## Training Infrastructure
- [x] Enable checkpoint saving/resume for distributed experiments @kevin314   
- [x] LoRA Inference  @Edenzzzz   https://github.com/hao-ai-lab/FastVideo/pull/451
- [x] Accelerate weight loading https://github.com/hao-ai-lab/FastVideo/pull/470
- [x] LoRA Training @Edenzzzz https://github.com/hao-ai-lab/FastVideo/pull/576
- [x] Support Video-Fun Wan 1.3B I2V model @SolitaryThinker @JerryZhou54 
- [ ] Activation offloading @JerryZhou54   
- [x] Various Activation checkpointing methods @Eigensystem #584 
- [x] Torch compile and regional compile @jzhang38   https://github.com/hao-ai-lab/FastVideo/pull/684
- [ ] LigerKernels/SGLang cuda kernels for norm & activation dispatch: Call out for contributions!
- [ ] Enable training with no precomputed latents @rlsu9 @Eigensystem  
- [ ] Reduce the number of changed layers in `_param_names_mapping` @jzhang38 


## VSA Kernel Release
- [x] VSA H100 Kernel @jzhang38   https://github.com/hao-ai-lab/FastVideo/pull/478
- [x] VSA 4090 Kernel @jzhang38   


## CI Infrastructure
- [x] Migrate to BuildKite + Modal


### Related resources

_No response_

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Feature] Development Roadmap (V1 Training & Distillation & VSA Release) #468

Focus

Model Development & Finetuning

Distillation & Alignment

Preprocessing & Dataset

Training Infrastructure

VSA Kernel Release

CI Infrastructure

Related resources

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature] Development Roadmap (V1 Training & Distillation & VSA Release) #468

Description

Focus

Model Development & Finetuning

Distillation & Alignment

Preprocessing & Dataset

Training Infrastructure

VSA Kernel Release

CI Infrastructure

Related resources

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions