[v0.1.1] Bagel, WanVideo, stream packing and refactor for better repo structure
What's Changed
- feat: Custom FSDP2 trainer by @kcz358 in #8
- feat: Add Save and Load logic for fsdp2 trainer by @kcz358 in #9
- Dev/bo 0809 by @Luodian in #10
- feat: Add flash-attn and liger-kernel dependencies by @Luodian in #11
- feat: Support Qwen2 for remove padding training by @kcz358 in #14
- [feat] enable dllm training by @BIGKnight in #15
- feat: Add cicd by @kcz358 in #16
- feat: LLaVA-Ov ops and liger-kernel rfc by @kcz358 in #17
- rfc: Better base dataset abstract class and flexible args for kwargs by @kcz358 in #20
- test: Multi-gpu cicd test for robustness by @kcz358 in #22
- Dev/wan by @BIGKnight in #23
- feat: Add Qwen2 ulysses sequence parallel by @kcz358 in #24
- rfc: Refactor video loading logic and processor by @kcz358 in #25
- rfc: Train implementation, monkey patch logic by @kcz358 in #28
- Add efficient loss for dllms by @yshenaw in #27
- feat: Add profiler by @kcz358 in #30
- fix: profile error by @kcz358 in #31
- feat: Support stream packing by @kcz358 in #32
- Dev/muon by @BIGKnight in #34
- fix: Force iterable max steps by @kcz358 in #35
- feat: Support bagel training by @kcz358 in #33
- fix: Image tensor size error by @pufanyi in #40
New Contributors
Full Changelog: v0.1.0...v0.1.1