You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This commit was created on GitHub.com and signed with GitHub’s verified signature.
🎨 New header logo design - Refreshed ASCII art banner (thanks @naxci1)
🧹 Remove dead flash attention wrapper - Removed legacy code from FP8CompatibleDiT; FlashAttentionVarlen already handles backend switching via its attention_mode attribute
🛡️ Fix graceful fallback from flash-attn - Add compatibility shims for corrupted flash_attn/xformers DLLs, preventing startup crashes when CUDA extensions are broken
📊 Improved VRAM tracking - Separate allocated vs reserved memory tracking, Windows-only overflow detection (WDDM paging behavior)