|
| 1 | +--- |
| 2 | +title: "Kandinsky 5.0" |
| 3 | +description: "This guide shows how to use Kandinsky 5.0 video generation workflows in ComfyUI" |
| 4 | +sidebarTitle: "Kandinsky 5.0" |
| 5 | +--- |
| 6 | + |
| 7 | +import UpdateReminder from "/snippets/tutorials/update-reminder.mdx"; |
| 8 | + |
| 9 | +[Kandinsky 5.0](https://huggingface.co/kandinskylab/Kandinsky-5.0-I2V-Lite-5s) is a family of diffusion models for video and image generation developed by [Kandinsky Lab](https://huggingface.co/kandinskylab). The Kandinsky 5.0 T2V Lite is a lightweight 2B parameter model that ranks among the top open-source video generation models, capable of generating videos up to 10 seconds long. |
| 10 | + |
| 11 | +<UpdateReminder/> |
| 12 | + |
| 13 | +## Overview |
| 14 | + |
| 15 | +Kandinsky 5.0 uses a latent diffusion pipeline with Flow Matching and features: |
| 16 | + |
| 17 | +- **Diffusion Transformer (DiT):** Main generative backbone with cross-attention to text embeddings |
| 18 | +- **Qwen2.5-VL and CLIP:** Provides high-quality text embeddings |
| 19 | +- **HunyuanVideo 3D VAE:** Encodes and decodes video into a latent space |
| 20 | + |
| 21 | +The model family includes multiple variants optimized for different use cases: |
| 22 | +- **SFT model:** Highest generation quality |
| 23 | +- **CFG-distilled:** 2× faster inference |
| 24 | +- **Diffusion-distilled:** 6× faster with minimal quality loss (16 steps) |
| 25 | +- **Pretrain model:** Designed for fine-tuning |
| 26 | + |
| 27 | +All models are available in 5-second and 10-second video generation versions. |
| 28 | + |
| 29 | +## Model variants |
| 30 | + |
| 31 | +| Model | Video Duration | NFE | Latency (H100) | |
| 32 | +|-------|---------------|-----|----------------| |
| 33 | +| Kandinsky 5.0 T2V Lite SFT | 5s / 10s | 100 | 139s / 224s | |
| 34 | +| Kandinsky 5.0 T2V Lite no-CFG | 5s / 10s | 50 | 77s / 124s | |
| 35 | +| Kandinsky 5.0 T2V Lite distill | 5s / 10s | 16 | 35s / 61s | |
| 36 | +| Kandinsky 5.0 I2V Lite | 5s | 100 | 673s | |
| 37 | + |
| 38 | +## Text-to-Video workflow |
| 39 | + |
| 40 | +### 1. Download workflow file |
| 41 | + |
| 42 | +Please update your ComfyUI to the latest version, and through the menu `Workflow` -> `Browse Templates` -> `Video`, find "Kandinsky 5.0 T2V" to load the workflow. |
| 43 | + |
| 44 | +<a className="prose" target='_blank' href="https://raw.githubusercontent.com/Comfy-Org/workflow_templates/refs/heads/main/templates/video_kandinsky5_t2v.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold'}}> |
| 45 | + <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>Download JSON Workflow File</p> |
| 46 | +</a> |
| 47 | + |
| 48 | +### 2. Manually download models |
| 49 | + |
| 50 | +**Text Encoders** |
| 51 | +- [qwen_2.5_vl_7b_fp8_scaled.safetensors](https://huggingface.co/Comfy-Org/HunyuanVideo_1.5_repackaged/resolve/main/split_files/text_encoders/qwen_2.5_vl_7b_fp8_scaled.safetensors) |
| 52 | +- [clip_l.safetensors](https://huggingface.co/comfyanonymous/flux_text_encoders/resolve/main/clip_l.safetensors) |
| 53 | + |
| 54 | +**Diffusion Model** |
| 55 | +- [kandinsky5lite_t2v_sft_5s.safetensors](https://huggingface.co/kandinskylab/Kandinsky-5.0-T2V-Lite-sft-5s/resolve/main/model/kandinsky5lite_t2v_sft_5s.safetensors) |
| 56 | + |
| 57 | +**VAE** |
| 58 | +- [hunyuan_video_vae_bf16.safetensors](https://huggingface.co/Kijai/HunyuanVideo_comfy/resolve/main/hunyuan_video_vae_bf16.safetensors) |
| 59 | + |
| 60 | +``` |
| 61 | +ComfyUI/ |
| 62 | +├── 📂 models/ |
| 63 | +│ ├── 📂 text_encoders/ |
| 64 | +│ │ ├── qwen_2.5_vl_7b_fp8_scaled.safetensors |
| 65 | +│ │ └── clip_l.safetensors |
| 66 | +│ ├── 📂 diffusion_models/ |
| 67 | +│ │ └── kandinsky5lite_t2v_sft_5s.safetensors |
| 68 | +│ └── 📂 vae/ |
| 69 | +│ └── hunyuan_video_vae_bf16.safetensors |
| 70 | +``` |
| 71 | + |
| 72 | +## Image-to-Video workflow |
| 73 | + |
| 74 | +### 1. Download workflow file |
| 75 | + |
| 76 | +Please update your ComfyUI to the latest version, and through the menu `Workflow` -> `Browse Templates` -> `Video`, find "Kandinsky 5.0 I2V" to load the workflow. |
| 77 | + |
| 78 | +<a className="prose" target='_blank' href="https://raw.githubusercontent.com/Comfy-Org/workflow_templates/refs/heads/main/templates/video_kandinsky5_i2v.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold'}}> |
| 79 | + <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>Download JSON Workflow File</p> |
| 80 | +</a> |
| 81 | + |
| 82 | +### 2. Manually download models |
| 83 | + |
| 84 | +**Text Encoders** |
| 85 | +- [qwen_2.5_vl_7b_fp8_scaled.safetensors](https://huggingface.co/Comfy-Org/HunyuanVideo_1.5_repackaged/resolve/main/split_files/text_encoders/qwen_2.5_vl_7b_fp8_scaled.safetensors) |
| 86 | +- [clip_l.safetensors](https://huggingface.co/comfyanonymous/flux_text_encoders/resolve/main/clip_l.safetensors) |
| 87 | + |
| 88 | +**Diffusion Model** |
| 89 | +- [kandinsky5lite_i2v_sft_5s.safetensors](https://huggingface.co/kandinskylab/Kandinsky-5.0-I2V-Lite-5s/resolve/main/model/kandinsky5lite_i2v_sft_5s.safetensors) |
| 90 | + |
| 91 | +**VAE** |
| 92 | +- [hunyuan_video_vae_bf16.safetensors](https://huggingface.co/Kijai/HunyuanVideo_comfy/resolve/main/hunyuan_video_vae_bf16.safetensors) |
| 93 | + |
| 94 | +``` |
| 95 | +ComfyUI/ |
| 96 | +├── 📂 models/ |
| 97 | +│ ├── 📂 text_encoders/ |
| 98 | +│ │ ├── qwen_2.5_vl_7b_fp8_scaled.safetensors |
| 99 | +│ │ └── clip_l.safetensors |
| 100 | +│ ├── 📂 diffusion_models/ |
| 101 | +│ │ └── kandinsky5lite_i2v_sft_5s.safetensors |
| 102 | +│ └── 📂 vae/ |
| 103 | +│ └── hunyuan_video_vae_bf16.safetensors |
| 104 | +``` |
| 105 | + |
| 106 | +## Resources |
| 107 | + |
| 108 | +- [HuggingFace Model Collection](https://huggingface.co/collections/kandinskylab/kandinsky-50-video-lite) |
| 109 | +- [GitHub Repository](https://github.com/ai-forever/Kandinsky-5) |
| 110 | +- [ComfyUI Integration](https://github.com/ai-forever/Kandinsky-5/blob/main/comfyui/README.md) |
| 111 | +- [Project Page](https://ai-forever.github.io/Kandinsky-5/) |
0 commit comments