|
| 1 | +--- |
| 2 | +title: "ComfyUI Wan2.1 Fun Camera Official Examples" |
| 3 | +description: "This guide demonstrates how to use Wan2.1 Fun Camera in ComfyUI for video generation" |
| 4 | +sidebarTitle: "Wan2.1 Fun Camera" |
| 5 | +--- |
| 6 | + |
| 7 | +import UpdateReminder from '/snippets/tutorials/update-reminder.mdx' |
| 8 | + |
| 9 | +## About Wan2.1 Fun Camera |
| 10 | + |
| 11 | +**Wan2.1 Fun Camera** is a video generation project launched by the Alibaba team, focusing on controlling video generation effects through camera motion. |
| 12 | + |
| 13 | +**Model Weights Download**: |
| 14 | +- [14B Version](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-14B-Control-Camera) |
| 15 | +- [1.3B Version](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-1.3B-Control-Camera) |
| 16 | + |
| 17 | +**Code Repository**: [VideoX-Fun](https://github.com/aigc-apps/VideoX-Fun) |
| 18 | + |
| 19 | +**ComfyUI now natively supports the Wan2.1 Fun Camera model**. |
| 20 | + |
| 21 | +<UpdateReminder/> |
| 22 | + |
| 23 | +## Model Installation |
| 24 | + |
| 25 | +These models only need to be installed once. Additionally, model download information is included in the corresponding workflow images, so you can choose your preferred way to download the models. |
| 26 | + |
| 27 | +All of the following models can be found at [Wan_2.1_ComfyUI_repackaged](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged) |
| 28 | + |
| 29 | +**Diffusion Models** choose either 1.3B or 14B: |
| 30 | +- [wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/diffusion_models/wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors) |
| 31 | +- [wan2.1_fun_camera_v1.1_14B_bf16.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/diffusion_models/wan2.1_fun_camera_v1.1_14B_bf16.safetensors) |
| 32 | + |
| 33 | +If you've used Wan2.1 related models before, you should already have the following models. If not, please download them: |
| 34 | + |
| 35 | +**Text Encoders** choose one: |
| 36 | +- [umt5_xxl_fp16.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/text_encoders/umt5_xxl_fp16.safetensors) |
| 37 | +- [umt5_xxl_fp8_e4m3fn_scaled.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/text_encoders/umt5_xxl_fp8_e4m3fn_scaled.safetensors) |
| 38 | + |
| 39 | +**VAE** |
| 40 | +- [wan_2.1_vae.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/vae/wan_2.1_vae.safetensors) |
| 41 | + |
| 42 | +**CLIP Vision** |
| 43 | +- [clip_vision_h.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/clip_vision/clip_vision_h.safetensors) |
| 44 | + |
| 45 | +File Storage Location: |
| 46 | + |
| 47 | +``` |
| 48 | +📂 ComfyUI/ |
| 49 | +├── 📂 models/ |
| 50 | +│ ├── 📂 diffusion_models/ |
| 51 | +│ │ ├── wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors # 1.3B version |
| 52 | +│ │ └── wan2.1_fun_camera_v1.1_14B_bf16.safetensors # 14B version |
| 53 | +│ ├── 📂 text_encoders/ |
| 54 | +│ │ └── umt5_xxl_fp8_e4m3fn_scaled.safetensors |
| 55 | +│ ├── 📂 vae/ |
| 56 | +│ │ └── wan_2.1_vae.safetensors |
| 57 | +│ └── 📂 clip_vision/ |
| 58 | +│ └── clip_vision_h.safetensors |
| 59 | +``` |
| 60 | + |
| 61 | +## ComfyUI Wan2.1 Fun Camera 1.3B Native Workflow Example |
| 62 | + |
| 63 | +### 1. Workflow Related Files Download |
| 64 | + |
| 65 | +#### 1.1 Workflow File |
| 66 | + |
| 67 | +Download the video below and drag it into ComfyUI to load the corresponding workflow: |
| 68 | + |
| 69 | +<video |
| 70 | + controls |
| 71 | + className="w-full aspect-video" |
| 72 | + src="https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_1.3B.mp4" |
| 73 | +></video> |
| 74 | + |
| 75 | +<a className="prose" target='_blank' href="https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_1.3B.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold'}}> |
| 76 | + <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>Download Json Workflow File</p> |
| 77 | +</a> |
| 78 | + |
| 79 | +<Note> |
| 80 | +If you want to use the 14B version, simply replace the model file with the 14B version, but please be aware of the VRAM requirements. |
| 81 | +</Note> |
| 82 | + |
| 83 | +#### 1.2 Input Image Download |
| 84 | + |
| 85 | +Please download the image below, which we will use as the starting frame: |
| 86 | + |
| 87 | + |
| 88 | + |
| 89 | +### 2. Complete the Workflow Step by Step |
| 90 | + |
| 91 | + |
| 92 | + |
| 93 | +1. Ensure the correct version of model file is loaded: |
| 94 | + - 1.3B version: `wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors` |
| 95 | + - 14B version: `wan2.1_fun_camera_v1.1_14B_bf16.safetensors` |
| 96 | +2. Ensure the `Load CLIP` node has loaded `umt5_xxl_fp8_e4m3fn_scaled.safetensors` |
| 97 | +3. Ensure the `Load VAE` node has loaded `wan_2.1_vae.safetensors` |
| 98 | +4. Ensure the `Load CLIP Vision` node has loaded `clip_vision_h.safetensors` |
| 99 | +5. Upload the starting frame to the `Load Image` node |
| 100 | +6. Modify the Prompt if you're using your own input image |
| 101 | +7. Set camera motion in the `WanCameraEmbedding` node |
| 102 | +8. Click the `Run` button or use the shortcut `Ctrl(cmd) + Enter` to execute generation |
| 103 | + |
| 104 | +## ComfyUI Wan2.1 Fun Camera 14B Workflow and Input Image |
| 105 | + |
| 106 | +<video |
| 107 | + controls |
| 108 | + className="w-full aspect-video" |
| 109 | + src="https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_14B.mp4" |
| 110 | +></video> |
| 111 | + |
| 112 | +<a className="prose" target='_blank' href="https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_14B.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold'}}> |
| 113 | + <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>Download Json Workflow File</p> |
| 114 | +</a> |
| 115 | + |
| 116 | +**Input Image** |
| 117 | + |
| 118 | + |
| 119 | +## Performance Reference |
| 120 | + |
| 121 | +**1.3B Version**: |
| 122 | +- 512×512 resolution on RTX 4090 takes about 72 seconds to generate 81 frames |
| 123 | + |
| 124 | +**14B Version**: |
| 125 | +- RTX4090 24GB VRAM may experience insufficient memory when generating 512×512 resolution, and memory issues have also occurred on A100 when using larger sizes |
0 commit comments