Update fun camera docs (#176)

comfyui-wiki · web-flow · commit 3d874e1009e9 · 2025-06-10T20:34:07.000+08:00
* Update fun camera docs

* Update Chinese VACE docs
diff --git a/docs.json b/docs.json
@@ -124,6 +124,7 @@
                           "tutorials/video/wan/wan-video",
                           "tutorials/video/wan/vace",
                           "tutorials/video/wan/fun-control",
+                          "tutorials/video/wan/fun-camera",
                           "tutorials/video/wan/fun-inp",
                           "tutorials/video/wan/wan-flf"
                         ]
@@ -583,6 +584,7 @@
                         "pages": [
                           "zh-CN/tutorials/video/wan/wan-video",
                           "zh-CN/tutorials/video/wan/fun-control",
+                          "zh-CN/tutorials/video/wan/fun-camera",
                           "zh-CN/tutorials/video/wan/fun-inp",
                           "zh-CN/tutorials/video/wan/wan-flf",
                           "zh-CN/tutorials/video/wan/vace"
diff --git a/images/tutorial/video/wan/wan2-1-fun-camera-1-3b-step-guide.jpg b/images/tutorial/video/wan/wan2-1-fun-camera-1-3b-step-guide.jpg
diff --git a/images/tutorial/video/wan/wan2-1-fun-camera-14b.jpg b/images/tutorial/video/wan/wan2-1-fun-camera-14b.jpg
diff --git a/tutorials/video/wan/fun-camera.mdx b/tutorials/video/wan/fun-camera.mdx
@@ -0,0 +1,125 @@
+---
+title: "ComfyUI Wan2.1 Fun Camera Official Examples"
+description: "This guide demonstrates how to use Wan2.1 Fun Camera in ComfyUI for video generation"
+sidebarTitle: "Wan2.1 Fun Camera"
+---
+
+import UpdateReminder from '/snippets/tutorials/update-reminder.mdx'
+
+## About Wan2.1 Fun Camera
+
+**Wan2.1 Fun Camera** is a video generation project from the Alibaba team that controls the generated video through camera motion.
+
+**Model Weights Download**:
+- [14B Version](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-14B-Control-Camera)
+- [1.3B Version](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-1.3B-Control-Camera)
+
+**Code Repository**: [VideoX-Fun](https://github.com/aigc-apps/VideoX-Fun)
+
+**ComfyUI now natively supports the Wan2.1 Fun Camera model**.
+
+<UpdateReminder/>
+
+## Model Installation
+
+You only need to install these models once. The corresponding workflow files also include model download information, so you can download the models whichever way you prefer.
+
+All of the following models are available in [Wan_2.1_ComfyUI_repackaged](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged).
+
+**Diffusion Models** (choose either 1.3B or 14B):
+- [wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/diffusion_models/wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors)
+- [wan2.1_fun_camera_v1.1_14B_bf16.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/diffusion_models/wan2.1_fun_camera_v1.1_14B_bf16.safetensors)
+
+If you've used Wan2.1-related models before, you should already have the following models. If not, download them:
+
+**Text Encoders** (choose one):
+- [umt5_xxl_fp16.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/text_encoders/umt5_xxl_fp16.safetensors)
+- [umt5_xxl_fp8_e4m3fn_scaled.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/text_encoders/umt5_xxl_fp8_e4m3fn_scaled.safetensors)
+
+**VAE**
+- [wan_2.1_vae.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/vae/wan_2.1_vae.safetensors)
+
+**CLIP Vision**
+- [clip_vision_h.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/clip_vision/clip_vision_h.safetensors)
+
+File Storage Location:
+
+```
+📂 ComfyUI/
+├── 📂 models/
+│ ├── 📂 diffusion_models/
+│ │   ├── wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors # 1.3B version
+│ │   └── wan2.1_fun_camera_v1.1_14B_bf16.safetensors # 14B version
+│ ├── 📂 text_encoders/
+│ │   └── umt5_xxl_fp8_e4m3fn_scaled.safetensors
+│ ├── 📂 vae/
+│ │   └── wan_2.1_vae.safetensors
+│ └── 📂 clip_vision/
+│     └── clip_vision_h.safetensors
+```
+
+## ComfyUI Wan2.1 Fun Camera 1.3B Native Workflow Example
+
+### 1. Workflow Related Files Download
+
+#### 1.1 Workflow File
+
+Download the video below and drag it into ComfyUI to load the corresponding workflow:
+
+<video
+  controls
+  className="w-full aspect-video"
+  src="https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_1.3B.mp4"
+></video>
+
+<a className="prose"  target='_blank'  href="https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_1.3B.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold'}}>
+    <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>Download JSON Workflow File</p>
+</a>
+
+<Note>
+To use the 14B version, replace the model file with the 14B one, and keep its VRAM requirements in mind.
+</Note>
+
+#### 1.2 Input Image Download
+
+Please download the image below, which we will use as the starting frame:
+
+![Input Reference Image](https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_1.3B_input.jpg)
+
+### 2. Complete the Workflow Step by Step
+
+![Wan2.1 Fun Camera Workflow Steps](/images/tutorial/video/wan/wan2-1-fun-camera-1-3b-step-guide.jpg)
+
+1. Ensure the correct version of the model file is loaded:
+   - 1.3B version: `wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors`
+   - 14B version: `wan2.1_fun_camera_v1.1_14B_bf16.safetensors`
+2. Ensure the `Load CLIP` node has loaded `umt5_xxl_fp8_e4m3fn_scaled.safetensors`
+3. Ensure the `Load VAE` node has loaded `wan_2.1_vae.safetensors`
+4. Ensure the `Load CLIP Vision` node has loaded `clip_vision_h.safetensors`
+5. Upload the starting frame to the `Load Image` node
+6. Modify the Prompt if you're using your own input image
+7. Set camera motion in the `WanCameraEmbedding` node
+8. Click the `Run` button or press `Ctrl + Enter` (`Cmd + Enter` on macOS) to start generation
+
+## ComfyUI Wan2.1 Fun Camera 14B Workflow and Input Image
+
+<video
+  controls
+  className="w-full aspect-video"
+  src="https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_14B.mp4"
+></video>
+
+<a className="prose"  target='_blank'  href="https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_14B.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold'}}>
+    <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>Download JSON Workflow File</p>
+</a>
+
+**Input Image**
+![Input Image](https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_14B_input.jpg)
+
+## Performance Reference
+
+**1.3B Version**:
+- Generating 81 frames at 512×512 on an RTX 4090 takes about 72 seconds
+
+**14B Version**:
+- An RTX 4090 with 24GB of VRAM may run out of memory when generating at 512×512, and out-of-memory errors have also occurred on an A100 at larger sizes
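Reviewer sketch (not part of the committed file): the File Storage Location tree and the download links in `tutorials/video/wan/fun-camera.mdx` above can be turned into a quick pre-flight check before loading the workflow. The Python below is a minimal sketch under stated assumptions: `missing_models` and `EXPECTED_FILES` are hypothetical names, the `ComfyUI` root path is a placeholder, and only the files used by the 1.3B workflow steps are listed. The file names and the base URL are copied from the links in the tutorial.

```python
from pathlib import Path

# Base URL copied from the download links in the tutorial.
BASE_URL = (
    "https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged"
    "/resolve/main/split_files"
)

# Folder under ComfyUI/models/ -> file the 1.3B workflow steps expect.
# (Hypothetical mapping for illustration; names taken from the tutorial.)
EXPECTED_FILES = {
    "diffusion_models": "wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors",
    "text_encoders": "umt5_xxl_fp8_e4m3fn_scaled.safetensors",
    "vae": "wan_2.1_vae.safetensors",
    "clip_vision": "clip_vision_h.safetensors",
}


def missing_models(comfyui_root):
    """Return {relative path: download URL} for each expected file that is absent."""
    models_dir = Path(comfyui_root) / "models"
    missing = {}
    for folder, filename in EXPECTED_FILES.items():
        if not (models_dir / folder / filename).is_file():
            missing[f"models/{folder}/{filename}"] = f"{BASE_URL}/{folder}/{filename}"
    return missing


if __name__ == "__main__":
    # "ComfyUI" is a placeholder; point it at your own install directory.
    for path, url in missing_models("ComfyUI").items():
        print(f"missing {path}\n  download: {url}")
```

To check a 14B setup instead, swap the `diffusion_models` entry for `wan2.1_fun_camera_v1.1_14B_bf16.safetensors`.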
diff --git a/tutorials/video/wan/vace.mdx b/tutorials/video/wan/vace.mdx
@@ -105,7 +105,7 @@ Download the video below and drag it into ComfyUI to load the corresponding work
 
 ### 2. Complete the Workflow Step by Step
 
-![](/images/tutorial/video/wan/wan-vace-t2v-step-guide.jpg)
+![Wan VACE T2V workflow step guide](/images/tutorial/video/wan/wan-vace-t2v-step-guide.jpg)
 
 Please follow the numbered steps in the image to ensure smooth workflow execution
 
diff --git a/zh-CN/tutorials/video/wan/fun-camera.mdx b/zh-CN/tutorials/video/wan/fun-camera.mdx
@@ -0,0 +1,125 @@
+---
+title: "ComfyUI Wan2.1 Fun Camera 官方原生示例"
+description: "本文介绍了如何在 ComfyUI 中使用 Wan2.1 Fun Camera 完成视频生成"
+sidebarTitle: "Wan2.1 Fun Camera"
+---
+
+import UpdateReminder from '/snippets/zh/tutorials/update-reminder.mdx'
+
+## 关于 Wan2.1 Fun Camera
+
+**Wan2.1 Fun Camera** 是阿里团队推出的视频生成项目，专注于通过摄像机运动来控制视频生成效果。
+
+**模型权重下载地址**：
+- [14B 版本](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-14B-Control-Camera)
+- [1.3B 版本](https://huggingface.co/alibaba-pai/Wan2.1-Fun-V1.1-1.3B-Control-Camera)
+
+**代码仓库**：[VideoX-Fun](https://github.com/aigc-apps/VideoX-Fun)
+
+**目前 ComfyUI 已原生支持了 Wan2.1 Fun Camera 模型**。
+
+<UpdateReminder/>
+
+## 相关模型安装
+
+这些模型你仅需要安装一次，另外在对应的工作流图片中也包含了模型下载信息，你可以选择你喜欢的方式下载模型。
+
+下面的所有模型你可以在 [Wan_2.1_ComfyUI_repackaged](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged) 找到
+
+**Diffusion Models** 选择 1.3B 或 14B：
+- [wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/diffusion_models/wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors)
+- [wan2.1_fun_camera_v1.1_14B_bf16.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/diffusion_models/wan2.1_fun_camera_v1.1_14B_bf16.safetensors)
+
+下面的模型，如果你使用过 Wan2.1 的相关模型，那么你应该已经有了下面的模型，如果没有，请下载下面的模型：
+
+**Text Encoders** 选择其中一个：
+- [umt5_xxl_fp16.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/text_encoders/umt5_xxl_fp16.safetensors)
+- [umt5_xxl_fp8_e4m3fn_scaled.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/text_encoders/umt5_xxl_fp8_e4m3fn_scaled.safetensors)
+
+**VAE**
+- [wan_2.1_vae.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/vae/wan_2.1_vae.safetensors)
+
+**CLIP Vision**
+- [clip_vision_h.safetensors](https://huggingface.co/Comfy-Org/Wan_2.1_ComfyUI_repackaged/resolve/main/split_files/clip_vision/clip_vision_h.safetensors)
+
+文件保存位置：
+
+```
+📂 ComfyUI/
+├── 📂 models/
+│ ├── 📂 diffusion_models/
+│ │   ├── wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors # 1.3B 版本
+│ │   └── wan2.1_fun_camera_v1.1_14B_bf16.safetensors # 14B 版本
+│ ├── 📂 text_encoders/
+│ │   └── umt5_xxl_fp8_e4m3fn_scaled.safetensors
+│ ├── 📂 vae/
+│ │   └── wan_2.1_vae.safetensors
+│ └── 📂 clip_vision/
+│     └── clip_vision_h.safetensors
+```
+
+## ComfyUI Wan2.1 Fun Camera 1.3B 原生工作流示例
+
+### 1. 工作流相关文件下载
+
+#### 1.1 工作流文件
+
+下载下面的视频，并拖入 ComfyUI 中以加载对应的工作流：
+
+<video
+  controls
+  className="w-full aspect-video"
+  src="https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_1.3B.mp4"
+></video>
+
+<a className="prose"  target='_blank'  href="https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_1.3B.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold'}}>
+    <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>下载 Json 格式工作流文件</p>
+</a>
+<Note>
+如果你想使用 14B 版本，只需要将模型文件替换为 14B 版本即可，但请注意显存要求。
+</Note>
+
+#### 1.2 输入图片下载
+
+
+请下载下面的图片，我们将作为起始帧：
+
+![输入参考图片](https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_1.3B_input.jpg)
+
+### 2. 按步骤完成工作流
+
+![Wan2.1 Fun Camera 工作流步骤](/images/tutorial/video/wan/wan2-1-fun-camera-1-3b-step-guide.jpg)
+
+1. 确保加载了正确版本的模型文件：
+   - 1.3B 版本：`wan2.1_fun_camera_v1.1_1.3B_bf16.safetensors`
+   - 14B 版本：`wan2.1_fun_camera_v1.1_14B_bf16.safetensors`
+2. 确保 `Load CLIP` 节点加载了 `umt5_xxl_fp8_e4m3fn_scaled.safetensors`
+3. 确保 `Load VAE` 节点加载了 `wan_2.1_vae.safetensors`
+4. 确保 `Load CLIP Vision` 节点加载了 `clip_vision_h.safetensors`
+5. 在 `Load Image` 节点上传起始帧
+6. 修改 Prompt，如果你使用了你自己的图像输入
+7. 在 `WanCameraEmbedding` 节点设置相机动作
+8. 点击 `Run` 按钮，或使用快捷键 `Ctrl(cmd) + Enter(回车)` 执行生成
+
+## ComfyUI Wan2.1 Fun Camera 14B 工作流及输入图片
+
+<video
+  controls
+  className="w-full aspect-video"
+  src="https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_14B.mp4"
+></video>
+
+<a className="prose"  target='_blank'  href="https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_14B.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold'}}>
+    <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>下载 Json 格式工作流文件</p>
+</a>
+
+**输入图片**
+![输入图片](https://raw.githubusercontent.com/Comfy-Org/example_workflows/refs/heads/main/video/wan/fun-camera/v1.1/wan2.1_fun_camera_14B_input.jpg)
+
+## 性能参考
+
+**1.3B 版本**：
+- 512×512 RTX 4090 生成 81 帧约需 72 秒
+
+**14B 版本**：
+- RTX 4090 24GB 显存在生成 512×512 分辨率时可能会出现显存不足，在 A100 上运行尺寸过大时也出现过显存不足的情况
diff --git a/zh-CN/tutorials/video/wan/vace.mdx b/zh-CN/tutorials/video/wan/vace.mdx
@@ -108,7 +108,7 @@ VACE 14B 是阿里通义万相团队推出的开源视频编辑统一模型。
 
 ### 2. 按步骤完成工作流的运行
 
-![](/images/tutorial/video/wan/wan-vace-t2v-step-guide.jpg)
+![图像](/images/tutorial/video/wan/wan-vace-t2v-step-guide.jpg)
 
 请参照图片序号进行逐步确认，来保证对应工作流的顺利运行
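Reviewer sketch (not part of the commit): the two `docs.json` hunks at the top of this commit each add a page path, and each path appears to correspond to an `.mdx` file added in the same commit. The Python below is a minimal sketch of that consistency check. The rule "page entry plus `.mdx` equals file path" is an assumption read off this diff, not a documented guarantee, and `pages_without_files` is a hypothetical helper name.

```python
def pages_without_files(page_entries, changed_files):
    """Return page entries that have no '<entry>.mdx' file among changed_files."""
    files = set(changed_files)
    return [entry for entry in page_entries if f"{entry}.mdx" not in files]


# Page paths added to docs.json in this commit.
added_pages = [
    "tutorials/video/wan/fun-camera",
    "zh-CN/tutorials/video/wan/fun-camera",
]

# Text files touched by this commit, taken from the diff headers.
changed = [
    "docs.json",
    "tutorials/video/wan/fun-camera.mdx",
    "tutorials/video/wan/vace.mdx",
    "zh-CN/tutorials/video/wan/fun-camera.mdx",
    "zh-CN/tutorials/video/wan/vace.mdx",
]

# An empty list means every added navigation entry has a matching page file.
print(pages_without_files(added_pages, changed))
```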