---
title: "Qwen-Image-Layered ComfyUI Workflow Example"
description: "Qwen-Image-Layered is a model capable of decomposing an image into multiple RGBA layers, enabling inherent editability through layer decomposition."
sidebarTitle: "Qwen-Image-Layered"
---

import UpdateReminder from '/snippets/tutorials/update-reminder.mdx'

**Qwen-Image-Layered** is a model developed by Alibaba's Qwen team that can decompose an image into multiple RGBA layers. This layered representation unlocks inherent editability: each layer can be edited on its own without disturbing the rest of the image.

**Key Features**:
- **Inherent Editability**: Each layer can be independently manipulated without affecting other content
- **High-Fidelity Elementary Operations**: Supports resizing, repositioning, and recoloring with physical isolation of semantic components
- **Variable-Layer Decomposition**: Not limited to a fixed number of layers; decompose into 3, 4, 8, or more layers as needed
- **Recursive Decomposition**: Any layer can be further decomposed, enabling arbitrary decomposition depth

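To make "inherent editability" concrete, here is a minimal sketch (using Pillow; the layer file names are hypothetical) that repositions one decomposed RGBA layer and recomposes the result without touching the other layers:

```python
# Minimal sketch: edit one RGBA layer independently, then recompose.
# Assumes the decomposed layers were saved as layer_0.png ... layer_3.png
# (hypothetical names), ordered back-to-front. Requires: pip install pillow
from PIL import Image

layers = [Image.open(f"layer_{i}.png").convert("RGBA") for i in range(4)]

# Reposition layer 2 by pasting it onto a fresh transparent canvas;
# the other layers stay untouched -- this is the point of the decomposition.
shifted = Image.new("RGBA", layers[2].size, (0, 0, 0, 0))
shifted.paste(layers[2], (40, 0), layers[2])
layers[2] = shifted

# Composite back-to-front to get the edited image.
canvas = Image.new("RGBA", layers[0].size, (0, 0, 0, 0))
for layer in layers:
    canvas = Image.alpha_composite(canvas, layer)
canvas.save("recomposed.png")
```
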
**Related Links**:
- [Hugging Face](https://huggingface.co/Qwen/Qwen-Image-Layered)
- [Research Paper](https://arxiv.org/abs/2512.15603)
- [Blog](https://qwenlm.github.io/blog/qwen-image-layered/)

## Qwen-Image-Layered workflow

<a className="prose" target='_blank' href="https://raw.githubusercontent.com/Comfy-Org/workflow_templates/refs/heads/main/templates/image_qwen_image_layered.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold', marginRight: '10px'}}>
  <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>Download JSON Workflow File</p>
</a>

<UpdateReminder />

## Model links

**text_encoders**

- [qwen_2.5_vl_7b_fp8_scaled.safetensors](https://huggingface.co/Comfy-Org/HunyuanVideo_1.5_repackaged/resolve/main/split_files/text_encoders/qwen_2.5_vl_7b_fp8_scaled.safetensors)

**diffusion_models**

- [qwen_image_layered_bf16.safetensors](https://huggingface.co/Comfy-Org/Qwen-Image-Layered_ComfyUI/resolve/main/split_files/diffusion_models/qwen_image_layered_bf16.safetensors)

**vae**

- [qwen_image_layered_vae.safetensors](https://huggingface.co/Comfy-Org/Qwen-Image-Layered_ComfyUI/resolve/main/split_files/vae/qwen_image_layered_vae.safetensors)

**Model Storage Location**

```
📂 ComfyUI/
├── 📂 models/
│   ├── 📂 text_encoders/
│   │   └── qwen_2.5_vl_7b_fp8_scaled.safetensors
│   ├── 📂 diffusion_models/
│   │   └── qwen_image_layered_bf16.safetensors
│   └── 📂 vae/
│       └── qwen_image_layered_vae.safetensors
```

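If you'd rather fetch these files from a script than a browser, here is a minimal sketch using the `huggingface_hub` Python package, assuming ComfyUI is installed at `./ComfyUI` (the repo IDs and file paths come from the links above):

```python
# Minimal sketch: download the three checkpoints into ComfyUI's model folders.
# Assumes ComfyUI lives at ./ComfyUI and huggingface_hub is installed
# (pip install huggingface_hub).
import shutil
from pathlib import Path
from huggingface_hub import hf_hub_download

FILES = [
    ("Comfy-Org/HunyuanVideo_1.5_repackaged",
     "split_files/text_encoders/qwen_2.5_vl_7b_fp8_scaled.safetensors",
     "ComfyUI/models/text_encoders"),
    ("Comfy-Org/Qwen-Image-Layered_ComfyUI",
     "split_files/diffusion_models/qwen_image_layered_bf16.safetensors",
     "ComfyUI/models/diffusion_models"),
    ("Comfy-Org/Qwen-Image-Layered_ComfyUI",
     "split_files/vae/qwen_image_layered_vae.safetensors",
     "ComfyUI/models/vae"),
]

for repo_id, filename, target_dir in FILES:
    cached = hf_hub_download(repo_id=repo_id, filename=filename)  # lands in the HF cache
    dest = Path(target_dir)
    dest.mkdir(parents=True, exist_ok=True)
    shutil.copy(cached, dest / Path(filename).name)  # place it where ComfyUI looks
```
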
## FP8 version

The workflow uses the bf16 model by default, which requires high VRAM. For lower VRAM usage, you can use the fp8 version instead:

- [qwen_image_layered_fp8mixed.safetensors](https://huggingface.co/Comfy-Org/Qwen-Image-Layered_ComfyUI/resolve/main/split_files/diffusion_models/qwen_image_layered_fp8mixed.safetensors)

Then update the **Load Diffusion Model** node inside the [Subgraph](/interface/features/subgraph) to use it.

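You can make the swap in the ComfyUI interface, or patch the downloaded workflow file directly. The sketch below is one way to do the latter, under two assumptions: that the "Load Diffusion Model" node is stored with its internal type name `UNETLoader`, and that subgraph nodes live under a `definitions.subgraphs` key (the exact layout can vary between ComfyUI versions, so inspect your JSON first):

```python
# Minimal sketch: point every "Load Diffusion Model" (UNETLoader) node at the
# fp8 checkpoint. The subgraph layout below is an assumption -- check your JSON.
import json

with open("image_qwen_image_layered.json") as f:
    wf = json.load(f)

def all_node_lists(graph):
    """Yield the top-level node list and, if present, each subgraph's nodes."""
    yield graph.get("nodes", [])
    for sub in graph.get("definitions", {}).get("subgraphs", []):
        yield sub.get("nodes", [])

for nodes in all_node_lists(wf):
    for node in nodes:
        if node.get("type") == "UNETLoader":
            # widgets_values[0] holds the checkpoint file name
            node["widgets_values"][0] = "qwen_image_layered_fp8mixed.safetensors"

with open("image_qwen_image_layered_fp8.json", "w") as f:
    json.dump(wf, f, indent=2)
```
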
## Workflow settings

### Sampler settings

This model is slow to sample. The model authors' original settings are 50 steps with CFG 4.0; using them instead of the template's defaults will at least double the generation time.

### Input size

An input size of 640px is recommended; use 1024px for high-resolution output.

### Prompt (optional)

The text prompt is meant to describe the overall content of the input image, including elements that may be partially occluded (for example, you may specify text hidden behind a foreground object). It is not designed to explicitly control the semantic content of individual layers.