Skip to content

Commit 033c220

Browse files
Add Qwen-Image-Layered Tutorial Documentation (#665)
* Update tutorials/image/qwen/qwen-image-layered.mdx Co-Authored-By: mintlify[bot] <109931778+mintlify[bot]@users.noreply.github.com> * Update docs.json Co-Authored-By: mintlify[bot] <109931778+mintlify[bot]@users.noreply.github.com> * Update docs.json Co-Authored-By: mintlify[bot] <109931778+mintlify[bot]@users.noreply.github.com> * Update zh-CN/tutorials/image/qwen/qwen-image-layered.mdx Co-Authored-By: mintlify[bot] <109931778+mintlify[bot]@users.noreply.github.com> --------- Co-authored-by: mintlify[bot] <109931778+mintlify[bot]@users.noreply.github.com>
1 parent a9d3ef4 commit 033c220

File tree

3 files changed

+158
-2
lines changed

3 files changed

+158
-2
lines changed

docs.json

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -141,7 +141,8 @@
141141
"group": "Qwen",
142142
"pages": [
143143
"tutorials/image/qwen/qwen-image",
144-
"tutorials/image/qwen/qwen-image-edit"
144+
"tutorials/image/qwen/qwen-image-edit",
145+
"tutorials/image/qwen/qwen-image-layered"
145146
]
146147
},
147148
{
@@ -785,7 +786,8 @@
785786
"group": "Qwen",
786787
"pages": [
787788
"zh-CN/tutorials/image/qwen/qwen-image",
788-
"zh-CN/tutorials/image/qwen/qwen-image-edit"
789+
"zh-CN/tutorials/image/qwen/qwen-image-edit",
790+
"zh-CN/tutorials/image/qwen/qwen-image-layered"
789791
]
790792
},
791793
{
Lines changed: 77 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,77 @@
1+
---
2+
title: "Qwen-Image-Layered ComfyUI Workflow Example"
3+
description: "Qwen-Image-Layered is a model capable of decomposing an image into multiple RGBA layers, enabling inherent editability through layer decomposition."
4+
sidebarTitle: "Qwen-Image-Layered"
5+
---
6+
7+
import UpdateReminder from '/snippets/tutorials/update-reminder.mdx'
8+
9+
**Qwen-Image-Layered** is a model developed by Alibaba's Qwen team that can decompose an image into multiple RGBA layers. This layered representation unlocks inherent editability: each layer can be independently manipulated without affecting other content.
10+
11+
**Key Features**:
12+
- **Inherent Editability**: Each layer can be independently manipulated without affecting other content
13+
- **High-Fidelity Elementary Operations**: Supports resizing, repositioning, and recoloring with physical isolation of semantic components
14+
- **Variable-Layer Decomposition**: Not limited to a fixed number of layers - decompose into 3, 4, 8, or more layers as needed
15+
- **Recursive Decomposition**: Any layer can be further decomposed, enabling infinite decomposition depth
16+
17+
**Related Links**:
18+
- [Hugging Face](https://huggingface.co/Qwen/Qwen-Image-Layered)
19+
- [Research Paper](https://arxiv.org/abs/2512.15603)
20+
- [Blog](https://qwenlm.github.io/blog/qwen-image-layered/)
21+
22+
## Qwen-Image-Layered workflow
23+
24+
<a className="prose" target='_blank' href="https://raw.githubusercontent.com/Comfy-Org/workflow_templates/refs/heads/main/templates/image_qwen_image_layered.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold', marginRight: '10px'}}>
25+
<p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>Download JSON Workflow File</p>
26+
</a>
27+
28+
<UpdateReminder />
29+
30+
## Model links
31+
32+
**text_encoders**
33+
34+
- [qwen_2.5_vl_7b_fp8_scaled.safetensors](https://huggingface.co/Comfy-Org/HunyuanVideo_1.5_repackaged/resolve/main/split_files/text_encoders/qwen_2.5_vl_7b_fp8_scaled.safetensors)
35+
36+
**diffusion_models**
37+
38+
- [qwen_image_layered_bf16.safetensors](https://huggingface.co/Comfy-Org/Qwen-Image-Layered_ComfyUI/resolve/main/split_files/diffusion_models/qwen_image_layered_bf16.safetensors)
39+
40+
**vae**
41+
42+
- [qwen_image_layered_vae.safetensors](https://huggingface.co/Comfy-Org/Qwen-Image-Layered_ComfyUI/resolve/main/split_files/vae/qwen_image_layered_vae.safetensors)
43+
44+
**Model Storage Location**
45+
46+
```
47+
📂 ComfyUI/
48+
├── 📂 models/
49+
│ ├── 📂 text_encoders/
50+
│ │ └── qwen_2.5_vl_7b_fp8_scaled.safetensors
51+
│ ├── 📂 diffusion_models/
52+
│ │ └── qwen_image_layered_bf16.safetensors
53+
│ └── 📂 vae/
54+
│ └── qwen_image_layered_vae.safetensors
55+
```
56+
57+
## FP8 version
58+
59+
By default we are using bf16, which requires high VRAM. For lower VRAM usage, you can use the fp8 version:
60+
61+
- [qwen_image_layered_fp8mixed.safetensors](https://huggingface.co/Comfy-Org/Qwen-Image-Layered_ComfyUI/blob/main/split_files/diffusion_models/qwen_image_layered_fp8mixed.safetensors)
62+
63+
Then update the **Load Diffusion model** node inside the [Subgraph](/interface/features/subgraph) to use it.
64+
65+
## Workflow settings
66+
67+
### Sampler settings
68+
69+
This model is slow. The original sampling settings are steps: 50 and CFG: 4.0, which will at least double the generation time.
70+
71+
### Input size
72+
73+
For input size, 640px is recommended. Use 1024px for high-resolution output.
74+
75+
### Prompt (optional)
76+
77+
The text prompt is intended to describe the overall content of the input image—including elements that may be partially occluded (e.g., you may specify the text hidden behind a foreground object). It is not designed to control the semantic content of individual layers explicitly.
Lines changed: 77 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,77 @@
1+
---
2+
title: "Qwen-Image-Layered ComfyUI 工作流示例"
3+
description: "Qwen-Image-Layered 是一个能够将图像分解为多个 RGBA 图层的模型,通过图层分解实现固有的可编辑性。"
4+
sidebarTitle: "Qwen-Image-Layered"
5+
---
6+
7+
import UpdateReminder from '/snippets/zh/tutorials/update-reminder.mdx'
8+
9+
**Qwen-Image-Layered** 是阿里巴巴通义千问团队开发的模型,能够将图像分解为多个 RGBA 图层。这种分层表示解锁了固有的可编辑性:每个图层都可以独立操作而不影响其他内容。
10+
11+
**主要特性**
12+
- **固有可编辑性**:每个图层都可以独立操作而不影响其他内容
13+
- **高保真基础操作**:支持调整大小、重新定位和重新着色,语义组件物理隔离
14+
- **可变图层分解**:不限于固定数量的图层 - 可根据需要分解为 3、4、8 或更多图层
15+
- **递归分解**:任何图层都可以进一步分解,实现无限分解深度
16+
17+
**相关链接**
18+
- [Hugging Face](https://huggingface.co/Qwen/Qwen-Image-Layered)
19+
- [研究论文](https://arxiv.org/abs/2512.15603)
20+
- [博客](https://qwenlm.github.io/blog/qwen-image-layered/)
21+
22+
## Qwen-Image-Layered 工作流
23+
24+
<a className="prose" target='_blank' href="https://raw.githubusercontent.com/Comfy-Org/workflow_templates/refs/heads/main/templates/image_qwen_image_layered.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold', marginRight: '10px'}}>
25+
<p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>下载 JSON 格式工作流</p>
26+
</a>
27+
28+
<UpdateReminder />
29+
30+
## 模型下载
31+
32+
**text_encoders**
33+
34+
- [qwen_2.5_vl_7b_fp8_scaled.safetensors](https://huggingface.co/Comfy-Org/HunyuanVideo_1.5_repackaged/resolve/main/split_files/text_encoders/qwen_2.5_vl_7b_fp8_scaled.safetensors)
35+
36+
**diffusion_models**
37+
38+
- [qwen_image_layered_bf16.safetensors](https://huggingface.co/Comfy-Org/Qwen-Image-Layered_ComfyUI/resolve/main/split_files/diffusion_models/qwen_image_layered_bf16.safetensors)
39+
40+
**vae**
41+
42+
- [qwen_image_layered_vae.safetensors](https://huggingface.co/Comfy-Org/Qwen-Image-Layered_ComfyUI/resolve/main/split_files/vae/qwen_image_layered_vae.safetensors)
43+
44+
**模型保存位置**
45+
46+
```
47+
📂 ComfyUI/
48+
├── 📂 models/
49+
│ ├── 📂 text_encoders/
50+
│ │ └── qwen_2.5_vl_7b_fp8_scaled.safetensors
51+
│ ├── 📂 diffusion_models/
52+
│ │ └── qwen_image_layered_bf16.safetensors
53+
│ └── 📂 vae/
54+
│ └── qwen_image_layered_vae.safetensors
55+
```
56+
57+
## FP8 版本
58+
59+
默认使用 bf16,需要较高显存。如需降低显存使用,可以使用 fp8 版本:
60+
61+
- [qwen_image_layered_fp8mixed.safetensors](https://huggingface.co/Comfy-Org/Qwen-Image-Layered_ComfyUI/blob/main/split_files/diffusion_models/qwen_image_layered_fp8mixed.safetensors)
62+
63+
然后更新[子图](/zh-CN/interface/features/subgraph)中的 **Load Diffusion model** 节点来使用它。
64+
65+
## 工作流设置
66+
67+
### 采样器设置
68+
69+
该模型运行较慢。原始采样设置为 steps: 50 和 CFG: 4.0,这将至少使生成时间翻倍。
70+
71+
### 输入尺寸
72+
73+
输入尺寸建议使用 640px。如需高分辨率输出,请使用 1024px。
74+
75+
### 提示词(可选)
76+
77+
文本提示词用于描述输入图像的整体内容——包括可能被部分遮挡的元素(例如,你可以指定隐藏在前景物体后面的文字)。它不是用来明确控制各个图层的语义内容的。

0 commit comments

Comments
 (0)