---
title: Qwen-Image ComfyUI Native Workflow Example
description: Qwen-Image is a 20B parameter MMDiT (Multimodal Diffusion Transformer) model open-sourced under the Apache 2.0 license.
sidebarTitle: Qwen-Image
---

import UpdateReminder from '/snippets/tutorials/update-reminder.mdx'

Qwen-Image is the first image generation foundation model released by Alibaba's Qwen team. It's a 20B parameter MMDiT (Multimodal Diffusion Transformer) model open-sourced under the Apache 2.0 license. The model has made significant advances in complex text rendering and precise image editing, achieving high-fidelity output for multiple languages including English and Chinese.

Model Highlights:

  • Excellent Multilingual Text Rendering: Supports high-precision text generation in multiple languages, including English, Chinese, Korean, and Japanese, while maintaining font details and layout consistency
  • Diverse Artistic Styles: From photorealistic scenes to impressionist paintings, from anime aesthetics to minimalist design, fluidly adapting to various creative prompts

Related Links:

## Qwen-Image Native Workflow Example

The models used in this document can be obtained from Hugging Face or ModelScope.

### 1. Workflow File

After updating ComfyUI, you can find this workflow in the built-in templates, or drag the file below into ComfyUI to load the Qwen-Image text-to-image workflow.

<a className="prose" target='_blank' href="https://raw.githubusercontent.com/Comfy-Org/workflow_templates/refs/heads/main/templates/image_qwen_image.json" style={{ display: 'inline-block', backgroundColor: '#0078D6', color: '#ffffff', padding: '10px 20px', borderRadius: '8px', borderColor: "transparent", textDecoration: 'none', fontWeight: 'bold'}}> <p className="prose" style={{ margin: 0, fontSize: "0.8rem" }}>Download JSON Workflow</p> </a>

### 2. Model Download

You can find all the models on Hugging Face or ModelScope.

  • Diffusion Model: qwen_image_fp8_e4m3fn.safetensors
  • Text Encoder: qwen_2.5_vl_7b_fp8_scaled.safetensors
  • VAE: qwen_image_vae.safetensors

Model Storage Location

📂 ComfyUI/
├── 📂 models/
│   ├── 📂 diffusion_models/
│   │   └── qwen_image_fp8_e4m3fn.safetensors
│   ├── 📂 vae/
│   │   └── qwen_image_vae.safetensors
│   └── 📂 text_encoders/
│       └── qwen_2.5_vl_7b_fp8_scaled.safetensors
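Before loading the workflow, it can help to confirm the files landed in the right folders. The sketch below checks for the three model files in the layout shown above; the `"ComfyUI"` root path is an assumption and should be adjusted to your installation directory.

```python
import os

# Expected model files relative to the ComfyUI installation root,
# mirroring the storage layout shown above.
EXPECTED_MODELS = [
    "models/diffusion_models/qwen_image_fp8_e4m3fn.safetensors",
    "models/vae/qwen_image_vae.safetensors",
    "models/text_encoders/qwen_2.5_vl_7b_fp8_scaled.safetensors",
]

def missing_models(comfyui_root: str) -> list[str]:
    """Return the expected model files that are not present under comfyui_root."""
    return [
        path for path in EXPECTED_MODELS
        if not os.path.isfile(os.path.join(comfyui_root, path))
    ]

if __name__ == "__main__":
    # Adjust this path to your actual ComfyUI installation.
    missing = missing_models("ComfyUI")
    if missing:
        print("Missing model files:")
        for path in missing:
            print("  -", path)
    else:
        print("All Qwen-Image model files are in place.")
```

If any file is listed as missing, the corresponding loader node in the workflow will fail to find it.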

### 3. Complete the Workflow Step by Step

Step Guide

  1. Load qwen_image_fp8_e4m3fn.safetensors in the Load Diffusion Model node
  2. Load qwen_2.5_vl_7b_fp8_scaled.safetensors in the Load CLIP node
  3. Load qwen_image_vae.safetensors in the Load VAE node
  4. Set image dimensions in the EmptySD3LatentImage node
  5. Enter your prompts in the CLIP Text Encoder (supports English, Chinese, Korean, Japanese, Italian, etc.)
  6. Click Queue or press Ctrl+Enter to run
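The final step can also be performed programmatically: ComfyUI exposes an HTTP endpoint, `POST /prompt`, for queueing a workflow that has been exported in API format (via the workflow's Export (API) option). The sketch below assumes a local ComfyUI server on the default port 8188; the workflow filename is a hypothetical example.

```python
import json
import urllib.request

# Assumed default address of a locally running ComfyUI server.
COMFYUI_URL = "http://127.0.0.1:8188"

def build_prompt_payload(workflow: dict) -> bytes:
    """Wrap an API-format workflow in the JSON body that /prompt expects."""
    return json.dumps({"prompt": workflow}).encode("utf-8")

def queue_workflow(workflow: dict) -> dict:
    """POST the workflow to ComfyUI's /prompt endpoint and return the response."""
    request = urllib.request.Request(
        f"{COMFYUI_URL}/prompt",
        data=build_prompt_payload(workflow),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())

if __name__ == "__main__":
    # Hypothetical filename: export the loaded workflow in API format first.
    with open("image_qwen_image_api.json") as f:
        print(queue_workflow(json.load(f)))
```

This is equivalent to pressing Queue in the UI, which makes it convenient for batch generation or scripted parameter sweeps.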