Qwen Image | Image-To-Image + Editing + Inpainting #12065

@ghunkins

Qwen Image | Editing Capabilities

When it comes to image editing, Qwen-Image goes far beyond simple adjustments. It enables advanced operations such as style transfer, object insertion or removal, detail enhancement, text editing within images, and even human pose manipulation—all with intuitive input and coherent output. This level of control brings professional-grade editing within reach of everyday users.

But Qwen-Image doesn’t just create or edit—it understands. It supports a suite of image understanding tasks, including object detection, semantic segmentation, depth and edge (Canny) estimation, novel view synthesis, and super-resolution. These capabilities, while technically distinct, can all be seen as specialized forms of intelligent image editing, powered by deep visual comprehension.

Is your feature request related to a problem? Please describe.

Qwen-Image has incredible image-to-image and image editing capabilities. Would love to see these features in Diffusers.

Describe the solution you'd like.

Add a QwenImageToImagePipeline, similar to the existing text-to-image pipeline. I'm not sure whether image editing would need a separate pipeline or could be folded into the same one. Inpainting support would also be great.
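For context on what an image-to-image variant usually changes relative to text-to-image: existing Diffusers img2img pipelines (Stable Diffusion, for example) noise the input image partway and then run only the tail of the denoising schedule, controlled by a `strength` argument. Below is a minimal, library-free sketch of that step selection. The function name `img2img_timesteps` and the 10-step schedule are hypothetical, and whether a Qwen-Image pipeline would follow the same convention is an assumption on my part, not something confirmed by the Qwen-Image release.

```python
def img2img_timesteps(timesteps, strength):
    """Return the tail of a denoising schedule that an image-to-image run would execute.

    Hypothetical helper mirroring the `strength` convention of existing
    img2img pipelines; not an actual Qwen-Image or Diffusers API.
    """
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be in [0, 1]")
    num_steps = len(timesteps)
    # Number of denoising steps actually run; strength=1.0 runs all of them.
    init_steps = min(int(num_steps * strength), num_steps)
    # Skip the earliest (noisiest) steps so the input image is only partly altered.
    t_start = max(num_steps - init_steps, 0)
    return timesteps[t_start:]


# Hypothetical 10-step schedule, ordered from most to least noisy.
schedule = list(range(900, -1, -100))
print(img2img_timesteps(schedule, 0.5))  # → [400, 300, 200, 100, 0]
```

With `strength=1.0` the full schedule runs and the input image is effectively ignored; with `strength=0.0` no steps run and the input comes back unchanged.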

Describe alternatives you've considered.

DiffSynth-Studio appears to already have image-to-image.

Additional context.

Qwen Image GitHub Repo
