- 
                Notifications
    You must be signed in to change notification settings 
- Fork 6.5k
Description
Qwen Image | Editing Capabilities
When it comes to image editing, Qwen-Image goes far beyond simple adjustments. It enables advanced operations such as style transfer, object insertion or removal, detail enhancement, text editing within images, and even human pose manipulation—all with intuitive input and coherent output. This level of control brings professional-grade editing within reach of everyday users.
But Qwen-Image doesn’t just create or edit—it understands. It supports a suite of image understanding tasks, including object detection, semantic segmentation, depth and edge (Canny) estimation, novel view synthesis, and super-resolution. These capabilities, while technically distinct, can all be seen as specialized forms of intelligent image editing, powered by deep visual comprehension.
Is your feature request related to a problem? Please describe.
Qwen-Image has incredible image-to-image and image editing capabilities. Would love to see these features in Diffusers.
Describe the solution you'd like.
The addition of a QwenImageToImagePipeline pipeline similar to the text-to-image pipeline. Not sure if image-editing would require a separate pipeline, or if it would be included. Inpainting would also be great.
Describe alternatives you've considered.
DiffSynth-Studio appears to already have image-to-image.
Additional context.

