fix(z_image): use unrestricted image self-attention for regional prompting by Pfannkuchensack · Pull Request #8718 · invoke-ai/InvokeAI

Pfannkuchensack · 2025-12-28T12:32:14Z

Summary

Changes image self-attention from restricted (region-isolated) to unrestricted (all image tokens can attend to each other), similar to the FLUX approach.

This fixes the issue where ZImage-Turbo with multiple regional guidance layers would generate two separate/disconnected images instead of compositing them into a single unified image.

The regional text-image attention remains restricted so that each region still responds to its corresponding prompt.
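As a rough illustration of the mask change described above, here is a minimal pure-Python sketch. It is not the code from this PR: the function name `build_regional_attention_mask`, the token layout (all prompt text tokens first, then image tokens), and the rule that text tokens attend only within their own prompt are assumptions made for the example. The restricted branch models the old behavior as "two image tokens may attend only if they share a region", which is also an assumption.

```python
from typing import List


def build_regional_attention_mask(
    text_lens: List[int],
    region_masks: List[List[bool]],
    num_image_tokens: int,
    unrestricted_image_self_attention: bool = True,
) -> List[List[bool]]:
    """Return an N x N boolean mask where True means "row may attend to column".

    Hypothetical layout: [prompt 0 text | prompt 1 text | ... | image tokens].
    region_masks[k][i] is True if image token i lies inside region k.
    """
    if len(text_lens) != len(region_masks):
        raise ValueError("need one region mask per prompt")
    if any(len(r) != num_image_tokens for r in region_masks):
        raise ValueError("each region mask must cover every image token")

    total_text = sum(text_lens)
    n = total_text + num_image_tokens
    mask = [[False] * n for _ in range(n)]

    start = 0
    for length, region in zip(text_lens, region_masks):
        end = start + length
        for t in range(start, end):
            # Text-text: tokens attend only within their own prompt (assumption).
            for u in range(start, end):
                mask[t][u] = True
            # Text-image: stays restricted to the prompt's region in both directions.
            for i, inside in enumerate(region):
                if inside:
                    mask[t][total_text + i] = True
                    mask[total_text + i][t] = True
        start = end

    # Image-image: the part this PR's summary describes changing.
    for i in range(num_image_tokens):
        for j in range(num_image_tokens):
            if unrestricted_image_self_attention:
                allowed = True  # every image token sees every other image token
            else:
                # Assumed old behavior: only tokens sharing a region may attend.
                allowed = any(r[i] and r[j] for r in region_masks)
            mask[total_text + i][total_text + j] = allowed
    return mask


# Two prompts of 2 text tokens each; 4 image tokens split into left and right halves.
left = [True, True, False, False]
right = [False, False, True, True]
unrestricted = build_regional_attention_mask([2, 2], [left, right], 4, True)
restricted = build_regional_attention_mask([2, 2], [left, right], 4, False)
```

In this toy setup, the first left-half image token (index 4) cannot attend to the last right-half image token (index 7) in `restricted`, so the two halves never exchange information; in `unrestricted` they can, while each prompt's text tokens still reach only their own region.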

Fixes #8715

Related Issues / Discussions

Issue #8715: [bug]: ZImage-Turbo regional guidance not working as expected

QA Instructions

1. Create a new canvas generation with ZImage-Turbo.
2. Add two or more regional guidance layers with different prompts (e.g., "woman in business suit" on the left, "woman in peasant dress" on the right).
3. Add a global prompt (e.g., "two women").
4. Generate the image.

Expected: A single unified image with both women composited according to their regions.
Previous behavior: Two separate, independent images.

Merge Plan

No special merge considerations required. This is a targeted fix to the attention mask construction.

Checklist

The PR has a short but descriptive title, suitable for a changelog
Tests added / updated (if applicable)
❗Changes to a redux slice have a corresponding migration - N/A
Documentation added / updated (if applicable) - N/A
Updated What's New copy (if doing a release after this PR)

lstein

This change works well in my hands. I find that even minimal overlap of the regions ensures better coherence to the main prompt, but I usually get acceptable results even when the regions are not overlapping.

Pfannkuchensack requested review from blessedcoolant and lstein as code owners December 28, 2025 12:32

github-actions bot added the python (PRs that change python files) and backend (PRs that change backend files) labels Dec 28, 2025

lstein mentioned this pull request Dec 28, 2025 in issue #8715, "[bug]: ZImage-Turbo regional guidance not working as expected" (closed)

lstein approved these changes Dec 28, 2025

lstein merged commit d7d0512 into invoke-ai:main Dec 28, 2025
13 checks passed

Pfannkuchensack deleted the fix/zimage-regional-guidance-unrestricted-attention branch December 28, 2025 17:14
