Feature: z-image Turbo Control Net #8679

Pfannkuchensack · 2025-12-14T09:25:11Z

Summary

Add support for Z-Image ControlNet V2.0 alongside the existing V1 support.

Key changes:

Auto-detect control_in_dim from adapter weights (16 for V1, 33 for V2.0)
Auto-detect n_refiner_layers from state dict
Add zero-padding for V2.0's additional control channels (diffusers approach)
Use accelerate.init_empty_weights() for more efficient model creation
Add ControlNet_Checkpoint_ZImage_Config to frontend schema

Related Issues / Discussions

Part of Z-Image feature implementation.

QA Instructions

Load a Z-Image ControlNet V1 model (control_in_dim=16) and verify it works
Load a Z-Image ControlNet V2.0 model (control_in_dim=33) and verify it works
Test with different control types: Canny, Depth, Pose
Recommended control_context_scale: 0.65-0.80

Merge Plan

Can be merged after review. No special considerations needed.

Checklist

The PR has a short but descriptive title, suitable for a changelog
Tests added / updated (if applicable)
❗Changes to a redux slice have a corresponding migration
Documentation added / updated (if applicable)
Updated What's New copy (if doing a release after this PR)

blessedcoolant · 2025-12-21T16:43:32Z

Merged the Z Image PR. Can you rebase this against main now so we can go through the checks? Thank you.

feat: Add Z-Image ControlNet support with spatial conditioning Add comprehensive ControlNet support for Z-Image models including: Backend: - New ControlNet_Checkpoint_ZImage_Config for Z-Image control adapter models - Z-Image control key detection (_has_z_image_control_keys) to identify control layers - ZImageControlAdapter loader for standalone control models - ZImageControlTransformer2DModel combining base transformer with control layers - Memory-efficient model loading by building combined state dict

VRAM usage is high. - Auto-detect control_in_dim from adapter weights (16 for V1, 33 for V2.0) - Auto-detect n_refiner_layers from state dict - Add zero-padding for V2.0's additional channels - Use accelerate.init_empty_weights() for efficient model creation - Add ControlNet_Checkpoint_ZImage_Config to frontend schema

- Add missing ControlNet_Checkpoint_ZImage_Config import - Remove unused imports (Any, Dict, ADALN_EMBED_DIM, is_torch_version) - Add strict=True to zip() calls - Replace mutable list defaults with immutable tuples - Replace dict() calls with literal syntax - Sort imports in z_image_denoise.py

Implement Z-Image ControlNet as an Extension pattern (similar to FLUX ControlNet) instead of merging control weights into the base transformer. This provides: - Lower memory usage (no weight duplication) - Flexibility to enable/disable control per step - Cleaner architecture with separate control adapter Key implementation details: - ZImageControlNetExtension: computes control hints per denoising step - z_image_forward_with_control: custom forward pass with hint injection - patchify_control_context: utility for control image patchification - ZImageControlAdapter: standalone adapter with control_layers and noise_refiner Architecture matches original VideoX-Fun implementation: - Hints computed ONCE using INITIAL unified state (before main layers) - Hints injected at every other main transformer layer (15 control blocks) - Control signal added after each designated layer's forward pass V2.0 ControlNet support (control_in_dim=33): - Channels 0-15: control image latents - Channels 16-31: reference image (zeros for pure control) - Channel 32: inpaint mask (1.0 = don't inpaint, use control signal)

blessedcoolant · 2025-12-22T01:02:55Z

Seems to be working fine. I didn't have the last commit pulled when I tested the last time. My bad.

Also not sure if it is LoRA problem or the ControlNet issue but the results with the Arcane lora I linked in the other PR are quite muddy.

This is the output I get with the scaled safetensor. But I am guessing coz that is coz scaled weights are not supported yet with the ControlNet?

Pfannkuchensack · 2025-12-22T13:52:01Z

So the v1 and v2 are really bad. the v2.1 works fine https://huggingface.co/alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union-2.1

During testing, we found that applying ControlNet to Z-Image-Turbo caused the model to lose its acceleration capability and become blurry. We performed 8-step distillation on the version 2.1 model, and the distilled model demonstrates better performance when using 8-step prediction. Additionally, we have uploaded a tile model that can be used for super-resolution generation. [2025.12.22]

blessedcoolant · 2025-12-22T18:34:57Z

So the v1 and v2 are really bad. the v2.1 works fine https://huggingface.co/alibaba-pai/Z-Image-Turbo-Fun-Controlnet-Union-2.1

During testing, we found that applying ControlNet to Z-Image-Turbo caused the model to lose its acceleration capability and become blurry. We performed 8-step distillation on the version 2.1 model, and the distilled model demonstrates better performance when using 8-step prediction. Additionally, we have uploaded a tile model that can be used for super-resolution generation. [2025.12.22]

Ah. Brand new. I'll check it out in a bit. Both the regular and the tile version. If they are go, we can set them as the suggested starter models and merge this one up too and move on to the regional guidance part.

blessedcoolant · 2025-12-22T19:36:06Z

Tested out with the newer models. Definitely better performance. The quality of the controlnet models themselves is alright. LoRA functionality is much better but not Z Image base level yet.

But this PR is good to go I think. ControlNet models seem to be working. Both tile and union.

I synced up with main and fixed the ruff checks. If there's nothing else to add to this one, let me know. I can merge this and we can move on to the next one.

Great job overall implementing Z Image. Looking great.

github-actions bot added api python PRs that change python files Root invocations PRs that change invocations backend PRs that change backend files frontend PRs that change frontend files python-deps PRs that change python dependencies labels Dec 14, 2025

Pfannkuchensack closed this Dec 21, 2025

Pfannkuchensack reopened this Dec 21, 2025

Pfannkuchensack added 2 commits December 21, 2025 18:43

Pfannkuchensack force-pushed the feature/z-image-control branch from 4f040f1 to 8db8aa8 Compare December 21, 2025 17:46

Pfannkuchensack added 3 commits December 21, 2025 18:50

style: apply ruff formatting

1c13ca8

blessedcoolant added 2 commits December 23, 2025 01:03

Merge branch 'main' into pr/8679

7b9ce35

chore: format code for ruff checks

874b547

Merge branch 'main' into feature/z-image-control

73be5e5

blessedcoolant marked this pull request as ready for review December 23, 2025 00:24

blessedcoolant requested review from blessedcoolant and lstein as code owners December 23, 2025 00:24

blessedcoolant approved these changes Dec 23, 2025

View reviewed changes

blessedcoolant enabled auto-merge December 23, 2025 00:25

blessedcoolant merged commit aa764f8 into invoke-ai:main Dec 23, 2025
25 checks passed

Pfannkuchensack deleted the feature/z-image-control branch December 23, 2025 00:30

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Feature: z-image Turbo Control Net #8679

Feature: z-image Turbo Control Net #8679

Uh oh!

Pfannkuchensack commented Dec 14, 2025

Uh oh!

blessedcoolant commented Dec 21, 2025

Uh oh!

blessedcoolant commented Dec 22, 2025 •

edited

Loading

Uh oh!

Pfannkuchensack commented Dec 22, 2025

Uh oh!

blessedcoolant commented Dec 22, 2025

Uh oh!

blessedcoolant commented Dec 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Feature: z-image Turbo Control Net #8679

Feature: z-image Turbo Control Net #8679

Uh oh!

Conversation

Pfannkuchensack commented Dec 14, 2025

Summary

Related Issues / Discussions

QA Instructions

Merge Plan

Checklist

Uh oh!

blessedcoolant commented Dec 21, 2025

Uh oh!

blessedcoolant commented Dec 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Pfannkuchensack commented Dec 22, 2025

Uh oh!

blessedcoolant commented Dec 22, 2025

Uh oh!

blessedcoolant commented Dec 22, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

blessedcoolant commented Dec 22, 2025 •

edited

Loading