Cosmos/transfer2.5 #3

miguelmartin75 · 2026-01-16T02:08:51Z

wip

* flux2-klein * Apply suggestions from code review Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Klein tests (#2) * tests * up * tests * up * support step-distilled * Apply suggestions from code review Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * doc string etc * style * more * copies * klein lora training scripts (#3) * initial commit * initial commit * remove remote text encoder * initial commit * initial commit * initial commit * revert * img2img fix * text encoder + tokenizer * text encoder + tokenizer * update readme * guidance * guidance * guidance * test * test * revert changes not needed for the non klein model * Update examples/dreambooth/train_dreambooth_lora_flux2_klein.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * fix guidance * fix validation * fix validation * fix validation * fix path * space --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * style * Update src/diffusers/pipelines/flux2/pipeline_flux2_klein.py * Apply style fixes * auto pipeline --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* tag loader_id from Automodel * style * load_components by default only load components that are not already loaded * by default, skip loading the componeneets does not have the repo id

* add metadata field to input/output param * refactor mellonparam: move the template outside, add metaclass, define some generic template for custom node * add from_custom_block * style * up up fix * add mellon guide * add to toctree * style * add mellon_types * style * mellon_type -> inpnt_types + output_types * update doc * add quant info to components manager * fix more * up up * fix components manager * update custom block guide * update * style * add a warn for mellon and add new guides to overview * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/modular_diffusers/mellon.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * more update on custom block guide * Update docs/source/en/modular_diffusers/mellon.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * a few mamual * apply suggestion: turn into bullets * support define mellon meta with MellonParam directly, and update doc * add the video --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: yiyi@huggingface.co <yiyi@ip-26-0-160-103.ec2.internal>

* start better template for modular pipeline card. * simplify structure. * refine. * style. * up * add tests

* add magcache * formatting * add magcache support with calibration mode * add imports * improvements * Apply style fixes * fix kandinsky errors * add tests and documentation * Apply style fixes * improvements * Apply style fixes * make fix-copies. * minor fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

) Fix syntax error in quantization configuration

…ggingface#13083) Improve docstring scheduling dpmsolver multistep inverse

* make flux hidden states contiguous * make fix-copies

…gingface#13081) make qwen hidden states contiguous to make torchao happy.

@asomoza

* Add ZImageInpaintPipeline Updated the pipeline structure to include ZImageInpaintPipeline alongside ZImagePipeline and ZImageImg2ImgPipeline. Implemented the ZImageInpaintPipeline class for inpainting tasks, including necessary methods for encoding prompts, preparing masked latents, and denoising. Enhanced the auto_pipeline to map the new ZImageInpaintPipeline for inpainting generation tasks. Added unit tests for ZImageInpaintPipeline to ensure functionality and performance. Updated dummy objects to include ZImageInpaintPipeline for testing purposes. * Add documentation and improve test stability for ZImageInpaintPipeline - Add torch.empty fix for x_pad_token and cap_pad_token in test - Add # Copied from annotations for encode_prompt methods - Add documentation with usage example and autodoc directive * Address PR review feedback for ZImageInpaintPipeline Add batch size validation and callback handling fixes per review, using diffusers conventions rather than suggested code verbatim. * Update src/diffusers/pipelines/z_image/pipeline_z_image_inpaint.py Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> * Update src/diffusers/pipelines/z_image/pipeline_z_image_inpaint.py Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> * Add input validation and fix XLA support for ZImageInpaintPipeline - Add missing is_torch_xla_available import for TPU support - Add xm.mark_step() in denoising loop for proper XLA execution - Add check_inputs() method for comprehensive input validation - Call check_inputs() at the start of __call__ Addresses PR review feedback from @asomoza. * Cleanup --------- Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>

…face#12498) Even if the `qweight_type` is one of the `UNQUANTIZED_TYPES`, qweight still has to be "dequantized" because it is stored as an 8-bit tensor. Without doing so, it is therefore a shape mismatch in the following matmul. Side notes: - why isn't DIFFUSERS_GGUF_CUDA_KERNELS on by default? It's significantly faster and only used when installed - https://huggingface.co/Isotr0py/ggml/tree/main/build has no build for torch 2.8 (or the upcoming 2.9). Who can we contact to make such a build? Co-authored-by: YiYi Xu <yixu310@gmail.com>

…ggingface#13085) * Improve docstring scheduling dpmsolver sde * Update scheduling_dpmsolver_sde.py Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * run make fix-copies --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

@yiyixuxu

* simplify components manager doc * Apply suggestion from @yiyixuxu * Apply suggestion from @stevhliu Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Apply suggestion from @stevhliu Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: yiyi@huggingface.co <yiyi@ip-26-0-160-103.ec2.internal> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>

Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>

) * initil * fix init_pipeline etc * style * copies * fix copies * upup more * fix test * add output type (huggingface#13091) --------- Co-authored-by: yiyi@huggingface.co <yiyi@ip-26-0-160-103.ec2.internal> Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>

* up * style

…ce#13057) * Support different pipeline outputs for LTX 2 encode_video * Update examples to use improved encode_video function * Fix comment * Address review comments * make style and make quality * Have non-iterator video inputs respect video_chunks_number * make style and make quality * Add warning when encode_video receives a non-denormalized np.ndarray * make style and make quality --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* initial commit * initial commit * initial commit * initial commit * initial commit * initial commit * initial commit * fix vae * fix prompts * Apply style fixes * fix license --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>

* add wan modular tests * style. * add z-image tests and other fixes. * style. * increase tolerance for zimage * style * address reviewer feedback. * address reviewer feedback. * remove unneeded func * simplify even more.

miguelmartin75 force-pushed the cosmos/transfer2.5 branch from 5c6dd86 to 6d7d6af Compare January 16, 2026 02:09

miguelmartin75 force-pushed the cosmos/transfer2.5 branch from 446e6ea to 9b8338c Compare February 2, 2026 19:50

yiyixuxu and others added 27 commits February 3, 2026 05:34

[Modular] loader related (huggingface#13025)

ebd06f9

* tag loader_id from Automodel * style * load_components by default only load components that are not already loaded * by default, skip loading the componeneets does not have the repo id

[modular] change the template modular pipeline card (huggingface#13072)

1b8fc6c

* start better template for modular pipeline card. * simplify structure. * refine. * style. * up * add tests

[docs] Fix syntax error in quantization configuration (huggingface#13076

90818e8

) Fix syntax error in quantization configuration

docs: improve docstring scheduling_dpmsolver_multistep_inverse.py (hu…

03af690

…ggingface#13083) Improve docstring scheduling dpmsolver multistep inverse

[core] make flux hidden states contiguous (huggingface#13068)

9fe0a9c

* make flux hidden states contiguous * make fix-copies

[core] make qwen hidden states contiguous to make torchao happy. (hug…

a3dcd98

…gingface#13081) make qwen hidden states contiguous to make torchao happy.

ZImageControlNet cfg (huggingface#13080)

09dca38

Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>

[Modular] guard ModularPipeline.blocks attribute (huggingface#13014)

44f4dc0

* up * style

initial conversion script

94a44e4

cosmos control net block

3eec046

CosmosAttention

e181679

base model conversion

d46e6cb

wip

8cbb7a0

pipeline updates

e526bac

convert controlnet

7fef44a

pipeline: working without controls

6b93134

wip

8548861

miguelmartin75 added 24 commits February 10, 2026 02:41

control working

ec92d7f

cleanup + detail on neg_encoder_hidden_states

67cb736

convert edge

0a88230

pos emb for control latents

f501bb6

convert all chkpts

0d457f1

resolve TODOs

bfa83e2

remove prints

5ae1a05

Docs

f1ce209

add siglip image reference encoder

57388b7

Add unit tests

cee7324

controlnet: add duplicate layers

2f9ce6a

Additional tests

bc31b30

skip less

e8fbac2

skip less

a55fb3c

remove image_ref

0811456

minor

276a6b3

docs

1d912ec

remove skipped test in transfer

c0699dc

Don't crash process

2b4cecf

formatting

1f66428

revert some changes

6fdb677

remove skipped test

30a0866

make style

7d1525c

Address comment + fix example

4b38767

miguelmartin75 force-pushed the cosmos/transfer2.5 branch 2 times, most recently from 442e8e4 to ddec8fb Compare February 11, 2026 23:55

CosmosAttnProcessor2_0 revert + CosmosAttnProcessor2_5 changes

4bbedfb

miguelmartin75 force-pushed the cosmos/transfer2.5 branch from ddec8fb to 4bbedfb Compare February 11, 2026 23:56

miguelmartin75 added 2 commits February 12, 2026 00:08

make style

d4e7e6c

make fix-copies

0460203

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cosmos/transfer2.5 #3

Cosmos/transfer2.5 #3

Uh oh!

miguelmartin75 commented Jan 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants

Cosmos/transfer2.5 #3

Are you sure you want to change the base?

Cosmos/transfer2.5 #3

Uh oh!

Conversation

miguelmartin75 commented Jan 16, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

10 participants