Fix AttributeError of `VisualClozeProcessor` #12121

Justin900429 · 2025-08-11T05:09:35Z

Summary

Fixes an AttributeError in VisualClozePipeline: during preprocessing, the code called self.height(...), a method that does not exist on VisualClozeProcessor. The pipeline now calls the existing resizing utility (_resize_and_crop) instead. The affected lines before the fix:

diffusers/src/diffusers/pipelines/visualcloze/visualcloze_utils.py

Lines 105 to 113 in f442955

    
    if len(target_position) > 1 and sum(target_position) > 1:
        new_w = resize_size[n_samples - 1][0] or 384
        for i in range(len(processed_images)):
            for j in range(len(processed_images[i])):
                if processed_images[i][j] is not None:
                    new_h = int(processed_images[i][j].height * (new_w / processed_images[i][j].width))
                    new_w = int(new_w / 16) * 16
                    new_h = int(new_h / 16) * 16
                    processed_images[i][j] = self.height(processed_images[i][j], new_h, new_w)

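The error occurs only when more than one image is generated, because the faulty call is reached only when both len(target_position) > 1 and sum(target_position) > 1 hold.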
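The size arithmetic in that branch can be checked in isolation. Below is a minimal sketch, not the library code: `snap_size` is a hypothetical helper that mirrors the three assignments from the snippet (scale the height to the target width, then round both dimensions down to a multiple of 16).

```python
def snap_size(width: int, height: int, target_w: int) -> tuple[int, int]:
    """Mirror the new_w / new_h computation from the snippet.

    Hypothetical helper for illustration only; it is not part of diffusers.
    """
    # Scale the height so the aspect ratio is kept at the target width.
    new_h = int(height * (target_w / width))
    # Round both dimensions down to the nearest multiple of 16.
    new_w = int(target_w / 16) * 16
    new_h = int(new_h / 16) * 16
    return new_w, new_h


if __name__ == "__main__":
    # A 400x300 image with a requested width of 390.
    print(snap_size(400, 300, 390))  # (384, 288)
    # A 512x768 image at the fallback width of 384 is already aligned.
    print(snap_size(512, 768, 384))  # (384, 576)
```

Note that in the pipeline snippet new_w is reassigned inside the loop, so images after the first are scaled against the already rounded width.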

Reproduction

from diffusers import VisualClozePipeline
from PIL import Image
import torch

image_paths = [
    [
        Image.new("RGB", (384, 384), (0, 0, 0)),
        Image.new("RGB", (384, 384), (0, 0, 0)),
        Image.new("RGB", (384, 384), (0, 0, 0)),
    ],
    [
        Image.new("RGB", (384, 384), (0, 0, 0)),
        None,
        None,
    ],
]

task_prompt = "test"
content_prompt = None

pipe = VisualClozePipeline.from_pretrained(
    "VisualCloze/VisualClozePipeline-384", resolution=384, torch_dtype=torch.bfloat16
).to("cuda")

image_result = pipe(
    task_prompt=task_prompt,
    content_prompt=content_prompt,
    image=image_paths,
    upsampling_width=512,
    upsampling_height=512,
    upsampling_strength=0.0,
    guidance_scale=30,
    num_inference_steps=30,
    max_sequence_length=512,
    generator=torch.Generator("cuda").manual_seed(0),
).images[0]

Error:

AttributeError: 'VisualClozeProcessor' object has no attribute 'height'
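The failure mode can also be reproduced without downloading the model. The sketch below uses `StandInProcessor`, a hypothetical stand-in class (not the diffusers `VisualClozeProcessor`), to show why the old call raises and why routing it to `_resize_and_crop` does not. The placeholder method body and the opaque `*size` arguments are assumptions made for illustration; they say nothing about the real method's signature.

```python
class StandInProcessor:
    """Hypothetical stand-in; NOT the diffusers VisualClozeProcessor."""

    def _resize_and_crop(self, image, *size):
        # Placeholder body: the real method resizes and crops an image.
        # The argument order is deliberately left opaque here.
        return (image, *size)

    def buggy_call(self, image, new_h, new_w):
        # Same shape as the failing line: no `height` method is defined.
        return self.height(image, new_h, new_w)

    def fixed_call(self, image, new_h, new_w):
        # The fix routes the call to a method that does exist.
        return self._resize_and_crop(image, new_h, new_w)


def error_message(processor):
    """Return the AttributeError text raised by the buggy call, or None."""
    try:
        processor.buggy_call("img", 288, 384)
    except AttributeError as exc:
        return str(exc)
    return None


if __name__ == "__main__":
    proc = StandInProcessor()
    print(error_message(proc))
    print(proc.fixed_call("img", 288, 384))
```

Because Python resolves `self.height` only when the line executes, the bug stays hidden until the multi-target branch is actually taken.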

@yiyixuxu @asomoza

a-r-r-o-w

Looks correct to me! Just curious, why calling self._resize_and_crop and not self.resize?

Justin900429 · 2025-08-18T06:36:21Z

Thanks for the reply!

Not sure which one is the author’s intended approach, but since they use _resize_and_crop above for the same function, I just followed their implementation.

Reference:

diffusers/src/diffusers/pipelines/visualcloze/visualcloze_utils.py

Line 94 in 03be15e

    
           target = self._resize_and_crop(input_images[i][j], resize_size[i][0], resize_size[i][1])

Edit:

In the authors’ original repo, they apply resize first and then perform a center crop. Therefore, using _resize_and_crop better aligns with their original implementation. (Check here)
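For readers unfamiliar with the resize-then-center-crop pattern mentioned here, the geometry can be sketched with plain arithmetic. This is an illustrative assumption about the general technique, not the code from the original repo or from diffusers; `resize_then_center_crop_box` is a hypothetical helper, and the actual rounding and resampling choices may differ.

```python
def resize_then_center_crop_box(width, height, target_w, target_h):
    """Return (resized_size, crop_box) for a resize followed by a center crop.

    Hypothetical helper for illustration only. The crop box uses the
    (left, top, right, bottom) convention that PIL's Image.crop expects.
    """
    # Scale so the resized image covers the target in both dimensions.
    scale = max(target_w / width, target_h / height)
    # Guard against rounding leaving the image one pixel short.
    resized_w = max(target_w, round(width * scale))
    resized_h = max(target_h, round(height * scale))
    # Center the crop window inside the resized image.
    left = (resized_w - target_w) // 2
    top = (resized_h - target_h) // 2
    box = (left, top, left + target_w, top + target_h)
    return (resized_w, resized_h), box


if __name__ == "__main__":
    # Landscape input, square target: the excess width is cropped evenly.
    print(resize_then_center_crop_box(400, 300, 200, 200))
    # Portrait input, square target: the excess height is cropped evenly.
    print(resize_then_center_crop_box(300, 400, 200, 200))
```

A plain resize to the target size would instead distort the aspect ratio, which is the practical difference between the two options raised in the review question.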

Justin900429 · 2025-09-10T05:52:36Z

Gentle ping — would love some feedback on this when time permits.
Totally understand maintainers are busy, happy to adjust things if needed.
@a-r-r-o-w

HuggingFaceDocBuilderDev · 2025-09-10T20:28:08Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

a-r-r-o-w · 2025-09-10T22:47:28Z

@Justin900429 sorry about the delay, i forgot to merge

Fix AttributeError of VisualClozeProcessor

b17e085

a-r-r-o-w approved these changes Aug 18, 2025

Merge branch 'main' into fix_ae

45af9e5

yiyixuxu added the close-to-merge label Sep 10, 2025

a-r-r-o-w merged commit 55f0b3d into huggingface:main Sep 10, 2025
9 of 10 checks passed
