Fix image input type and tensor shape mismatch in inference pipeline #2

seongbae15 · 2025-06-13T15:50:17Z

First of all, thank you for your amazing research. I'm very interested in your work, and while running the inference script, I encountered a couple of errors related to the input image's data type and dimensions. I investigated the issues and made some fixes, which I’m sharing through this pull request.

This pull request fixes two runtime errors encountered during inference in the TEMU-VTOFF project.

Error 1: AttributeError — 'Image' object has no attribute 'shape'

Cause: vton_image was a PIL.Image object, which does not have a .shape attribute.

Error 2: RuntimeError — Sizes of tensors must match

Cause: Mismatch in tensor dimensions when concatenating vton_model_input, mask, and masked_vton_latents using torch.cat.
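For illustration, the mismatch can be reproduced in isolation. This is a minimal sketch, not the project's code: the tensor shapes, the channel counts, and the choice of `dim=1` are assumptions made only to show when `torch.cat` accepts or rejects its inputs.

```python
import torch

# Hypothetical shapes: batch of 1, 8x8 spatial size, channel counts made up.
latents = torch.zeros(1, 4, 8, 8)
mask = torch.zeros(1, 1, 8, 8)
masked_latents = torch.zeros(1, 4, 8, 8)

# Works: the tensors differ only along the concatenation dimension (dim=1).
combined = torch.cat([latents, mask, masked_latents], dim=1)
print(combined.shape)  # torch.Size([1, 9, 8, 8])

# Fails: this mask has a different spatial size, so torch.cat raises RuntimeError.
bad_mask = torch.zeros(1, 1, 4, 4)
try:
    torch.cat([latents, bad_mask, masked_latents], dim=1)
    raised = False
except RuntimeError as err:
    raised = True
    print(type(err).__name__)
```

Every dimension other than the one being concatenated has to agree, so an input that skips the expected preprocessing can surface as this error further down the pipeline.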

Fix

The image is now converted to a PyTorch tensor and batched using:

image = transforms.ToTensor()(image).unsqueeze(0)

Processing the image input this way keeps the dimensions consistent, which resolved the size mismatch in the concatenated tensors.

Changes Made

Added image preprocessing line to convert PIL.Image to 4D tensor

Test

Verified inference runs without errors on Colab.

Output image was generated successfully with no runtime exceptions.

…dimension mismatch

- Fixed AttributeError caused by attempting to access '.shape' on a PIL.Image object.
- Converted PIL image to tensor and added batch dimension using: 'image = transforms.ToTensor()(image).unsqueeze(0)'
- Fixed RuntimeError from mismatched tensor sizes during torch.cat by ensuring input tensors have consistent dimensions.

Add time logger

Fix: AttributeError: module 'time' has no attribute 'tim'

Update from .debug to .info

Change logging.info -> print
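The timing code itself is not shown in these commit messages, so the following is only a sketch of what a printed time logger might look like. The helper name `timed` is hypothetical, and `time.perf_counter` is used here as a substitute for whatever clock call the commits actually use (the `tim` typo suggests `time.time`).

```python
import time


def timed(label, fn, *args, **kwargs):
    """Run fn, print the elapsed wall-clock time, and return (result, seconds).

    Hypothetical helper for illustration only.
    """
    start = time.perf_counter()
    result = fn(*args, **kwargs)
    elapsed = time.perf_counter() - start
    # print is always visible; logging.debug/info depend on the configured level.
    print(f"{label}: {elapsed:.3f} s")
    return result, elapsed


# Stand-in workload; a real call would wrap the inference function instead.
result, elapsed = timed("inference", sum, range(1000))
```

Printing avoids the case where a message is silently dropped because the logger's level is above `DEBUG` or `INFO`, which may be why the commits moved from `.debug` to `.info` and then to `print`.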

seongbae15 mentioned this pull request Jul 24, 2025

Wrong data format and image normalization #3

Open

seongbae15 and others added 9 commits July 24, 2025 17:39

Merge branch 'davidelobba:main' into main

bc05162

Add time logger

5bd0187

Merge pull request #1 from seongbae15/dev/inference-time

8e1ae55

Add time logger

Fix: AttributeError: module 'time' has no attribute 'tim'

4721bd8

Merge pull request #2 from seongbae15/dev/inference-time

bbc6051

Fix: AttributeError: module 'time' has no attribute 'tim'

Update from .debug to .info

bce78c6

Merge pull request #3 from seongbae15/dev/inference-time

539912a

Update from .debug to .info

Change logging.info -> print

5ac9441

Merge pull request #4 from seongbae15/dev/inference-time

9d3058c

Change logging.info -> print
