Fix small inconsistency in output dimension of "_get_t5_prompt_embeds" function in sd3 pipeline #12531
What does this PR do?
This PR fixes a small inconsistency in the output dimension of the `_get_t5_prompt_embeds` function in the Stable Diffusion 3 pipeline.

Previously, when `self.text_encoder_3` was `None`, the function returned a tensor (`torch.zeros`) with a sequence length of `self.tokenizer_max_length` (77), which corresponds to the CLIP encoder. However, the T5 text encoder used in SD3 has a different maximum sequence length (256). As a result, when `text_encoder_3` was available, the prompt embeddings had a sequence length of 333 (256 from T5 + 77 from CLIP), but when it was not available, the returned tensor had only 154 (77 + 77), leading to an inconsistency in output dimensions in `encode_prompt`.
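To make the mismatch concrete, here is a minimal, runnable sketch using dummy zero tensors in place of the real encoder outputs (the shapes, including the 4096 joint feature dimension, are illustrative rather than taken from the pipeline code):

```python
import torch

# Dummy shapes standing in for the SD3 encoder outputs (illustrative values).
batch, clip_len, t5_len, joint_dim = 1, 77, 256, 4096

# Stand-in for the CLIP embeddings after padding to the joint feature dim.
clip_prompt_embeds = torch.zeros(batch, clip_len, joint_dim)

# With text_encoder_3 available: T5 embeddings have sequence length 256.
t5_prompt_embeds = torch.zeros(batch, t5_len, joint_dim)
print(torch.cat([clip_prompt_embeds, t5_prompt_embeds], dim=-2).shape)   # (1, 333, 4096)

# Old behaviour when text_encoder_3 is None: zeros with the CLIP length (77).
t5_placeholder_old = torch.zeros(batch, clip_len, joint_dim)
print(torch.cat([clip_prompt_embeds, t5_placeholder_old], dim=-2).shape)  # (1, 154, 4096)

# New behaviour: zeros with max_sequence_length (256), matching the T5 case.
t5_placeholder_new = torch.zeros(batch, t5_len, joint_dim)
print(torch.cat([clip_prompt_embeds, t5_placeholder_new], dim=-2).shape)  # (1, 333, 4096)
```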
Motivation and Context

- This change ensures consistent tensor shapes across different encoder availability conditions in the SD3 pipeline.
- It prevents dimension mismatches and potential runtime errors when `text_encoder_3` is `None`.
- Previously, the zeros tensor used `self.tokenizer_max_length`, which corresponds to CLIP, rather than T5's longer maximum sequence length; this mismatch led to inconsistent embedding dimensions when combining outputs from CLIP and T5 in `encode_prompt`.

Changes Made
- Replaced `self.tokenizer_max_length` with `max_sequence_length` when returning the zero tensor in `_get_t5_prompt_embeds`, ensuring consistent output dimensions whether `text_encoder_3` is `None` or available.
- The same `max_sequence_length` parameter is already used in the tokenization step of the same function (see the sketch below):
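A paraphrased excerpt of the relevant lines (not the exact diff) is sketched below; names such as `batch_size`, `num_images_per_prompt`, and `self.transformer.config.joint_attention_dim` follow the surrounding pipeline code:

```python
# Paraphrased excerpt of _get_t5_prompt_embeds (not the full function).
if self.text_encoder_3 is None:
    return torch.zeros(
        (
            batch_size * num_images_per_prompt,
            max_sequence_length,  # was: self.tokenizer_max_length (77, the CLIP length)
            self.transformer.config.joint_attention_dim,
        ),
        device=device,
        dtype=dtype,
    )

# The tokenization step further down in the same function already uses this parameter:
text_inputs = self.tokenizer_3(
    prompt,
    padding="max_length",
    max_length=max_sequence_length,
    truncation=True,
    add_special_tokens=True,
    return_tensors="pt",
)
```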
Who can review?