[Sana bug] bug fix for 2K model config #10340
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Adding a comment to get a notification when it's merged.
```diff
  base_output = model(**inputs_dict)

- model_size = compute_module_sizes(model)[""]
+ model_size = compute_module_persistent_sizes(model)[""]
```
We estimate the checkpoint size here, so it should not include non-persistent buffers. Technically we could pass remove_non_persistent to named_module_tensors, but that would require a PR to accelerate, so we are adding a function here for now:
https://github.com/huggingface/accelerate/blob/200c9eb7833cfa505907f6f224ebf5a275aa6d92/src/accelerate/utils/modeling.py#L724
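For reference, the key point is to count only tensors that actually land in the checkpoint. Below is a minimal sketch of such a helper; the name matches the one added in this PR, but the body is an illustrative reconstruction built on `state_dict()`, which by definition excludes non-persistent buffers, rather than the exact code that was merged:

```python
import torch.nn as nn


def compute_module_persistent_sizes(module: nn.Module) -> dict:
    """Estimate per-submodule checkpoint size in bytes.

    state_dict() contains parameters and *persistent* buffers only, so
    non-persistent buffers (e.g. a cached positional embedding registered
    with persistent=False) are automatically excluded from the estimate.
    """
    sizes = {"": 0}
    for name, tensor in module.state_dict().items():
        nbytes = tensor.numel() * tensor.element_size()
        sizes[""] += nbytes  # sizes[""] accumulates the whole model
        # Attribute the tensor's size to every parent submodule prefix.
        parts = name.split(".")
        for i in range(1, len(parts)):
            prefix = ".".join(parts[:i])
            sizes[prefix] = sizes.get(prefix, 0) + nbytes
    return sizes
```

With a helper like this, `compute_module_persistent_sizes(model)[""]` in the test above returns the estimated size of the full checkpoint, excluding anything that never gets serialized.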
Is there anything I need to do here? I'm not familiar with what you said above. 🤔
* fix the Positional Embedding bug in the 2K model
* change the default model to the BF16 one for more stable training and output
* make style
* subtract buffer size
* add compute_module_persistent_sizes

Co-authored-by: yiyixuxu <[email protected]>
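As a usage note on the BF16 default mentioned above, loading it would look roughly like this; the repo id and `variant` flag follow the pattern in the Sana docs and should be treated as illustrative rather than as part of this diff:

```python
import torch
from diffusers import SanaPipeline

# Illustrative checkpoint id for the BF16 variant of Sana.
pipe = SanaPipeline.from_pretrained(
    "Efficient-Large-Model/Sana_1600M_1024px_BF16_diffusers",
    variant="bf16",
    torch_dtype=torch.bfloat16,
)
pipe.to("cuda")
```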
What does this PR do?
This PR fixes the missing Positional Embedding bug. We re-introduce PE into the > 2K resolution models, so we need to add PE back in SanaTransformer2DModel. A toy sketch of the pattern follows below.

Cc: @yiyixuxu @sayakpaul @a-r-r-o-w, thanks in advance for merging this PR to fix the bug.
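For illustration only, here is a toy version of the pattern in question: a fixed sinusoidal PE added on top of a patch embedding and registered as a non-persistent buffer, so it exists at runtime but never enters the checkpoint (which is why the test helper above must skip such buffers). Names and shapes are hypothetical, not the actual SanaTransformer2DModel code:

```python
import torch
import torch.nn as nn


class PatchEmbedWithPE(nn.Module):
    """Toy patch embedding plus a fixed sinusoidal positional embedding.

    The PE is registered with persistent=False: it is rebuilt at init and
    never written to the state dict.
    """

    def __init__(self, num_patches: int, embed_dim: int):
        super().__init__()
        self.proj = nn.Linear(embed_dim, embed_dim)
        pe = self._build_sincos(num_patches, embed_dim)
        self.register_buffer("pos_embed", pe, persistent=False)

    @staticmethod
    def _build_sincos(num_patches: int, embed_dim: int) -> torch.Tensor:
        # Standard sin/cos table; assumes embed_dim is even.
        position = torch.arange(num_patches).unsqueeze(1).float()
        div_term = torch.exp(
            torch.arange(0, embed_dim, 2).float()
            * (-torch.log(torch.tensor(10000.0)) / embed_dim)
        )
        pe = torch.zeros(1, num_patches, embed_dim)
        pe[0, :, 0::2] = torch.sin(position * div_term)
        pe[0, :, 1::2] = torch.cos(position * div_term)
        return pe

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, num_patches, embed_dim); PE broadcasts over the batch.
        return self.proj(x) + self.pos_embed
```

Dropping this addition for the > 2K configs is what produced the broken outputs shown below; the before/after images demonstrate the effect of restoring it.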
Before the PR:

After the PR:
