Describe the bug
In WanRotaryPosEmbed (link), attention_head_dim is split into the t/h/w dimensions in two different ways in __init__ and forward. This causes a mismatch for some values of attention_head_dim. The same issue is also present in other models that use rotary embeddings (e.g., Skyreels_v2).
Details
Given
diffusers/src/diffusers/models/transformers/transformer_wan.py (lines 363 to 364 in 9c3b58d):

h_dim = w_dim = 2 * (attention_head_dim // 6)
t_dim = attention_head_dim - h_dim - w_dim
If we have attention_head_dim=64, we get:

attention_head_dim = 64
h_dim = w_dim = 2 * (attention_head_dim // 6)
t_dim = attention_head_dim - h_dim - w_dim
print([t_dim, h_dim, w_dim])

which prints [24, 20, 20].
In the forward, when splitting the dimensions, we have
diffusers/src/diffusers/models/transformers/transformer_wan.py (lines 390 to 394 in 9c3b58d):

split_sizes = [
    self.attention_head_dim - 2 * (self.attention_head_dim // 3),
    self.attention_head_dim // 3,
    self.attention_head_dim // 3,
]
so if we try

attention_head_dim = 64
split_sizes = [
    attention_head_dim - 2 * (attention_head_dim // 3),
    attention_head_dim // 3,
    attention_head_dim // 3,
]
print(split_sizes)

this prints [22, 21, 21].
In most of the models, where attention_head_dim is 128, the two computations happen to match, but I was wondering whether this is a bug that should be fixed.
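For reference, here is a minimal standalone sketch (the two formulas are copied from the snippets above; the helper names and the list of head dims are just for illustration) showing for which values the two computations diverge:

# Compare the split computed in __init__ with the one computed in forward.
def init_split(attention_head_dim):
    # __init__: h/w get 2 * (dim // 6) each, t gets the remainder
    h_dim = w_dim = 2 * (attention_head_dim // 6)
    t_dim = attention_head_dim - h_dim - w_dim
    return [t_dim, h_dim, w_dim]

def forward_split(attention_head_dim):
    # forward: h/w get dim // 3 each, t gets the remainder
    h_dim = w_dim = attention_head_dim // 3
    t_dim = attention_head_dim - 2 * h_dim
    return [t_dim, h_dim, w_dim]

for dim in (64, 96, 128, 160):
    a, b = init_split(dim), forward_split(dim)
    print(dim, a, b, "match" if a == b else "MISMATCH")

# 64 [24, 20, 20] [22, 21, 21] MISMATCH
# 96 [32, 32, 32] [32, 32, 32] match
# 128 [44, 42, 42] [44, 42, 42] match
# 160 [56, 52, 52] [54, 53, 53] MISMATCH

The two splits agree exactly when 2 * (attention_head_dim // 6) == attention_head_dim // 3, i.e., when attention_head_dim % 6 < 3, which is why the common 128 case lines up while 64 does not.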
Reproduction
NA
Logs
System Info
NA