Describe the bug
I observed that this line of code in `FluxSingleTransformerBlock`, `hidden_states = torch.cat([encoder_hidden_states, hidden_states], dim=1)`, appears to have been introduced to support First Block Cache.
However, it not only increases the computation but also fundamentally changes the forward logic: once the two are concatenated, `encoder_hidden_states` affects `hidden_states` through attention in this block and in every subsequent block, throughout the whole denoising loop. Is that right?
I'm also confused about the design intent: is `FluxSingleTransformerBlock` meant to mix `hidden_states` with `encoder_hidden_states` (the prompt embeddings, in this case) or not?
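
For reference, here is a minimal sketch of the concat-then-split pattern I'm asking about. Only the `torch.cat` line is quoted from the source; `run_single_block` and `block_fn` are hypothetical stand-ins for the block's forward, not the real diffusers API. The point is that after concatenation, attention inside the block sees text and image tokens as a single sequence:

```python
import torch

def run_single_block(block_fn, hidden_states, encoder_hidden_states, temb):
    # hidden_states:         (batch, image_seq_len, dim) -- image/latent tokens
    # encoder_hidden_states: (batch, text_seq_len, dim)  -- prompt embeddings
    text_seq_len = encoder_hidden_states.shape[1]

    # The line from the issue: text and image tokens are fused into one
    # sequence, so attention inside the block operates on both jointly.
    hidden_states = torch.cat([encoder_hidden_states, hidden_states], dim=1)

    # `block_fn` is a hypothetical placeholder for the block's norm/attn/MLP.
    hidden_states = block_fn(hidden_states, temb)

    # Split the fused sequence back afterwards so callers (e.g. a first-block
    # cache) can keep tracking the two streams separately between blocks.
    encoder_hidden_states = hidden_states[:, :text_seq_len]
    hidden_states = hidden_states[:, text_seq_len:]
    return encoder_hidden_states, hidden_states


# Dummy usage showing the shapes round-trip (identity block for illustration).
img = torch.randn(1, 16, 64)
txt = torch.randn(1, 4, 64)
txt_out, img_out = run_single_block(lambda h, t: h, img, txt, temb=None)
assert txt_out.shape == txt.shape and img_out.shape == img.shape
```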
Reproduction
no need
Logs
System Info
no need
Who can help?
No response