Replies: 1 comment 1 reply
This is supported in Forge, and it was proposed by auto 2 years ago, way before those.
Flux seems to follow a narrow vector, which gives great prompt adherence without bleed, but in doing so it works on its own terms.
Because of this narrow vector, it seems to miss certain ideas given in a prompt.
As a fix, it'd be awesome if, either as a native feature or through an extension, we could combine two prompts with conditional averaging, as can already be done in ComfyUI.
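To make the request a bit more concrete, here is a rough sketch of what I mean by averaging two conditionings. It assumes the average is a plain weighted linear blend of two prompt embeddings of the same shape. The function name, the `strength` parameter, and the nested-list stand-in for tensors are all made up for illustration and are not Forge or ComfyUI API.

```python
from typing import List

# Stand-in for an encoder output: [tokens][features].
# A real implementation would operate on tensors; lists keep the sketch dependency-free.
Embedding = List[List[float]]


def average_conditioning(
    cond_to: Embedding,
    cond_from: Embedding,
    strength: float = 0.5,
) -> Embedding:
    """Blend two prompt embeddings (hypothetical helper).

    strength=1.0 returns cond_to unchanged, strength=0.0 returns cond_from.
    """
    if not 0.0 <= strength <= 1.0:
        raise ValueError("strength must be between 0 and 1")
    if len(cond_to) != len(cond_from):
        # Assumption: both prompts were padded to the same token count.
        raise ValueError("embeddings must have the same number of tokens")

    blended: Embedding = []
    for row_to, row_from in zip(cond_to, cond_from):
        if len(row_to) != len(row_from):
            raise ValueError("embeddings must have the same feature size")
        blended.append(
            [strength * a + (1.0 - strength) * b for a, b in zip(row_to, row_from)]
        )
    return blended


# Toy example with one token and two features per prompt.
prompt_a = [[1.0, 2.0]]
prompt_b = [[3.0, 4.0]]
mixed = average_conditioning(prompt_a, prompt_b, strength=0.5)
```

The blended embedding would then be passed to the sampler in place of a single prompt's conditioning, with `strength` controlling how much each prompt contributes.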
The other implication of creating a conditional average is that clip_l and t5xxl could be combined without losing prompt adherence, since using them together negatively impacts it. For example, the user would write two separate prompts: one for t5xxl (natural narrative language) and one for clip_l (tag-style language, more similar to SDXL). The two would then be averaged, leading (in theory) to better prompt adherence. I would have to test this further to see whether it has merit. For further analysis of how using both encoders together negatively impacts prompt adherence, see:
#1182
The images and some of the info were sourced from Latent Vision's video, which I highly recommend for anyone using Flux: https://youtu.be/tned5bYOC08?si=jloiEz-dR8ecYFJG