Improving prompt understanding in Stable Diffusion (e.g. LLM integration) #13321
CalculonPrime started this conversation in General
Even the latest SD version (SDXL, I assume?) doesn't seem capable of keeping separate objects distinct and assigning each its own attributes, and it doesn't appear to respect positional relationships described in the prompt.
However, most of us have used ChatGPT and know that this should be a solvable problem. I see from the LLM-grounded Diffusion paper (arXiv/Cornell) that research is already underway toward this goal; a rough sketch of that idea is below.
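Purely as an illustration (this is not an existing SD or webui feature), here is a minimal Python sketch of the general approach from that line of research: ask an LLM to decompose the prompt into per-object captions and bounding boxes, then hand that layout to a region-aware generator. The LLM call is stubbed with a hardcoded reply, and `decompose_prompt_with_llm`, the JSON schema, and the example boxes are all made up for the sketch.

```python
# Sketch only: LLM decomposes a prompt into a layout before generation.
# Any chat model that can return JSON could fill the stubbed step.
import json

# Instruction one might send to the LLM alongside the user's prompt (assumed wording).
LLM_INSTRUCTION = (
    "Split the scene description into objects. For each object return its "
    "caption (including attributes) and a bounding box [x0, y0, x1, y1] in "
    "0-1 image coordinates. Reply with JSON only."
)

def decompose_prompt_with_llm(prompt: str) -> str:
    """Placeholder for a real LLM call that would send LLM_INSTRUCTION plus
    `prompt` to a chat model; returns a hardcoded example reply instead."""
    return json.dumps({
        "objects": [
            {"caption": "a red cube", "box": [0.05, 0.55, 0.45, 0.95]},
            {"caption": "a blue sphere to the right of the cube", "box": [0.55, 0.50, 0.95, 0.95]},
        ],
        "background": "a plain gray tabletop, studio lighting",
    })

prompt = "a red cube with a blue sphere to its right, on a gray tabletop"
layout = json.loads(decompose_prompt_with_llm(prompt))

# Per-object phrases and boxes recovered from the LLM's structured reply.
phrases = [obj["caption"] for obj in layout["objects"]]
boxes = [obj["box"] for obj in layout["objects"]]

# These would then drive a layout-conditioned generation step (the region
# control described in the LLM-grounded Diffusion paper), so attributes stay
# bound to the right object and positions in the prompt are respected.
print(phrases)
print(boxes)
```

The point of the two-stage split is that the language model handles the part SD struggles with (parsing which attribute belongs to which object, and where things go), while the diffusion model only has to fill in each region.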
Let's have this thread be a place for a high-level discussion of such potential improvements to SD. Are the authors of SD already attempting a similar integration with LLM software?