Too many fingers - it maybe can be solved with a bit of a performance loss ? #1821

cmp-nct · 2022-10-06T18:29:15Z

cmp-nct
Oct 6, 2022

We can't fix the model itself to draw a hand correctly but we need those proper hands, arms, legs, toes, etc ..
Sometimes it works fine, but it seems more like random luck how many fingers a hand is showing.

I watched hundreds of hands begin to form and then end up badly but during the forming process you could see that it can go both ways, it just went bad. What that area would need is a tiny nudge at the right time to form a proper hand.

I am proposing a temporary solution that hooks in between the steps during generation and analyzes the image.
Using object recognition or a compact trained model to specifically detect an area with wrong fingers we should be able to detect most of the wrong outcomes early on.
When that is detected we could reverse y steps, modify the noise/image in the problem area in a semi-randomized fashion (using the seed for reproducibility) let it process again.
That step could be set to repeat up to "n" times until it gives up. When giving up it chooses the solution where the error was smallest.

An alternative to that would be inpainting as a post-processing step using the same method to detect and retry hands but often an error of that sort is getting worse through more steps. So stopping it at it's root might work best.

It's just an idea, maybe someone who worked in that area could tell if this sounds doable ?

Miraculix200 · 2022-10-07T10:52:41Z

Miraculix200
Oct 7, 2022

There was someone on Reddit yesterday who used textual inversion or dreambooth to train proper hands into a model or embedding

0 replies

Miraculix200 · 2022-10-07T10:54:06Z

Miraculix200
Oct 7, 2022

https://www.reddit.com/r/StableDiffusion/comments/xwzyvh/custom_hand_model_mean_pretty_much_perfect_hands/

0 replies

cmp-nct · 2022-10-07T13:26:04Z

cmp-nct
Oct 7, 2022
Author

Looks interesting, I wonder how flexible it will turn out to adapt and you'll have to refer those hands in your prompt. Maybe it's enough to refer them with a low weight [[]] though

0 replies

ss8319 · 2022-11-23T04:56:42Z

ss8319
Nov 23, 2022

Another way might be to train the model on some hands dataset or use pose estimators to guide the diffussion process to get better hands.

0 replies

mom333 · 2022-11-23T05:22:42Z

Direct link to the .pt: https://huggingface.co/datasets/Nerfgun3/bad_prompt/resolve/main/bad_prompt_version2.pt (it's 25.5 kB). Rename it to bad_prompt.pt, put it in embeddings, and add bad_prompt (or (bad_prompt:0.8)) in your negative prompt.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Too many fingers - it maybe can be solved with a bit of a performance loss ? #1821

Uh oh!

{{title}}

Uh oh!

Replies: 5 comments 2 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Too many fingers - it maybe can be solved with a bit of a performance loss ? #1821

Uh oh!

cmp-nct Oct 6, 2022

Replies: 5 comments · 2 replies

Uh oh!

Miraculix200 Oct 7, 2022

Uh oh!

Miraculix200 Oct 7, 2022

Uh oh!

cmp-nct Oct 7, 2022 Author

Uh oh!

ss8319 Nov 23, 2022

Uh oh!

mom333 Nov 23, 2022

Uh oh!

ss8319 Nov 25, 2022

Uh oh!

garrett Nov 25, 2022

cmp-nct
Oct 6, 2022

Replies: 5 comments 2 replies

Miraculix200
Oct 7, 2022

Miraculix200
Oct 7, 2022

cmp-nct
Oct 7, 2022
Author

ss8319
Nov 23, 2022

mom333
Nov 23, 2022