Replies: 7 comments 7 replies
-
you can train on 1024-resolution images and you'll get better results, but at inference you shouldn't go below 1024
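To make "inference at 1024" concrete, here is a minimal sketch assuming the Hugging Face `diffusers` library and an SDXL checkpoint (the model name and the `sks person` token are placeholders; in a GUI the equivalent is simply setting width/height to 1024):

```python
import torch
from diffusers import StableDiffusionXLPipeline

# Placeholder checkpoint; swap in your own fine-tuned model.
pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
).to("cuda")

# Keep inference at or above the training resolution (1024 px here).
image = pipe(
    "closeup portrait of sks person, sharp focus",
    width=1024,
    height=1024,
    num_inference_steps=30,
).images[0]
image.save("portrait_1024.png")
```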
-
OK, I will try. What do you mean by "inference"? (I'm using Draw Things on an M2 Pro)
-
The way I deal with this is to upscale, then manually crop out the face in Photoshop, do an img2img on that new image, and paste the result back into the original upscaled image in Photoshop. I haven't done it much lately, though, since it's a lot more work and I'm lazy, and I prefer a face to be in the foreground anyway, so typically I just add the words closeup, portrait or headshot to my prompts and generate more new images. I'm going to use this method again this upcoming week because I'm making some posters that will be enlarged to… well, poster size, so then I'll be OK with more of a landscape where the person is a smaller part of the overall image area. But you have to look past the fact that your subject doesn't look good at all and work on the image anyway, because the overall image, composition and everything else is great. Also, I have to say that often img2img simply doesn't work for me and I get no improved details that way for some reason. It happens.
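A rough sketch of the same crop → img2img → paste-back idea, scripted with `diffusers` and PIL instead of Photoshop (the crop box, file names and model are placeholders):

```python
import torch
from PIL import Image
from diffusers import StableDiffusionImg2ImgPipeline

pipe = StableDiffusionImg2ImgPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

full = Image.open("upscaled.png")
box = (820, 240, 1332, 752)          # hypothetical face region, 512x512 after crop
face = full.crop(box)

# Low strength keeps the composition while letting the model redraw facial detail.
fixed = pipe(
    "closeup portrait of sks person, detailed face",
    image=face,
    strength=0.4,
    guidance_scale=7.5,
).images[0]

# Paste the repainted face back into the upscaled original.
full.paste(fixed.resize(face.size), box[:2])
full.save("upscaled_fixed_face.png")
```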
-
I am having the same results, and my guess (maybe I'm wrong) is that Stable Diffusion has no idea what a face (or any other concept) is, or that it should be resized. Stable Diffusion works by adding noise to images (when training) and progressively denoising them (when generating new images). This is a gross oversimplification, but it can help explain why generation of small faces doesn't work: the model simply can't recognize a small patch of, say, 100x125px as a "candidate" for your face if you trained it on photos where your face mostly filled a 512x512px area. It will still throw other people's faces into that 100x125px area, simply because many of the photos it was originally trained on had smaller faces to begin with.
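As a back-of-the-envelope illustration of how little signal a small face carries (assuming the standard 8x VAE downscaling used by Stable Diffusion):

```python
# Stable Diffusion's VAE downsamples images by a factor of 8, so a small face
# occupies only a handful of latent pixels -- far less than the near-full-frame
# faces the model was fine-tuned on.
def latent_size(w_px: int, h_px: int, downscale: int = 8) -> tuple[int, int]:
    return w_px // downscale, h_px // downscale

print(latent_size(512, 512))   # (64, 64) -> face fills the whole latent during training
print(latent_size(100, 125))   # (12, 15) -> tiny face at generation time
```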
-
I have tried a "cheat" for some specific images that maybe you should try. I generate a 1024x768 picture very close to the face, then I use Dalle-2 to outpaint the scenery, and then I use inpainting in SD to restore the image to the intended style. It's more work, but I've found it useful for specific results.
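For the SD half of this workflow, here is a rough sketch using the `diffusers` inpainting pipeline (file names, mask and model are placeholders; the Dalle-2 outpainting step happens outside this script):

```python
import torch
from PIL import Image
from diffusers import StableDiffusionInpaintPipeline

pipe = StableDiffusionInpaintPipeline.from_pretrained(
    "runwayml/stable-diffusion-inpainting", torch_dtype=torch.float16
).to("cuda")

outpainted = Image.open("dalle2_outpainted.png").convert("RGB")
mask = Image.open("background_mask.png").convert("L")  # white = regions to repaint

# Repaint the outpainted background so it matches the target style.
restyled = pipe(
    "consistent lighting, same painterly style as the portrait",
    image=outpainted,
    mask_image=mask,
    num_inference_steps=30,
).images[0]
restyled.save("restyled.png")
```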
-
Thanks, @meunumerotim333, this is an interesting workflow! I am wondering, why are you using Dalle-2 for outpainting?
-
If I generate a large picture, everything goes wrong.
-
Hi, I get stellar results with my models trained on a face using "portrait" and/or "closeup" prompts - when the face covers most of the 512px area or is even bigger. Really impressive.
But when I try to render the same model in a more wide-angle shot with more of the body visible, things change dramatically. There is a faint resemblance at best, more often no likeness at all. Sometimes to the extent of "cursed faces" with washed-out features.
Things I tried:
...but to no avail. So is there a secret I'm missing? Some best practice for medium-sized/small faces?