Optimal parameters (unet, text encoder, concept steps) #981
Unanswered
specblades
asked this question in
Q&A
Replies: 1 comment 12 replies
-
don't use tags for the images, use only one single identifier for all the instance images, and for the concept images, you don't need to rename them or crop them, just upload them. for 40 images and 200 concept images, start with 3500 UNet, 600 text_enc, and 800 concept_text_enc |
Beta Was this translation helpful? Give feedback.
12 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Before I have had too many unsuccessful attempts (4 so far), I want to ask - how can I calculate in advance what parameters to set?
I would be insanely happy if you could share your experiences.
For example.
I train for subjects.
Different representations of the same class of objects, 40 images, each named with 4 tags, 200 concept imgs
tags: men/women, armor, leather/chain/heavy/futuristic, helmet/helmetless
As I understand it (for the next attempt, prev was totally garbage), the approximate values should be: 4000 unet, 1200-1600 text encoder, 600-700 encoder concept steps.
Correct me please.
Beta Was this translation helpful? Give feedback.
All reactions