Replies: 2 comments 11 replies
-
In general, first fork this repository: Then you can make the changes in your fork and when you're done submit a pull request from there. There's some more good info here: |
Beta Was this translation helpful? Give feedback.
-
Ive seen other posts from you on reddit and I think you are a genius. I hope A1111 takes some advice from you. You said you didnt want to make an integration of voltaML into A1111 as to not duplicate efforts - here we are weeks later and nothing has been done, there were only performance degrading updates to the A1111 repo. You made the impression that you are able to do it - it seems like the voltaML engineers dont care about integrating it into other UIs anymore so, once you figured out how to do github contributions, would you be able to? The voltaML license would allow it, but only with mentioning the creators Im just asking because progress on SD seems to be coming to a crawl since SD 2.1 and some fresh wind would be very much appreciated by all of us eagerly awaiting the distilled diffusion models and hoping they make one for 1.5 too |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Sometimes you want to generate 100 images to pick and find the best. I've tested two changes to improve throughput. One fixes #5409 and the other implements an idea of mine. I'm posted some research on memory usage 2 days ago. #5409 is the fact that performance drops with the commit 67efee3
It you have enough memory there is no reason not to use the faster version of decode_first_stage(). So I add an option "--hivram" when you don't care about usage(within reason) and want performance.
The second change moves the post GPU image processing, like image save, onto a second thread so the main thread can move to the next batch without waiting. I put this under a "--pllcpugpu" option.
With a batch size of 16 and 6 batches I got a 12.5 percent improvement in image generation throughput.
Given that I have never done a github contribution before I need to know what I need to do as the first couple of steps.
?Create a feature request to track the work? Should it be a bug?
I know how to clone, create a branch, make changes, commit but don't know the github stuff.
BASELINE
Generated 96 (6 X 16) images in 57.048285 seconds
Time per image 0.59425296875 seconds
HIVRAM
Generated 96 (6 X 16) images in 53.527927 seconds
Time per image 0.5575825729166667 seconds
HIVRAM + PLLCPUGPU
Generated 96 (6 X 16) images in 49.773048 seconds
Time per image 0.51846925 seconds
Beta Was this translation helpful? Give feedback.
All reactions