Using NVIDIA GeForce RTX 3090 24G GPU, using DPM++2M Karras with steps of 201024 * 1024 to generate a graph at a speed of 2.55 it/s. Is this speed normal? #15113
hjj-lmx
started this conversation in
Optimization
Replies: 1 comment 2 replies
-
With TensorRT you will hit a max of 5 it/s, and that is the limit of the 3090 for single image. FP8 does not give speed improvements on the Ampere tensor cores, only memory savings. |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
How can you optimize the speed? If you use Controlnet, the speed will be slower. Do you have any good suggestions?
Beta Was this translation helpful? Give feedback.
All reactions