Performance Comparison - Vanilla vs. Pytorch 2.0 + Optimization on RTX 3070 (OC) #6615
Replies: 5 comments 15 replies
-
Thats interesting, i thought |
Beta Was this translation helpful? Give feedback.
-
fyi, i've tried enabing it on my rtx3060 and definitely not something to use daily - my batch-size 1 has same performance as before, but using higher batch sizes no longer has any improvements, performance is constant. which means by batch size 8, i'm loosing 25% of performance. |
Beta Was this translation helpful? Give feedback.
-
How to add mode="max-autotune"? |
Beta Was this translation helpful? Give feedback.
-
Is there any tutorial out there on how to apply this upgrade? Would it work on an RTX 2080S too? |
Beta Was this translation helpful? Give feedback.
-
Started discussion #6932 |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Dell 3070 (8GB) OC, i7 10700F, Headless Win 10 Home, 16GB DDR4
Studio Driver 528.02, cuDNN 8.7.0.84 for cuda 11.8
Steps: 20, Sampler: Euler a, CFG scale: 7, Seed: 3508515359, Size: 512x512, Model hash: a9263745, Model: v1-5-pruned
Vanilla
commit: 8850fc2
PyTorch 2.0 cu118
commit: 8850fc2
python: 3.10.6 • torch: 2.0.0 • xformers: 0.0.15+6cd1b36.d20230108 • gradio: 3.15.0
arguments: --listen --xformers --enable-insecure-extension-access --opt-channelslast
set PYTORCH_CUDA_ALLOC_CONF=garbage_collection_threshold:0.9,max_split_size_mb:464
With mode="max-autotune"
python: 3.10.6 • torch: 2.0.0 • xformers: 0.0.15+6cd1b36.d20230108 • gradio: 3.15.0 • commit: a0ef416
With torch.backends.cudnn.benchmark = True & mode="max-autotune"
python: 3.10.6 • torch: 2.0.0 • xformers: 0.0.15+6cd1b36.d20230108 • gradio: 3.15.0 • commit: a0ef416
512x512:
Higher Resolutions:
768x768:
1920x1088:
2048x1280:
1 : Calculated value. It/s for each image generation in the batch,
NOTE: OVERCLOCKED 3070 ON A HEADLESS WINDOWS 10 MACHINE.
Download cuDNN Libraries (Windows / Linux)
cuDNN license agreement: https://developer.nvidia.com/cudnn/license_agreement
Beta Was this translation helpful? Give feedback.
All reactions