Skip to content

Commit a98c37b

Browse files
authored
Added extra steps to update the Cudnnn DLL found in the Torch packages (#4459)
I added extra steps to update the Cudnnn DLL found in the Torch package because it wasn't optimised or didn't use the lastest version. So manually updating it can speed up iteration but the result might differ from each card. Exemple i passed from 3 it/s to a steady 20 it/s. ## What type of PR is this? (check all applicable) - [ ] Refactor - [ ] Feature - [ ] Bug Fix - [ ] Optimization - [x] Documentation Update - [ ] Community Node Submission ## Have you discussed this change with the InvokeAI team? - [x] Yes - [ ] No, because: ## Have you updated all relevant documentation? - [x] Yes - [ ] No ## Description ## Related Tickets & Documents <!-- For pull requests that relate or close an issue, please include them below. For example having the text: "closes #1234" would connect the current pull request to issue 1234. And when we merge the pull request, Github will automatically close the issue. --> - Related Issue # - Closes # ## QA Instructions, Screenshots, Recordings <!-- Please provide steps on how to test changes, any hardware or software specifications as well as any other pertinent information. --> ## Added/updated tests? - [x] Yes - [ ] No : _please replace this line with details on why tests have not been included_ ## [optional] Are there any post deployment tasks we need to perform?
2 parents 0e5eac7 + 252adb9 commit a98c37b

File tree

1 file changed

+24
-0
lines changed

1 file changed

+24
-0
lines changed

docs/installation/030_INSTALL_CUDA_AND_ROCM.md

Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -57,6 +57,30 @@ familiar with containerization technologies such as Docker.
5757
For downloads and instructions, visit the [NVIDIA CUDA Container
5858
Runtime Site](https://developer.nvidia.com/nvidia-container-runtime)
5959

60+
### cuDNN Installation for 40/30 Series Optimization* (Optional)
61+
62+
1. Find the InvokeAI folder
63+
2. Click on .venv folder - e.g., YourInvokeFolderHere\\.venv
64+
3. Click on Lib folder - e.g., YourInvokeFolderHere\\.venv\Lib
65+
4. Click on site-packages folder - e.g., YourInvokeFolderHere\\.venv\Lib\site-packages
66+
5. Click on Torch directory - e.g., YourInvokeFolderHere\InvokeAI\\.venv\Lib\site-packages\torch
67+
6. Click on the lib folder - e.g., YourInvokeFolderHere\\.venv\Lib\site-packages\torch\lib
68+
7. Copy everything inside the folder and save it elsewhere as a backup.
69+
8. Go to __https://developer.nvidia.com/cudnn__
70+
9. Login or create an Account.
71+
10. Choose the newer version of cuDNN. **Note:**
72+
There are two versions, 11.x or 12.x for the differents architectures(Turing,Maxwell Etc...) of GPUs.
73+
You can find which version you should download from [this link](https://docs.nvidia.com/deeplearning/cudnn/support-matrix/index.html).
74+
13. Download the latest version and extract it from the download location
75+
14. Find the bin folder E\cudnn-windows-x86_64-__Whatever Version__\bin
76+
15. Copy and paste the .dll files into YourInvokeFolderHere\\.venv\Lib\site-packages\torch\lib **Make sure to copy, and not move the files**
77+
16. If prompted, replace any existing files
78+
79+
**Notes:**
80+
* If no change is seen or any issues are encountered, follow the same steps as above and paste the torch/lib backup folder you made earlier and replace it. If you didn't make a backup, you can also uninstall and reinstall torch through the command line to repair this folder.
81+
* This optimization is intended for the newer version of graphics card (40/30 series) but results have been seen with older graphics card.
82+
83+
6084
### Torch Installation
6185

6286
When installing torch and torchvision manually with `pip`, remember to provide

0 commit comments

Comments
 (0)