Add a warning for special devices #15563
Mostly just minor changes. I've revised the recommended compilation options since it's still possible to use GTX 16XX in conjunction with Ampere or newer.
Co-authored-by: Johannes Gäßler <[email protected]>
For the renaming of the vector, you can't just accept the suggested change on GitHub because the name needs to be changed consistently everywhere it is used. If you fix that I'll merge this PR.
It now compiles and runs successfully.
This causes an abort unless you have compiled in Turing support, see #15584
I would guess that's due to 7d3e9fd#diff-cb2761994492f7839320b81765a6fc2a23c180e3ddfd4a51d6b0f618f8d76a69R277 ? A fix might be along the lines of replacing
seems to work; at least the model was loaded successfully and an answer was generated
This commit seems to break llama-server on my system:
Should be fixed with #15587.
* Add warning
* Print the devices names
* Add newlines
* Apply suggestions from code review
  Co-authored-by: Johannes Gäßler <[email protected]>
* Fix vector names
---------
Co-authored-by: Johannes Gäßler <[email protected]>
This reverts commit 4c37636.
Add a warning for special GTX series cards that have low performance due to MMA kernels.
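A minimal sketch of the kind of check described above, not the code from this PR. It assumes the affected cards can be recognized as compute capability 7.5 (Turing) devices whose name contains "GTX 16". The `DeviceInfo` struct, the function names, and the warning text are all hypothetical and exist only for illustration.

```cpp
#include <string>
#include <vector>

// Hypothetical device record; the real backend queries this from the CUDA runtime.
struct DeviceInfo {
    int         id;
    std::string name;
    int         cc_major; // compute capability, major part
    int         cc_minor; // compute capability, minor part
};

// Assumption: GTX 16XX cards report compute capability 7.5 like other Turing
// cards but have no tensor cores, so they are matched here by name.
static bool is_turing_without_tensor_cores(const DeviceInfo & dev) {
    return dev.cc_major == 7 && dev.cc_minor == 5 &&
           dev.name.find("GTX 16") != std::string::npos;
}

// Collect every device that should be mentioned in the warning.
static std::vector<DeviceInfo> collect_special_devices(const std::vector<DeviceInfo> & devices) {
    std::vector<DeviceInfo> special;
    for (const DeviceInfo & dev : devices) {
        if (is_turing_without_tensor_cores(dev)) {
            special.push_back(dev);
        }
    }
    return special;
}

// Build the warning text, one device name per line (hypothetical wording).
// Returns an empty string when there is nothing to warn about.
static std::string format_warning(const std::vector<DeviceInfo> & special) {
    if (special.empty()) {
        return "";
    }
    std::string msg = "warning: these devices may run slowly with MMA kernels:\n";
    for (const DeviceInfo & dev : special) {
        msg += "  Device " + std::to_string(dev.id) + ": " + dev.name + "\n";
    }
    return msg;
}
```

Collecting the devices first and printing once keeps the message to a single block even when several such cards are installed alongside Ampere or newer GPUs.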