Xformers or sdp attention support? #310
just-someguy started this conversation in Ideas
Replies: 1 comment
-
This is currently not being pursued, since we have other priorities. One of them is a backend overhaul that will let the community add things to the backend much more easily. We are mostly in a phase where we are aiming to get things stable so we can ship a new main version of KoboldAI, and since it's by far the biggest update we have ever worked on, that's taking a while. But once the new backend overhaul is in, people from the community might be able to add this before we have time for these kinds of additions again.
-
I can barely find any mention on the internet of KoboldAI using an improved attention implementation like xformers or sdp_attention, with just a few people on Reddit saying they wish it were a feature. These greatly improve token generation speed, and I'd like to see them added. Is this something being worked on, or is it not a feature being pursued?
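For anyone unfamiliar with the terms: xformers (`xformers.ops.memory_efficient_attention`) and PyTorch 2.x (`torch.nn.functional.scaled_dot_product_attention`) both provide fused, memory-efficient kernels for the standard attention computation softmax(QK^T / sqrt(d)) V. The sketch below is only a plain-Python reference of that math, assuming unbatched, unmasked inputs as nested lists. It is not KoboldAI code, and the function name is made up for illustration.

```python
import math

def reference_attention(q, k, v):
    """Illustrative reference: softmax(q @ k^T / sqrt(d)) @ v on nested lists.

    q: list of query rows, k: list of key rows, v: list of value rows.
    No batching, masking, or dropout (simplifying assumptions).
    """
    d = len(q[0])
    out = []
    for q_row in q:
        # Dot product of this query with every key, scaled by sqrt(d).
        scores = [
            sum(a * b for a, b in zip(q_row, k_row)) / math.sqrt(d)
            for k_row in k
        ]
        # Numerically stable softmax over the scores.
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Output row is the attention-weighted sum of the value rows.
        out.append([
            sum(w * v_row[j] for w, v_row in zip(weights, v))
            for j in range(len(v[0]))
        ])
    return out
```

The optimized kernels aim to produce the same result (up to floating-point differences) without materializing the full score matrix, which is where the speed and memory savings come from.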