KV split while using -sm row
#397
Unanswered
pt13762104
asked this question in
Q&A
Replies: 1 comment 1 reply
-
I have never looked into splitting the KV cache when using Out of curiosity: does |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I have found that ik_llama.cpp does NOT support kv-split while using
-sm row
, which is a limitation compared to llama.cpp. Is there any way to do this or it's just not implemented yet?Example output:
Beta Was this translation helpful? Give feedback.
All reactions