请问一下在prefill计算时,ProcessKV和FlattenKV的作用分别是什么? #2063
Unanswered
sitabulaixizawaluduo
asked this question in
Q&A
Replies: 1 comment 3 replies
-
是的 |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
是不是可以理解ProcessKV是把经过qkv_proj计算的k v存到block里面,FlattenKV是把block kv cache转为整块的kv cache参与后续的attention计算?
Beta Was this translation helpful? Give feedback.
All reactions