Commit 7638a4b
committed
Update base for Update on "[Executorch][llm] Enable leveraging ring kv cache via module swap"
This allows us to make some of the attention modules to use sliding window kv cache. Will help enable models like gemma3.
Differential Revision: [D73891426](https://our.internmc.facebook.com/intern/diff/D73891426/)
[ghstack-poisoned]1 parent 74c0dff commit 7638a4b
File tree
0 file changed
+0
-0
lines changed0 file changed
+0
-0
lines changed
0 commit comments