Update on "[llama-mm] Enable kv cache for MultiHeadAttention"
Summary: Change `MultiHeadAttention` in `extension/llm/modules` to
support KV cache. This enables the KV cache in eager mode only; export is not supported yet.
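For readers unfamiliar with the technique, here is a minimal sketch of what a KV cache does during decoding. It is plain Python and does not reflect the actual `MultiHeadAttention` code in `extension/llm/modules`; the names `KVCache`, `attend`, and `decode_step` are hypothetical and exist only for illustration.

```python
import math


class KVCache:
    """Hypothetical append-only cache of per-step key/value vectors."""

    def __init__(self):
        self.keys = []
        self.values = []

    def update(self, k, v):
        # Store the new key/value so later steps do not recompute them.
        self.keys.append(k)
        self.values.append(v)
        return self.keys, self.values


def attend(q, keys, values):
    """Scaled dot-product attention for one query over all cached entries."""
    scale = 1.0 / math.sqrt(len(q))
    scores = [scale * sum(qi * ki for qi, ki in zip(q, k)) for k in keys]
    # Subtract the max score before exponentiating for numerical stability.
    m = max(scores)
    weights = [math.exp(s - m) for s in scores]
    total = sum(weights)
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) / total
            for d in range(dim)]


def decode_step(cache, q, k, v):
    # Only the newest token's k and v are passed in; earlier ones come
    # from the cache.
    keys, values = cache.update(k, v)
    return attend(q, keys, values)


cache = KVCache()
first = decode_step(cache, [1.0, 0.0], [1.0, 0.0], [2.0, 4.0])
second = decode_step(cache, [0.0, 0.0], [0.0, 1.0], [4.0, 0.0])
```

Each decoding step supplies only the newest token's projections, so the per-step cost grows with the cache length instead of requiring a full recomputation over the whole sequence.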
Test Plan: Unit test
Reviewers:
Subscribers:
Tasks:
Tags:
[ghstack-poisoned]