Refactoring of multi-head attention and support for KV caching#2061
Closed
mseeger wants to merge 1 commit intoLightning-AI:mainfrom
Closed
Refactoring of multi-head attention and support for KV caching#2061mseeger wants to merge 1 commit intoLightning-AI:mainfrom
mseeger wants to merge 1 commit intoLightning-AI:mainfrom