Conversation

@sfc-gh-zhwang sfc-gh-zhwang commented Sep 11, 2023

Get rid of the hacky MHA kv-cache path for llama2-70b and handle the kv-cache directly in the GQA implementation.
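For context, handling the cache "in the GQA implementation" means storing keys/values once per kv head rather than replicating them to the full query-head count as an MHA-style cache would. A minimal NumPy sketch of one decode step under that layout (names, shapes, and the plain softmax loop are illustrative assumptions, not the code in this PR):

```python
import numpy as np

def gqa_attention_step(q, k_new, v_new, kv_cache, pos):
    """One decode step of grouped-query attention with a per-kv-head cache.

    q:        (num_q_heads, head_dim)   query for the current token
    k_new:    (num_kv_heads, head_dim)  new key (fewer heads than q)
    v_new:    (num_kv_heads, head_dim)  new value
    kv_cache: dict with 'k' and 'v' of shape (num_kv_heads, max_seq, head_dim)
    pos:      index of the current token
    """
    num_q_heads, head_dim = q.shape
    num_kv_heads = k_new.shape[0]
    group = num_q_heads // num_kv_heads   # query heads sharing each kv head

    # Write the new key/value at `pos`; the cache is laid out per kv head,
    # so nothing is duplicated across the query-head group.
    kv_cache['k'][:, pos] = k_new
    kv_cache['v'][:, pos] = v_new

    k = kv_cache['k'][:, :pos + 1]        # (num_kv_heads, pos+1, head_dim)
    v = kv_cache['v'][:, :pos + 1]

    out = np.empty_like(q)
    for h in range(num_q_heads):
        kv_h = h // group                 # map query head -> shared kv head
        scores = k[kv_h] @ q[h] / np.sqrt(head_dim)   # (pos+1,)
        weights = np.exp(scores - scores.max())
        weights /= weights.sum()
        out[h] = weights @ v[kv_h]
    return out
```

For llama2-70b (64 query heads, 8 kv heads) this layout makes the cache 8x smaller than an MHA-shaped cache that replicates k/v per query head.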

@sfc-gh-zhwang sfc-gh-zhwang changed the title Zhwang/llama gqa Implement kv-cache for gqa Sep 11, 2023
@sfc-gh-zhwang sfc-gh-zhwang changed the title Implement kv-cache for gqa a Sep 12, 2023