Skip to content

Commit e510907

Browse files
committed
move k forward_expand up
1 parent eac4bde commit e510907

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

src/llama-graph.cpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1595,12 +1595,12 @@ ggml_tensor * llm_graph_context::build_attn(
15951595
// expand k later to enable rope fusion which directly writes into k-v cache
15961596
ggml_build_forward_expand(gf, q_cur);
15971597
ggml_build_forward_expand(gf, v_cur);
1598+
ggml_build_forward_expand(gf, k_cur);
15981599

15991600
const auto * mctx_cur = inp->mctx;
16001601

16011602
// store to KV cache
16021603
{
1603-
ggml_build_forward_expand(gf, k_cur);
16041604
const auto & k_idxs = inp->get_k_idxs();
16051605
const auto & v_idxs = inp->get_v_idxs();
16061606

0 commit comments

Comments
 (0)