Commit 375de5b

llama : use unused n_embd_k_gqa in k_shift
This also slightly reduces the diff from the master branch
1 parent: 5f62db7

1 file changed: +1 -1 lines changed

src/llama.cpp

Lines changed: 1 addition & 1 deletion
@@ -10806,7 +10806,7 @@ struct llm_build_context {
                     ggml_view_3d(ctx0, kv_self.k_l[il],
                         n_embd_head_k, n_head_kv, n_ctx,
                         ggml_row_size(kv_self.k_l[il]->type, n_embd_head_k),
-                        ggml_row_size(kv_self.k_l[il]->type, hparams.n_embd_k_gqa(il)),
+                        ggml_row_size(kv_self.k_l[il]->type, n_embd_k_gqa),
                         0),
                     lctx.inp_K_shift, rope_factors, n_rot, rope_type, n_ctx_orig, freq_base, freq_scale,
                     ext_factor, attn_factor, beta_fast, beta_slow);
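
For context, the two `ggml_row_size(...)` arguments in the hunk above are the byte strides of the 3D view over the K cache: the first steps from one head to the next, the second from one cache position to the next. The sketch below illustrates that arithmetic only. `row_size_bytes`, `KViewStrides`, and `k_view_strides` are hypothetical stand-ins, not ggml or llama.cpp functions, and it assumes that `n_embd_k_gqa` equals `n_embd_head_k * n_head_kv` and that a row's byte size is the per-block byte size times the number of blocks in the row.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical stand-in for a row-size helper: bytes occupied by `ne`
// elements of a type that stores `block_elems` elements in `type_size` bytes.
// Assumption: `ne` must be a whole number of blocks.
static size_t row_size_bytes(size_t type_size, int64_t block_elems, int64_t ne) {
    assert(ne % block_elems == 0);
    return type_size * static_cast<size_t>(ne / block_elems);
}

// Byte strides for viewing a flat K buffer as
// [n_embd_head_k, n_head_kv, n_ctx].
struct KViewStrides {
    size_t nb1; // bytes between consecutive KV heads
    size_t nb2; // bytes between consecutive cache positions
};

static KViewStrides k_view_strides(size_t type_size, int64_t block_elems,
                                   int64_t n_embd_head_k, int64_t n_embd_k_gqa) {
    KViewStrides s;
    s.nb1 = row_size_bytes(type_size, block_elems, n_embd_head_k);
    s.nb2 = row_size_bytes(type_size, block_elems, n_embd_k_gqa);
    return s;
}
```

For example, with a 2-byte unblocked element type, a head size of 128, and 8 KV heads (so `n_embd_k_gqa` is 1024 under the stated assumption), `k_view_strides(2, 1, 128, 1024)` gives strides of 256 and 2048 bytes. If the local `n_embd_k_gqa` holds the same value as `hparams.n_embd_k_gqa(il)`, the second stride is unchanged by the commit.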
