Skip to content

Commit 204cd80

Browse files
committed
feat(wip): Use type_k/type_v for hybrid cache types
Branch: Mamba2SSD Signed-off-by: Gabe Goodhart <[email protected]>
1 parent 6256f9a commit 204cd80

File tree

1 file changed

+2
-2
lines changed

1 file changed

+2
-2
lines changed

src/llama-model.cpp

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -6786,8 +6786,8 @@ llama_memory_i * llama_model::create_memory(const llama_memory_params & params,
67866786
/* attn_n_pad */ 1,
67876787
/* attn_n_swa */ hparams.n_swa,
67886788
/* attn_swa_type */ hparams.swa_type,
6789-
/* recurrent_type_k */ GGML_TYPE_F32,
6790-
/* recurrent_type_v */ GGML_TYPE_F32,
6789+
/* recurrent_type_r */ params.type_k,
6790+
/* recurrent_type_s */ params.type_v,
67916791
/* recurrent_kv_size */ std::max((uint32_t) 1, cparams.n_seq_max),
67926792
/* n_seq_max */ cparams.n_seq_max,
67936793
/* offload */ cparams.offload_kqv,

0 commit comments

Comments
 (0)