Skip to content

Commit 146fdc6

Browse files
authored
Merge pull request #24 from Thireus/glm-4.5
Final Glm 4.5 merge
2 parents 63284c2 + 0675f41 commit 146fdc6

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

src/llama.cpp

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -20713,7 +20713,7 @@ static void llama_model_quantize_internal(const std::string & fname_inp, const s
2071320713
// - model.arch == LLM_ARCH_DECI for Deci-Nemotron models
2071420714
//
2071520715
GGML_ASSERT((qs.n_attention_wv == 0 || qs.n_attention_wv == (int)model.hparams.n_layer || qs.n_attention_wv == 3 * (int)model.hparams.n_layer || model.arch == LLM_ARCH_DECI) && "n_attention_wv is unexpected");
20716-
20716+
2071720717
size_t total_size_org = 0;
2071820718
size_t total_size_new = 0;
2071920719

0 commit comments

Comments
 (0)