Skip to content

Commit 2f951a8

Browse files
authored
Ling-1T convert fixup (ikawrakow#837)
* Conditionally write moe_shared_expert_intermediate_size Ling-1T config.json does *not* have `moe_shared_expert_intermediate_size`. Ling-flash-2.0a *does* have it. This small patch just makes the gguf_writer conditionally detect as needed. * Fix Ling-1T missing moe_shared_expert_intermediate_size Thanks CISC for the proper patch to include the needed values!
1 parent dbfd151 commit 2f951a8

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

convert_hf_to_gguf.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4494,7 +4494,7 @@ def set_gguf_parameters(self):
44944494
self.gguf_writer.add_leading_dense_block_count(hparams["first_k_dense_replace"])
44954495
self.gguf_writer.add_vocab_size(hparams["vocab_size"])
44964496
self.gguf_writer.add_expert_feed_forward_length(hparams["moe_intermediate_size"])
4497-
self.gguf_writer.add_expert_shared_feed_forward_length(hparams["moe_shared_expert_intermediate_size"])
4497+
self.gguf_writer.add_expert_shared_feed_forward_length(hparams.get("moe_shared_expert_intermediate_size", hparams["moe_intermediate_size"] * hparams["num_shared_experts"]))
44984498
self.gguf_writer.add_expert_weights_scale(hparams["routed_scaling_factor"])
44994499
self.gguf_writer.add_expert_count(hparams["num_experts"])
45004500
self.gguf_writer.add_expert_shared_count(hparams["num_shared_experts"])

0 commit comments

Comments
 (0)