llama-graph: replace ggml_repeat_4d with ggml_repeat + ggml_new_tensor_4d

mastered-lore · mastered-lore · commit d50006e55416 · 2025-06-06T12:36:55.000-04:00
The ggml_repeat_4d function doesn't exist in the current ggml API.
Replace it with the correct approach using ggml_new_tensor_4d to
create a target tensor with the desired shape, then use ggml_repeat
to repeat the input tensor to match that shape.

This fixes compilation errors when building against system-provided
ggml libraries that don't include this non-existent function.
diff --git a/src/llama-graph.cpp b/src/llama-graph.cpp
@@ -770,7 +770,8 @@ ggml_tensor * llm_graph_context::build_moe_ffn(
 
     if (weight_before_ffn) {
         // repeat cur to [n_embd, n_expert_used, n_tokens]
-        ggml_tensor * repeated = ggml_repeat_4d(ctx0, cur, n_embd, n_expert_used, n_tokens, 1);
+        ggml_tensor * target_shape = ggml_new_tensor_4d(ctx0, cur->type, n_embd, n_expert_used, n_tokens, 1);
+        ggml_tensor * repeated = ggml_repeat(ctx0, cur, target_shape);
         cur = ggml_mul(ctx0, repeated, weights);
         cb(cur, "ffn_moe_weighted", il);
     }

Original file line number	Diff line number	Diff line change
`@@ -770,7 +770,8 @@ ggml_tensor * llm_graph_context::build_moe_ffn(`
`770`	`770`
`771`	`771`	`if (weight_before_ffn) {`
`772`	`772`	`// repeat cur to [n_embd, n_expert_used, n_tokens]`
`773`		`- ggml_tensor * repeated = ggml_repeat_4d(ctx0, cur, n_embd, n_expert_used, n_tokens, 1);`
	`773`	`+ ggml_tensor * target_shape = ggml_new_tensor_4d(ctx0, cur->type, n_embd, n_expert_used, n_tokens, 1);`
	`774`	`+ ggml_tensor * repeated = ggml_repeat(ctx0, cur, target_shape);`
`774`	`775`	`cur = ggml_mul(ctx0, repeated, weights);`
`775`	`776`	`cb(cur, "ffn_moe_weighted", il);`
`776`	`777`	`}`