Skip to content

Commit 471e026

Browse files
committed
fixed vram leak
1 parent d72f9d5 commit 471e026

File tree

1 file changed

+2
-0
lines changed

1 file changed

+2
-0
lines changed

src/llama-context.cpp

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -3050,5 +3050,7 @@ void llama_build_and_execute_mtp_graph(struct llama_context * ctx,
30503050
if (logits_mtp) {
30513051
ctx->set_logits_ith(logits_mtp, sched, last_tok_idx);
30523052
}
3053+
3054+
ggml_backend_sched_free(sched);
30533055
}
30543056

0 commit comments

Comments
 (0)