Skip to content

Add --no-op-offload to improve -ot pp perf in MoE models like llama4 400B#13386

Merged
slaren merged 4 commits intoggml-org:masterfrom
hjc4869:no_op_offload
May 11, 2025
Merged

Add `--no-op-offload` to improve `-ot` pp perf in MoE models like llama4 400B#13386
slaren merged 4 commits intoggml-org:masterfrom
hjc4869:no_op_offload

Commits

Commits on May 8, 2025

Commits on May 9, 2025

Commits on May 11, 2025