You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add a flag to the tool to ensure some tensor names are always followed by another tensor and not at the end of a shard. This ensures the shard will not be released when the tensor is processed, and avoid missing-file failures of duplicate tensors that are re-referenced a few tensors later (typically token_embd.weight / output).
printf(" --split-max-tensors max tensors in each split (default: %d)\n", default_params.n_split_tensors);
62
64
printf(" --split-max-size N(M|G) max size per split\n");
63
65
printf(" --no-tensor-first-split do not add tensors to the first split (disabled by default)\n");
66
+
printf(" --must-be-followed LAYER ensure LAYER is not the last tensor in a split and will not be released when loading after any tensor is created (can be used multiple times)\n");
64
67
printf(" --dry-run only print out a split plan and exit, without writing any new files\n");
0 commit comments