@tamarPal tamarPal commented Nov 4, 2025

Add safety checks to prevent catastrophic data loss when users accidentally specify the same file for both input and output in llama-quantize.

Changes:

  • Add same_file() function to detect identical files (including symlinks/hardlinks)
  • Block quantization when input==output without --inplace flag
  • Add --inplace flag for safe in-place quantization using temp file + atomic rename
  • Add --overwrite flag to allow overwriting existing files
  • Add comprehensive test suite (test_quantize_safety.sh)

Before this fix: the input file would be truncated immediately, causing SIGBUS and data loss.
After this fix: a clear error message with suggested solutions, or a safe in-place operation with --inplace.
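The same-file detection described above has to catch symlinks and hardlinks, so a plain string comparison of the two paths is not enough. A minimal sketch of such a check, assuming POSIX `stat()` (the function name mirrors the PR's `same_file()`, but the body here is illustrative, not the PR's actual code):

```cpp
#include <sys/stat.h>

// Sketch: two paths refer to the same file iff they resolve to the same
// device and inode. stat() follows symlinks, so "model.gguf" and a symlink
// pointing at it compare equal, as do two hardlinks to one inode.
static bool same_file(const char * a, const char * b) {
    struct stat sa, sb;
    if (stat(a, &sa) != 0 || stat(b, &sb) != 0) {
        return false; // if either path does not exist, there is no collision
    }
    return sa.st_dev == sb.st_dev && sa.st_ino == sb.st_ino;
}
```

Comparing `st_dev`/`st_ino` rather than canonicalized path strings also handles hardlinks, which `realpath()`-based checks miss.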

Fixes #12753

@tamarPal tamarPal requested a review from ggerganov as a code owner November 4, 2025 13:11
tamarPal commented Nov 4, 2025

@slaren @m18coppola
This PR fixes the issue where passing identical input and output file paths to quantization truncated the input file and caused SIGBUS errors.
I've added validation for this case and a safe handling path.
The fix ensures that:

  • The tool now prevents in-place overwrite unless --inplace is explicitly set.

  • Clear error messages are shown to the user in such cases.

Would appreciate a quick review and confirmation that this approach aligns with the project’s intended behavior.
Thanks!

Successfully merging this pull request may close these issues.

Misc. bug: llama-quantize clobbers input file + crashes when output file matches
