Commit 4a1c959

pks-t authored and gitster committed
builtin/patch-id: fix uninitialized hash function
In c8aed5e (repository: stop setting SHA1 as the default object hash, 2024-05-07), we have adapted `initialize_repository()` to no longer set up a default hash function. As this function is also used to set up `the_repository`, the consequence is that `the_hash_algo` will now by default be a `NULL` pointer unless the hash algorithm was configured properly. This is done as a mechanism to detect cases where we may be using the wrong hash function by accident.

This change now causes git-patch-id(1) to segfault when it's run outside of a repository. As this command can read diffs from stdin, it does not necessarily need a repository, but then relies on `the_hash_algo` to compute the patch ID itself.

It is somewhat dubious that git-patch-id(1) relies on `the_hash_algo` in the first place. Quoting its manpage:

    A "patch ID" is nothing but a sum of SHA-1 of the file diffs
    associated with a patch, with line numbers ignored. As such, it’s
    "reasonably stable", but at the same time also reasonably unique,
    i.e., two patches that have the same "patch ID" are almost
    guaranteed to be the same thing.

We explicitly document patch IDs to be using SHA-1. Furthermore, patch IDs are supposed to be stable for most of the part. But even with the same input, the patch IDs will now be different depending on the repo's configured object hash.

Work around the issue by setting up SHA-1 when there was no startup repository for now. This is arguably not the correct fix, but for now we rather want to focus on getting the segfault fixed.

Signed-off-by: Patrick Steinhardt <[email protected]>
Signed-off-by: Junio C Hamano <[email protected]>
1 parent abece6e commit 4a1c959

3 files changed: +48 -1 lines changed

builtin/patch-id.c

Lines changed: 13 additions & 0 deletions
@@ -5,6 +5,7 @@
 #include "hash.h"
 #include "hex.h"
 #include "parse-options.h"
+#include "setup.h"
 
 static void flush_current_id(int patchlen, struct object_id *id, struct object_id *result)
 {
@@ -237,6 +238,18 @@ int cmd_patch_id(int argc, const char **argv, const char *prefix)
 	argc = parse_options(argc, argv, prefix, builtin_patch_id_options,
 			     patch_id_usage, 0);
 
+	/*
+	 * We rely on `the_hash_algo` to compute patch IDs. This is dubious as
+	 * it means that the hash algorithm now depends on the object hash of
+	 * the repository, even though git-patch-id(1) clearly defines that
+	 * patch IDs always use SHA1.
+	 *
+	 * NEEDSWORK: This hack should be removed in favor of converting
+	 * the code that computes patch IDs to always use SHA1.
+	 */
+	if (!the_hash_algo)
+		repo_set_hash_algo(the_repository, GIT_HASH_SHA1);
+
 	generate_id_list(opts ? opts > 1 : config.stable,
 			 opts ? opts == 3 : config.verbatim);
 	return 0;
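
The guard added to cmd_patch_id() can be illustrated in isolation. The following is only a minimal sketch of the "fill in a default when the pointer is still NULL" pattern, not Git's actual code: `struct hash_algo`, `struct repo`, `sketch_algos`, `ensure_hash_algo()` and `digest_size()` are hypothetical stand-ins invented for this example, and the digest sizes (20 bytes for SHA-1, 32 for SHA-256) are the only facts taken from the algorithms themselves.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical stand-ins for Git's internal types; not the real API. */
struct hash_algo {
	const char *name;
	size_t rawsz; /* digest size in bytes */
};

static const struct hash_algo sketch_algos[] = {
	{ "sha1", 20 },
	{ "sha256", 32 },
};

struct repo {
	const struct hash_algo *hash_algo; /* NULL until configured */
};

/*
 * Same shape as the guard in the hunk above: only install a default
 * when nothing has been configured, never override an existing choice.
 */
static void ensure_hash_algo(struct repo *r)
{
	assert(r != NULL);
	if (!r->hash_algo)
		r->hash_algo = &sketch_algos[0]; /* fall back to SHA-1 */
}

/*
 * Dereferences r->hash_algo, so it would crash on a NULL pointer.
 * Callers are expected to run ensure_hash_algo() first.
 */
static size_t digest_size(const struct repo *r)
{
	return r->hash_algo->rawsz;
}
```

In this sketch, a `struct repo` whose `hash_algo` is NULL ends up pointing at the SHA-1 entry after `ensure_hash_algo()`, while one already set to SHA-256 is left untouched. That second case mirrors why the commit message calls the workaround "arguably not the correct fix": inside a SHA-256 repository the configured hash is still used.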

t/t1517-outside-repo.sh

Lines changed: 1 addition & 1 deletion
@@ -21,7 +21,7 @@ test_expect_success 'set up a non-repo directory and test file' '
 	git diff >sample.patch
 '
 
-test_expect_failure 'compute a patch-id outside repository (uses SHA-1)' '
+test_expect_success 'compute a patch-id outside repository (uses SHA-1)' '
 	nongit env GIT_DEFAULT_HASH=sha1 \
 		git patch-id <sample.patch >patch-id.expect &&
 	nongit \

t/t4204-patch-id.sh

Lines changed: 34 additions & 0 deletions
@@ -310,4 +310,38 @@ test_expect_success 'patch-id handles diffs with one line of before/after' '
 	test_config patchid.stable true &&
 	calc_patch_id diffu1stable <diffu1
 '
+
+test_expect_failure 'patch-id computes same ID with different object hashes' '
+	test_when_finished "rm -rf repo-sha1 repo-sha256" &&
+
+	cat >diff <<-\EOF &&
+	diff --git a/bar b/bar
+	index bdaf90f..31051f6 100644
+	--- a/bar
+	+++ b/bar
+	@@ -2 +2,2 @@
+	 b
+	+c
+	EOF
+
+	git init --object-format=sha1 repo-sha1 &&
+	git -C repo-sha1 patch-id <diff >patch-id-sha1 &&
+	git init --object-format=sha256 repo-sha256 &&
+	git -C repo-sha256 patch-id <diff >patch-id-sha256 &&
+	test_cmp patch-id-sha1 patch-id-sha256
+'
+
+test_expect_success 'patch-id without repository' '
+	cat >diff <<-\EOF &&
+	diff --git a/bar b/bar
+	index bdaf90f..31051f6 100644
+	--- a/bar
+	+++ b/bar
+	@@ -2 +2,2 @@
+	 b
+	+c
+	EOF
+	nongit git patch-id <diff
+'
+
 test_done
