Add macros for mutex use around libc functions that need them by khwilliamson · Pull Request #24098 · Perl/perl5

khwilliamson · 2026-01-18T04:29:17Z

This replaces #22283
This file is intended to insulate C code from thread issues, when used as directed.

This commit merely creates the file; it is not #included yet.

This set of changes requires a perldelta entry, and I will write it before merging this

khwilliamson · 2026-01-18T14:59:51Z

Why does this generate:
#  error_foo_not suitable...
rather than
#  error foo_not suitable...
? The first may produce an error, but it might not include the text of the symbol.

The second will produce the requested message, though it might be better to quote it, which will prevent macro replacement, allowing you to avoid the need for all_the_underscores.

khwilliamson · 2026-01-18T15:28:50Z

The answer is that it turns out that #error is not valid as an expansion of a #define. So, I got rid of the # and the symbol should show up including 'foo'

khwilliamson · 2026-01-18T17:19:57Z

@tonycoz wrote:

Two issues, in general:
1. This defines a huge number of names most of which will never be used, and have a potential to conflict with names from other libraries.
I think this should add a PERL_ prefix to each lock name.

ok

2. This defines separate symbols for each function in a closely related group of functions, for example `dbm_*` and any read locale function.
The separate macros imply the need to do each of those locks, even though in some cases it doesn't make sense, eg. if I do a dbm_delete(), I really don't want to release the lock until I do dbm_error().

I think macros should be defined for groups of such functions where it makes sense, eg. a single macro for the dbm_* functions, and just document the need for LC_*_LOCK for locale reading functions etc.

I don't know what to do about this. First, for the casual reader, these all expand to no-ops except on a threaded build.

This commit doesn't have anything about how these locks are actually implemented. My proposed implementation, I believe avoids the possibility of deadlock, and doesn't introduce any new mutexes, which doing would make deadlock possible. These locks look like they have three possible mutexes, the existing ENV and LOCALE ones, plus a GENeric one. That one is mapped to either of the other two, depending on the Configuration. It turns out on systems with thread-safe locales, the LOCALE mutex is essentially unused, so is available for the generic one. Otherwise, that mutex can get used a lot, and so the ENV mutex is used instead, as we expect changes to the environment to be fairly rare. The ENV and LOCALE mutexes have co-existed for quite a few releases now without deadlock reports. Deadlock is avoided by always doing things in a particular order.

By using a common mutex for unrelated libc calls, threads block that wouldn't if there were a mutex for just the related libc calls. For example asctime needs to lock out threads that are simultaneously executing itself or ctime and nothing else.. But this implementation locks out threads executing a whole host of other libc calls, such as drand48. If there were a mutex for each possible group, you would find that there would be overlap, so that to fully protect such a group would require multiple mutexes, which greatly increases the likelihood of deadlock occurring. My belief is that the performance is going to be acceptable when the common mutex is just for a single libc call that returns quickly.

But if we use a common mutex for a series of operations, unlocking it at the end, threads using unrelated operations which use the same mutex are locked out for the duration. It would be better I think to have a new mutex for just the series. This would entail that the code using it be carefully crafted to not have deadlock potential.

And, then would there be a mutex for each type of series, like dbm and pwent'?

I do have text in the comments indicating there is an issue here. But maybe the solution is to just not generate locks for functions that are part of a series

tonycoz · 2026-01-20T04:00:52Z

My belief is that the performance is going to be acceptable when the common mutex is just for a single libc call that returns quickly.

The problem is that something like:

PERL_DBM_STORE_LOCK;
int status = dbm_store(db, key, value, DBM_INSERT);
PERL_DBM_STORE_UNLOCK;
if (status < 0) {
  PERL_DBM_ERROR_LOCK;
  printf("error %d\n", dbm_error(db));
  PERL_DBM_ERROR_UNLOCK;
}

is invalid, since other operations may have occurred in the gaps between locks to invalidate the stored error value.

Now the user could just lock everything they need in a block:

PERL_DBM_STORE_LOCK;
PERL_DBM_ERROR_LOCK;
int status = dbm_store(db, key, value, DBM_INSERT);
if (status < 0) {
  printf("error %d\n", dbm_error(db));
}
PERL_DBM_ERROR_UNLOCK;
PERL_DBM_STORE_UNLOCK;

but that's wasteful, and annoying.

Note that the extended span of the lock here is about guarding access to a working API, not just about preventing crashes that might occur if there were multiple concurrent dbm_store()s in progress.

khwilliamson · 2026-01-20T22:42:40Z

I removed the locks for functions that need to be used in conjunction with other ones as an atomic unit

tonycoz · 2026-02-05T02:40:05Z

regen/lock_definitions.pl

+print $l <<EOT;
+/* This file contains macros to wrap their respective libc uses to ensure that
+ * those uses are thread-safe in a multi-threaded environment.  It is ordered
+ * alphabetically by fucntion name.


fucntion name

tonycoz · 2026-02-05T03:05:40Z

This commit merely creates the file; it is not #included yet.

I assume (hope?) it will be perl's headers including this.

In any case, I think lock_definitions.h should have a perl related name to avoid conflicts with headers from other sources.

Whether that's perl_locks,h, perl_lock_definitions.h I'm not too worried.

khwilliamson · 2026-02-05T22:10:23Z

I have now added commits intended to complete this pull request. Previously, only the file lock_definitions.h was included. This #includes it in perl.h and creates two different implementations for them, and adds some usage for them in POSIX.xs. Still needed are changes to perlclib, perlapi, and perldelta to refer to these.

The intent is that an XS writer or perl core writer can wrap calls to libc functions with the appropriate macros without needing to understand the nuances, and get thread-safe operation without the possibility of deadlock

One implementation is for when there are thread-safe locales. In this case, an existing mutex is essentially unused, and can be repurposed. The other implementation works, with somewhat less performance, otherwise.

The file is not intended to be included directly into XS space, but the definitions are always there to anyone who includes perl.h

This file is intended to insulate C code from thread issues, when used as directed. This commit merely creates the file; it is not #included yet.

These symbols aren't really documented, and we're not about to. So move them to the resolved list.

This is for debugging mutex locking/unlocking. Convert some locale debug statements that are really about locking to use this. This is not compile in by default because of the overhead during time-critical operations. It requires -Accflags=-DPERL_DEBUG_MUTEXES to be passed to Configure. Its use is for very low level problems. It is incompatible with being compiled to have mem log. Putting that logic in its definition simplifies some code a bit.

This is in preparation for the next commit when it otherwise would be confusing. The terminology is also changed to exclusive lock, as that is clearer.

Make the unlock mate of its corresponding lock adjacent to it

This sets xcounter unconditionally, but now after the lock. It's a small thing, but I think it is slightly clearer

Perl has mutexes to prevent cooperating threads from colliding in accessing shared resources. This requires all threads to use these; a rogue thread that doesn't follow the protocol can wreak havoc. We can make sure that the perl core fully cooperates, but an XS module could go rogue. A thread can declare that it accesses the resource only to query it, and will not change it in any way. Any number of threads can be accessing the resource in this manner at the same time without fear of collisions. We say that such threads have read-only access. A thread can also declare that it will be changing the resource. Only one thread can safely be doing this at a time. It has exclusive access. Perl has macros to call to lock and unlock the resource for both types of access. While a thread has exclusive access, any other thread wanting either type of access will hang until the first thread releases it. Conversely, if a thread wants exclusive access, it will hang until all the threads that have read-only access release their locks. Any number of threads may have simultaneous read-only locks. The macros allow a thread to lock a resource while already holding it. They are called general semaphores or recursive locks. Only when the unlock matching the first lock is executed does the resource get released. Locking the rest of the system out from a resource should be done for the shortest time possible to prevent bottlenecks. A recursive lock could encourage the bad habit of locking it longer than the minimum possible. But these locks have been added to existing code. It would be a tremendous amount of effort to change that code, hence the recursiveness. This commit temporarily removes a few DEBUG_K statements. This is so that the logic of the code can be seen without this distraction. The next commit adds them, and more like them, back in. Commit 262c141 added two unused macros for future use. It turns out this was somewhat premature, because the implementation was buggy (except they're not called). This new commit fixes that, and the next few commits will finally start to use these macros. The problem was the interaction with exclusive locking attempts. If a thread owns an exclusive lock, and then recursively asks for a read one, that should trivially succeed. But instead it would hang, depending on the system's locking implementation. The solution is to add a per-thread bookkeeping counter ourselves, so we bypass the system's in this situation. Conversely, an attempt to add an exclusive lock when already holding a read lock was't handled properly. I discovered via testing that this can lead to deadlock (the comments added in this commit explain the scenario). This commit hence panics when such an attempt is made. The panic is to try to make sure that code that can trigger this doesn't get shipped.

Now that we have a general reentrant read-only lock, use it. This prepares for future commits

A lock by this name has existed for a few releases, but it actually was an exclusive lock. Previous commits have added the infrastructure to do an actual read-only lock. Take advantage of it.

That header file maps libc calls that need mutex locking into various macros, based on the characteristics of each libc call. But it leaves it to perl.h to define those various expansions. This commit does that, based on the Configuration of the build.

This is clearer code.

The many macros in this file now all begin with "PERL_" to avoid namespace pollution. However, a few names already existed in previous Perl releases without that prefix. This commit allows those names to be retained, along with the new spellings.

This commit could break something that relies on these, but it would be broken anyway because these locks are ineffective. The reason is that these functions need to be used in conjunction with other functions in a critical section, and the locks are not designed for that. Using them could give a false sense of security.

The mutex lock macros in this commit are retaine for backwards compatibility, but are now formulated in terms of their new names

This makes it more convenient for a reader to find a line

This code had gotten fragmented. Put as much stuff controlled by the same #ifdef logic under a single #ifdef

Since 5335601, there has been the capability of having nested locks, where the inner one is necessary in all conditions, and the outer one is desirable in just some conditions. But this hasn't been enabled because in some Configurations it could cause a deadlock, with the thread locking the ENV mutex, and then in a nested call trying to lock it again. The previous commit made this mutex reentrant, so the deadlock is gone.

Use these new macros to assure thread safety

khwilliamson mentioned this pull request Jan 18, 2026

Add lock_definitions.h #22283

Closed

khwilliamson marked this pull request as draft January 18, 2026 04:31

khwilliamson marked this pull request as ready for review January 18, 2026 17:20

khwilliamson force-pushed the lock_definitions.h branch from 3445caf to b1a36ac Compare January 18, 2026 17:30

khwilliamson force-pushed the lock_definitions.h branch from b1a36ac to 4d0fba5 Compare January 20, 2026 21:21

khwilliamson force-pushed the lock_definitions.h branch from 4d0fba5 to 5e6ee19 Compare January 25, 2026 13:28

tonycoz reviewed Feb 5, 2026

View reviewed changes

khwilliamson force-pushed the lock_definitions.h branch from 5e6ee19 to cbbec68 Compare February 5, 2026 21:51

khwilliamson changed the title ~~Add lock_definitions.h~~ Add macros for mutex use around libc functions that need them Feb 5, 2026

khwilliamson force-pushed the lock_definitions.h branch 2 times, most recently from d08ee38 to 1a940d6 Compare February 5, 2026 23:27

khwilliamson added 12 commits February 5, 2026 20:20

Add lock_definitions.h

2b10d1a

This file is intended to insulate C code from thread issues, when used as directed. This commit merely creates the file; it is not #included yet.

embed.pl: Mark DEBUG_foo symbols as resolved visibility

6166873

These symbols aren't really documented, and we're not about to. So move them to the resolved list.

Use DEBUG_K in thread.h

0616937

perl.h: Change formal macro parameter name

dfb7963

This is in preparation for the next commit when it otherwise would be confusing. The terminology is also changed to exclusive lock, as that is clearer.

perl.h: Add assertion

deaa972

perl.h: Swap macro definition order

d087b28

Make the unlock mate of its corresponding lock adjacent to it

perl.h: Swap two lines

288da54

This sets xcounter unconditionally, but now after the lock. It's a small thing, but I think it is slightly clearer

Add DEBUG_K statements to mutex locking macros

242bcf6

Use reentrant lock for ENV read lock

0eb6ae4

Now that we have a general reentrant read-only lock, use it. This prepares for future commits

perl.h: Create an actual LOCALE_READ_LOCK

eae4eb2

A lock by this name has existed for a few releases, but it actually was an exclusive lock. Previous commits have added the infrastructure to do an actual read-only lock. Take advantage of it.

khwilliamson added 15 commits February 5, 2026 20:20

perl.h: Rework WSETLOCALE_LOCK,POSIX_SETLOCALE_LOCK

131254f

This is clearer code.

perl.h: Rework backward compatibility locks

2274f84

The mutex lock macros in this commit are retaine for backwards compatibility, but are now formulated in terms of their new names

perl.h: Sort some #defines

d1729a7

This makes it more convenient for a reader to find a line

perl.h: Rearrange some initialization code

5f39e15

This code had gotten fragmented. Put as much stuff controlled by the same #ifdef logic under a single #ifdef

POSIX.xs: Add locks to libc functions missing them

4bd428d

Use these new macros to assure thread safety

POSIX.xs: Use new names for lock macros

fde1542

pp_sys.c: Use new mutex lock names

067bb72

pp_sys.c: Add missing mutex locks

bd980f8

locale.c locks

e9ea85e

locale.c change locks

52d94ae

win32.c

8a7ef04

khwilliamson force-pushed the lock_definitions.h branch from 1a940d6 to 5d6c2ab Compare February 6, 2026 03:36

fixups

fe51b79

khwilliamson force-pushed the lock_definitions.h branch from 5d6c2ab to fe51b79 Compare February 6, 2026 04:04

khwilliamson marked this pull request as draft February 6, 2026 13:24

fixups

283c699

khwilliamson force-pushed the lock_definitions.h branch from aa693a7 to 283c699 Compare February 6, 2026 17:44

fixups

9366ad6

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add macros for mutex use around libc functions that need them#24098

Add macros for mutex use around libc functions that need them#24098
khwilliamson wants to merge 30 commits intoPerl:bleadfrom
khwilliamson:lock_definitions.h

khwilliamson commented Jan 18, 2026 •

edited

Loading

Uh oh!

khwilliamson commented Jan 18, 2026

Uh oh!

khwilliamson commented Jan 18, 2026

Uh oh!

khwilliamson commented Jan 18, 2026

Uh oh!

tonycoz commented Jan 20, 2026

Uh oh!

khwilliamson commented Jan 20, 2026

Uh oh!

tonycoz Feb 5, 2026

Uh oh!

khwilliamson Feb 5, 2026

Uh oh!

tonycoz commented Feb 5, 2026

Uh oh!

khwilliamson commented Feb 5, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

khwilliamson commented Jan 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

khwilliamson commented Jan 18, 2026

Uh oh!

khwilliamson commented Jan 18, 2026

Uh oh!

khwilliamson commented Jan 18, 2026

Uh oh!

tonycoz commented Jan 20, 2026

Uh oh!

khwilliamson commented Jan 20, 2026

Uh oh!

tonycoz Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

khwilliamson Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

tonycoz commented Feb 5, 2026

Uh oh!

khwilliamson commented Feb 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

khwilliamson commented Jan 18, 2026 •

edited

Loading

khwilliamson commented Feb 5, 2026 •

edited

Loading