[libc++] Fix broken precondition of __bit_log2 #155476

ldionne · 2025-08-26T19:33:14Z

In #135303, we started using __bit_log2 instead of __log2i inside std::sort. However, __bit_log2 has a precondition that __log2i didn't have, which is that the input is non-zero. While it technically makes no sense to request the logarithm of 0, __log2i handled that case and returned 0 without issues.

After switching to __bit_log2, passing 0 as an input results in an unsigned integer overflow which can trigger -fsanitize=unsigned-integer-overflow. While not technically UB in itself, it's clearly not intended either.

To fix this, we add an internal assertion to __bit_log2 which catches the issue in our test suite, and we make sure not to violate __bit_log2's preconditions before we call it from std::sort.

In llvm#135303, we started using __bit_log2 instead of __log2i inside `std::sort`. However, __bit_log2 has a precondition that __log2i didn't have, which is that the input is non-zero. While it technically makes no sense to request the logarihm of 0, __log2i handled that case and returned 0 without issues. After switching to __bit_log2, passing 0 as an input results in an unsigned integer overflow which can trigger -fsanitize=unsigned-integer-overflow. While not technically UB in itself, it's clearly not intended either. To fix this, we add an internal assertion to __bit_log2 which catches the issue in our test suite, and we make sure not to violate __bit_log2's preconditions before we call it from `std::sort`.

llvmbot · 2025-08-26T19:33:43Z

@llvm/pr-subscribers-libcxx

Author: Louis Dionne (ldionne)

Changes

In #135303, we started using __bit_log2 instead of __log2i inside std::sort. However, __bit_log2 has a precondition that __log2i didn't have, which is that the input is non-zero. While it technically makes no sense to request the logarihm of 0, __log2i handled that case and returned 0 without issues.

After switching to __bit_log2, passing 0 as an input results in an unsigned integer overflow which can trigger -fsanitize=unsigned-integer-overflow. While not technically UB in itself, it's clearly not intended either.

To fix this, we add an internal assertion to __bit_log2 which catches the issue in our test suite, and we make sure not to violate __bit_log2's preconditions before we call it from std::sort.

Full diff: https://github.com/llvm/llvm-project/pull/155476.diff

3 Files Affected:

(modified) libcxx/include/__algorithm/sort.h (+3)
(modified) libcxx/include/__bit/bit_log2.h (+2)
(modified) libcxx/src/algorithm.cpp (+3)

diff --git a/libcxx/include/__algorithm/sort.h b/libcxx/include/__algorithm/sort.h
index 06cb5b8ce7057..c6587b065e837 100644
--- a/libcxx/include/__algorithm/sort.h
+++ b/libcxx/include/__algorithm/sort.h
@@ -860,6 +860,9 @@ __sort<__less<long double>&, long double*>(long double*, long double*, __less<lo
 template <class _AlgPolicy, class _RandomAccessIterator, class _Comp>
 _LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_SINCE_CXX20 void
 __sort_dispatch(_RandomAccessIterator __first, _RandomAccessIterator __last, _Comp& __comp) {
+  if (__first == __last)
+    return;
+
   typedef typename iterator_traits<_RandomAccessIterator>::difference_type difference_type;
   difference_type __depth_limit = 2 * std::__bit_log2(std::__to_unsigned_like(__last - __first));
 
diff --git a/libcxx/include/__bit/bit_log2.h b/libcxx/include/__bit/bit_log2.h
index 8077cd91d6fd7..9ceeec1b2bc94 100644
--- a/libcxx/include/__bit/bit_log2.h
+++ b/libcxx/include/__bit/bit_log2.h
@@ -9,6 +9,7 @@
 #ifndef _LIBCPP___BIT_BIT_LOG2_H
 #define _LIBCPP___BIT_BIT_LOG2_H
 
+#include <__assert>
 #include <__bit/countl.h>
 #include <__config>
 #include <__type_traits/integer_traits.h>
@@ -23,6 +24,7 @@ _LIBCPP_BEGIN_NAMESPACE_STD
 template <class _Tp>
 _LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_SINCE_CXX14 _Tp __bit_log2(_Tp __t) _NOEXCEPT {
   static_assert(__is_unsigned_integer_v<_Tp>, "__bit_log2 requires an unsigned integer type");
+  _LIBCPP_ASSERT_INTERNAL(__t != 0, "logarithm of 0 is undefined");
   return numeric_limits<_Tp>::digits - 1 - std::__countl_zero(__t);
 }
 
diff --git a/libcxx/src/algorithm.cpp b/libcxx/src/algorithm.cpp
index d388fee5f99cc..2cf076759d3f8 100644
--- a/libcxx/src/algorithm.cpp
+++ b/libcxx/src/algorithm.cpp
@@ -13,6 +13,9 @@ _LIBCPP_BEGIN_NAMESPACE_STD
 
 template <class Comp, class RandomAccessIterator>
 void __sort(RandomAccessIterator first, RandomAccessIterator last, Comp comp) {
+  if (first == last)
+    return;
+
   auto depth_limit = 2 * std::__bit_log2(static_cast<size_t>(last - first));
 
   // Only use bitset partitioning for arithmetic types.  We should also check

libcxx/include/__algorithm/sort.h

var-const · 2025-08-26T19:56:08Z

libcxx/include/__algorithm/sort.h

 template <class _AlgPolicy, class _RandomAccessIterator, class _Comp>
 _LIBCPP_HIDE_FROM_ABI _LIBCPP_CONSTEXPR_SINCE_CXX20 void
 __sort_dispatch(_RandomAccessIterator __first, _RandomAccessIterator __last, _Comp& __comp) {
+  if (__first == __last) // don't even try computing the depth


How about

`log(0)` is undefined, so don't try computing the depth

?

philnik777

I'm not a huge fan of this as-is. I've seen multiple people comment about us breaking UBsan for unsigned integers, but nobody seems to care to actually do the legwork and just enable it for all our tests. I'm not against adding an assertion in __bit_log2, but I don't think "people don't want unsigned overflow" is a good reason to do so. Someone should just do the actual work to allow using UBSan with unsigned overflow checking in libc++, since we will run into this again and again otherwise.

ldionne · 2025-08-27T16:58:21Z

I'm not a huge fan of this as-is. I've seen multiple people comment about us breaking UBsan for unsigned integers, but nobody seems to care to actually do the legwork and just enable it for all our tests. I'm not against adding an assertion in __bit_log2, but I don't think "people don't want unsigned overflow" is a good reason to do so. Someone should just do the actual work to allow using UBSan with unsigned overflow checking in libc++, since we will run into this again and again otherwise.

I am purposefully separating this from supporting -fsanitize=unsigned-integer-overflow in the general sense. For example, if we used unsigned integer overflow on purpose in this function, we'd be having a different conversation. Currently, we end up computing an invalid result in __bit_log2 and the only reason why it works is that we bail out from the underlying __introsort function before we try to use the invalid depth, since __first == __last.

In other words, what I'm fixing here is a broken precondition of an existing function that we refactored in what was supposed to be a NFC patch, and ended up not being NFC.

ldionne · 2025-08-28T22:07:49Z

I'm going to merge this so we can get started with the cherry-pick ASAP. This seemingly innocent change had significant fallout and I think we definitely want that fixed in the release.

If we need to do additional follow-ups to get consensus, I'm fine with that, but as-is this prevents us from violating one of our own function's preconditions, so I think it's reasonable.

ldionne · 2025-08-28T22:08:49Z

/cherry-pick 2ae4b92

llvmbot · 2025-08-28T22:15:13Z

/pull-request #155932

philnik777 · 2025-08-29T07:43:32Z

Why do we need to cherry-pick this ASAP? What fallout is there? If the answer is "UBSan complains", I'm very much not happy, since that is exactly what I was complaining about - doing ad-hoc fixes without actually addressing the underlying problem, namely not having CI coverage. Saying "but it violates a precondition" is IMO not a good reason here. The only actual problem we have here AFAICT is UBSan, and that seems to me to be the actual motivation behind this patch.

In llvm#135303, we started using `__bit_log2` instead of `__log2i` inside `std::sort`. However, `__bit_log2` has a precondition that `__log2i` didn't have, which is that the input is non-zero. While it technically makes no sense to request the logarithm of 0, `__log2i` handled that case and returned 0 without issues. After switching to `__bit_log2`, passing 0 as an input results in an unsigned integer overflow which can trigger `-fsanitize=unsigned-integer-overflow`. While not technically UB in itself, it's clearly not intended either. To fix this, we add an internal assertion to `__bit_log2` which catches the issue in our test suite, and we make sure not to violate `__bit_log2`'s preconditions before we call it from `std::sort`. (cherry picked from commit 2ae4b92)

ldionne · 2025-09-17T13:42:50Z

Why do we need to cherry-pick this ASAP? What fallout is there? If the answer is "UBSan complains", I'm very much not happy, since that is exactly what I was complaining about - doing ad-hoc fixes without actually addressing the underlying problem, namely not having CI coverage.

In practice, yes the fallout is that UBSan is complaining (which crashes the program that runs it). Now, we have three options. We introduced a bug in our code and it introduced a regression in something our users were relying on. We can:

Ignore the problem until we have full support and CI coverage for -fsanitize=unsigned-integer-overflow, and then reapply the exact same fix I'm doing here (since it fixes a precondition). Opting for this approach would be taking the hard line and being rigid just for the sake of enforcing that everything we do is covered by CI, knowing full well that we have an underlying bug anyways (the broken precondition) and that we want to get said CI running in the future anyways. And in the meantime, our users would be broken. That is not a good option.
Revert the original refactoring patch that introduced the issue.
Apply this fix which resolves the issue, fixes the broken precondition and moves us forward.

Clearly, I think that either (2) or (3) are better pragmatic decisions, which is why I went for (3).

`libcxx`: backport llvm/llvm-project#155476, llvm/llvm-project#147389, llvm/llvm-project#155786

ldionne added this to the LLVM 21.x Release milestone Aug 26, 2025

ldionne requested a review from a team as a code owner August 26, 2025 19:33

github-project-automation bot added this to LLVM Release Status Aug 26, 2025

github-project-automation bot moved this to Needs Triage in LLVM Release Status Aug 26, 2025

llvmbot added the libc++ libc++ C++ Standard Library. Not GNU libstdc++. Not libc++abi. label Aug 26, 2025

var-const reviewed Aug 26, 2025

View reviewed changes

libcxx/include/__algorithm/sort.h Outdated Show resolved Hide resolved

Add comment

ded5c59

var-const reviewed Aug 26, 2025

View reviewed changes

var-const approved these changes Aug 26, 2025

View reviewed changes

github-project-automation bot moved this from Needs Triage to Needs Merge in LLVM Release Status Aug 26, 2025

frederick-vs-ja approved these changes Aug 27, 2025

View reviewed changes

philnik777 requested changes Aug 27, 2025

View reviewed changes

Improve comment

4b31bf9

ldionne merged commit 2ae4b92 into llvm:main Aug 28, 2025
69 of 78 checks passed

github-project-automation bot moved this from Needs Merge to Done in LLVM Release Status Aug 28, 2025

ldionne deleted the review/fix-sort-ubsan branch August 28, 2025 22:08

dwblaikie mentioned this pull request Sep 16, 2025

port 5b4819e to release #159209

Merged

jhuber6 mentioned this pull request Sep 17, 2025

[LLVM] Update CUDA ELF flags for their new ABI #159451

Merged

alexrp mentioned this pull request Oct 14, 2025

libcxx: backport llvm/llvm-project#155476, llvm/llvm-project#147389, llvm/llvm-project#155786 ziglang/zig#25572

Merged

alexrp added a commit to ziglang/zig that referenced this pull request Oct 14, 2025

Merge pull request #25572 from alexrp/libcxx-backports

27a396d

`libcxx`: backport llvm/llvm-project#155476, llvm/llvm-project#147389, llvm/llvm-project#155786

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[libc++] Fix broken precondition of __bit_log2 #155476

[libc++] Fix broken precondition of __bit_log2 #155476

Uh oh!

ldionne commented Aug 26, 2025 •

edited by var-const

Loading

Uh oh!

llvmbot commented Aug 26, 2025

Uh oh!

Uh oh!

var-const Aug 26, 2025

Uh oh!

philnik777 left a comment

Uh oh!

ldionne commented Aug 27, 2025

Uh oh!

ldionne commented Aug 28, 2025

Uh oh!

Uh oh!

ldionne commented Aug 28, 2025

Uh oh!

llvmbot commented Aug 28, 2025

Uh oh!

philnik777 commented Aug 29, 2025 •

edited

Loading

Uh oh!

ldionne commented Sep 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[libc++] Fix broken precondition of __bit_log2 #155476

[libc++] Fix broken precondition of __bit_log2 #155476

Uh oh!

Conversation

ldionne commented Aug 26, 2025 • edited by var-const Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

llvmbot commented Aug 26, 2025

Uh oh!

Uh oh!

var-const Aug 26, 2025

Choose a reason for hiding this comment

Uh oh!

philnik777 left a comment

Choose a reason for hiding this comment

Uh oh!

ldionne commented Aug 27, 2025

Uh oh!

ldionne commented Aug 28, 2025

Uh oh!

Uh oh!

ldionne commented Aug 28, 2025

Uh oh!

llvmbot commented Aug 28, 2025

Uh oh!

philnik777 commented Aug 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ldionne commented Sep 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

ldionne commented Aug 26, 2025 •

edited by var-const

Loading

philnik777 commented Aug 29, 2025 •

edited

Loading