[libc++] Switch over to the LLVM-wide premerge test runners. #147794
Conversation
Update the premerge testing system to use the LLVM-wide premerge infrastructure. Also remove libcxx-restart-preempted-jobs.yaml, as this should no longer be needed.
Just testing this for now. Don't review yet.
There seems to be a problem: things are being queued rather than running. I need to figure this out.
You should rebase onto
It looks like the runner version in your container image is too old for the runner to connect to GitHub. We are working on a solution to update the runner version in the container image (it's a work in progress). Ideally we will update the runner version without having to bump any of the tool or library versions.
To put a name to it, this is the issue we discussed during the initial libc++ monthly meeting. I struggled to install the runner binaries into the container image manually because, at the time, the manual installation didn't support auto-scaling ephemeral runners. I'm hopeful the additional maturity of GitHub Actions has resolved this.
It seems like that has changed. We've been manually installing the runner binaries for the monorepo-wide premerge since we built out the infrastructure.
I would take a look at the documentation regarding the minimal runner container image. I think we should try to mirror that setup as closely as we can. I worry about the possible subtle effects of omitting something like that. But otherwise I think the LLVM image approach would suit libc++ as well.
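As a purely illustrative sketch (not part of this change): if something does turn out to be missing from the new runner image, a job could keep the new runner group while temporarily re-pinning the old container. The runner label, image tag, and checkout pin below are all taken from the diff in this PR; the fragment would sit under the existing `jobs:` key.

```yaml
# Hypothetical fallback sketch: stay on the LLVM-wide runner group but
# re-pin the previous libc++ builder container if a needed tool is missing
# from the runner image.
stage1:
  if: github.repository_owner == 'llvm'
  runs-on: llvm-premerge-libcxx-runners
  container: ghcr.io/llvm/libcxx-linux-builder:2b57ebb50b6d418e70382e655feaa619b558e254
  steps:
    - uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2
```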
This is looking awesome!
This looks ready to me. Once it's converted from a draft, I'll add my stamp.
Done. :-)
@llvm/pr-subscribers-github-workflow

Author: None (cmtice)

Changes

Update the premerge testing system to use the LLVM-wide premerge infrastructure. Also remove libcxx-restart-preempted-jobs.yaml, as it should no longer be needed.

Full diff: https://github.com/llvm/llvm-project/pull/147794.diff

2 Files Affected:
- (modified) .github/workflows/libcxx-build-and-test.yaml
- (removed) .github/workflows/libcxx-restart-preempted-jobs.yaml
diff --git a/.github/workflows/libcxx-build-and-test.yaml b/.github/workflows/libcxx-build-and-test.yaml
index f0bdf6c0b5899..ec937de02ca1a 100644
--- a/.github/workflows/libcxx-build-and-test.yaml
+++ b/.github/workflows/libcxx-build-and-test.yaml
@@ -36,8 +36,7 @@ concurrency:
jobs:
stage1:
if: github.repository_owner == 'llvm'
- runs-on: libcxx-self-hosted-linux
- container: ghcr.io/llvm/libcxx-linux-builder:b060022103f551d8ca1dad84122ef73927c86512
+ runs-on: llvm-premerge-libcxx-runners
continue-on-error: false
strategy:
fail-fast: false
@@ -74,8 +73,7 @@ jobs:
**/crash_diagnostics/*
stage2:
if: github.repository_owner == 'llvm'
- runs-on: libcxx-self-hosted-linux
- container: ghcr.io/llvm/libcxx-linux-builder:2b57ebb50b6d418e70382e655feaa619b558e254
+ runs-on: llvm-premerge-libcxx-runners
needs: [ stage1 ]
continue-on-error: false
strategy:
@@ -149,21 +147,20 @@ jobs:
'generic-static',
'bootstrapping-build'
]
- machine: [ 'libcxx-self-hosted-linux' ]
+ machine: [ 'llvm-premerge-libcxx-runners' ]
include:
- config: 'generic-cxx26'
- machine: libcxx-self-hosted-linux
+ machine: llvm-premerge-libcxx-runners
- config: 'generic-asan'
- machine: libcxx-self-hosted-linux
+ machine: llvm-premerge-libcxx-runners
- config: 'generic-tsan'
- machine: libcxx-self-hosted-linux
+ machine: llvm-premerge-libcxx-runners
- config: 'generic-ubsan'
- machine: libcxx-self-hosted-linux
+ machine: llvm-premerge-libcxx-runners
# Use a larger machine for MSAN to avoid timeout and memory allocation issues.
- config: 'generic-msan'
- machine: libcxx-self-hosted-linux
+ machine: llvm-premerge-libcxx-runners
runs-on: ${{ matrix.machine }}
- container: ghcr.io/llvm/libcxx-linux-builder:2b57ebb50b6d418e70382e655feaa619b558e254
steps:
- uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2
- name: ${{ matrix.config }}
diff --git a/.github/workflows/libcxx-restart-preempted-jobs.yaml b/.github/workflows/libcxx-restart-preempted-jobs.yaml
deleted file mode 100644
index accb84efb5c90..0000000000000
--- a/.github/workflows/libcxx-restart-preempted-jobs.yaml
+++ /dev/null
@@ -1,158 +0,0 @@
-name: Restart Preempted Libc++ Workflow
-
-# The libc++ builders run on preemptable VMs, which can be shutdown at any time.
-# This workflow identifies when a workflow run was canceled due to the VM being preempted,
-# and restarts the workflow run.
-
-# We identify a canceled workflow run by checking the annotations of the check runs in the check suite,
-# which should contain the message "The runner has received a shutdown signal."
-
-# Note: If a job is both preempted and also contains a non-preemption failure, we do not restart the workflow.
-
-on:
- workflow_run:
- workflows: [Build and Test libc\+\+]
- types:
- - completed
-
-permissions:
- contents: read
-
-jobs:
- restart:
- if: github.repository_owner == 'llvm' && (github.event.workflow_run.conclusion == 'failure')
- name: "Restart Job"
- permissions:
- statuses: read
- checks: write
- actions: write
- runs-on: ubuntu-24.04
- steps:
- - name: "Restart Job"
- uses: actions/github-script@60a0d83039c74a4aee543508d2ffcb1c3799cdea #v7.0.1
- with:
- script: |
- // The "The run was canceled by" message comes from a user manually canceling a workflow
- // the "higher priority" message comes from github canceling a workflow because the user updated the change.
- // And the "exit code 1" message indicates a genuine failure.
- const failure_regex = /(Process completed with exit code 1.)/
- const preemption_regex = /(The runner has received a shutdown signal)|(The operation was canceled)/
-
- const wf_run = context.payload.workflow_run
- core.notice(`Running on "${wf_run.display_title}" by @${wf_run.actor.login} (event: ${wf_run.event})\nWorkflow run URL: ${wf_run.html_url}`)
-
-
- async function create_check_run(conclusion, message) {
- // Create a check run on the given workflow run to indicate if
- // we are restarting the workflow or not.
- if (conclusion != 'success' && conclusion != 'skipped' && conclusion != 'neutral') {
- core.setFailed('Invalid conclusion: ' + conclusion)
- }
- await github.rest.checks.create({
- owner: context.repo.owner,
- repo: context.repo.repo,
- name: 'Restart Preempted Job',
- head_sha: wf_run.head_sha,
- status: 'completed',
- conclusion: conclusion,
- output: {
- title: 'Restarted Preempted Job',
- summary: message
- }
- })
- }
-
- console.log('Listing check runs for suite')
- const check_suites = await github.rest.checks.listForSuite({
- owner: context.repo.owner,
- repo: context.repo.repo,
- check_suite_id: context.payload.workflow_run.check_suite_id,
- per_page: 100 // FIXME: We don't have 100 check runs yet, but we should handle this better.
- })
-
- check_run_ids = [];
- for (check_run of check_suites.data.check_runs) {
- console.log('Checking check run: ' + check_run.id);
- if (check_run.status != 'completed') {
- console.log('Check run was not completed. Skipping.');
- continue;
- }
- if (check_run.conclusion != 'failure') {
- console.log('Check run had conclusion: ' + check_run.conclusion + '. Skipping.');
- continue;
- }
- check_run_ids.push(check_run.id);
- }
-
- has_preempted_job = false;
-
- for (check_run_id of check_run_ids) {
- console.log('Listing annotations for check run: ' + check_run_id);
-
- annotations = await github.rest.checks.listAnnotations({
- owner: context.repo.owner,
- repo: context.repo.repo,
- check_run_id: check_run_id
- })
-
- // For temporary debugging purposes to see the structure of the annotations.
- console.log(annotations);
-
- has_failed_job = false;
- saved_failure_message = null;
-
- for (annotation of annotations.data) {
- if (annotation.annotation_level != 'failure') {
- continue;
- }
-
- const preemption_match = annotation.message.match(preemption_regex);
-
- if (preemption_match != null) {
- console.log('Found preemption message: ' + annotation.message);
- has_preempted_job = true;
- }
-
- const failure_match = annotation.message.match(failure_regex);
- if (failure_match != null) {
- has_failed_job = true;
- saved_failure_message = annotation.message;
- }
- }
- if (has_failed_job && (! has_preempted_job)) {
- // We only want to restart the workflow if all of the failures were due to preemption.
- // We don't want to restart the workflow if there were other failures.
- //
- // However, libcxx runners running inside docker containers produce both a preemption message and failure message.
- //
- // The desired approach is to ignore failure messages which appear on the same job as a preemption message
- // (An job is a single run with a specific configuration, ex generic-gcc, gcc-14).
- //
- // However, it's unclear that this code achieves the desired approach, and it may ignore all failures
- // if a preemption message is found at all on any run.
- //
- // For now, it's more important to restart preempted workflows than to avoid restarting workflows with
- // non-preemption failures.
- //
- // TODO Figure this out.
- core.notice('Choosing not to rerun workflow because we found a non-preemption failure' +
- 'Failure message: "' + saved_failure_message + '"');
- await create_check_run('skipped', 'Choosing not to rerun workflow because we found a non-preemption failure\n'
- + 'Failure message: ' + saved_failure_message)
- return;
- }
- }
-
- if (!has_preempted_job) {
- core.notice('No preempted jobs found. Not restarting workflow.');
- await create_check_run('neutral', 'No preempted jobs found. Not restarting workflow.')
- return;
- }
-
- core.notice("Restarted workflow: " + context.payload.workflow_run.id);
- await github.rest.actions.reRunWorkflowFailedJobs({
- owner: context.repo.owner,
- repo: context.repo.repo,
- run_id: context.payload.workflow_run.id
- })
- await create_check_run('success', 'Restarted workflow run due to preempted job')
Done -- it's ready for official review now. :-)
Just FYI: the container image we're using here is a bit of a hack, since we needed to bump the runner binary version. I took the existing container, deleted the runner binary, reinstalled it, and uploaded the result to the registry so that we could use it:
This shouldn't be an issue when building the next container, as #147831 has already landed.
LGTM.
Maybe not worth reiterating at this point, since I think everyone is on the same page, but let's make sure to wait until the libc++ maintainers have taken a look before landing.
I'm comfortable approving this for the project since rolling it back is trivial.
We should monitor the change to ensure it can handle the load, but otherwise I think this is likely to stick.
There are still a few things we should figure out after this lands. In particular: how will the container images get updated, who's responsible, and what can we do to make this process as automated as possible? As it stands now, upgrading the container is still a manual process and a bit of a pain. With some work we can clean up our Dockerfile to better support rebuilding without affecting the compiler/tool versions. Until then, the process remains manual, and we should make sure it happens on a regular cadence.
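To make that concrete, here is a rough sketch of what an automated rebuild could look like: a scheduled workflow that rebuilds and pushes the builder image with the runner version pinned as a build argument, so compiler and tool versions stay untouched. The workflow itself, the Dockerfile path, and the RUNNER_VERSION build argument are assumptions for illustration only, not existing pieces of the repository.

```yaml
# Hypothetical sketch only: none of this exists in the repository today.
# It assumes the builder Dockerfile accepts a RUNNER_VERSION build argument.
name: Rebuild libc++ builder image (sketch)
on:
  schedule:
    - cron: '0 6 * * 1'  # weekly rebuild, Monday 06:00 UTC
  workflow_dispatch:
permissions:
  packages: write
jobs:
  rebuild-image:
    if: github.repository_owner == 'llvm'
    runs-on: ubuntu-24.04
    steps:
      - uses: actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683 # v4.2.2
      - name: Build and push builder image
        run: |
          # Log in to the GitHub container registry with the workflow token.
          echo "${{ secrets.GITHUB_TOKEN }}" | docker login ghcr.io -u "${{ github.actor }}" --password-stdin
          # Rebuild with only the runner binary version bumped; the Dockerfile
          # path and build-argument name are assumptions.
          docker build \
            --build-arg RUNNER_VERSION="<pinned runner release>" \
            -t "ghcr.io/llvm/libcxx-linux-builder:${GITHUB_SHA}" \
            -f libcxx/utils/ci/Dockerfile .
          docker push "ghcr.io/llvm/libcxx-linux-builder:${GITHUB_SHA}"
```

Where such a workflow should live, and who owns it, is exactly the ownership question raised above.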
Do we have documentation explaining the names and purposes of the three runner groups, and how libc++ developers should interact with or update them?
Not yet -- where should this documentation go?
I was hoping to get to this refactoring by the end of the week.
Probably under libcxx/docs. I don't have a strong preference. I'm not sure if it fits well into existing files/sections, but in that case you can add a new rst file.
Amazing! It's been a long-standing wart.
And now we just wait for users to rebase their pull requests (I forgot about that part). I'm tempted to turn down the existing runners (but not dismantle them) to "encourage" users to rebase. @ldionne, would you be OK with this approach?
They shouldn't need to. GitHub PRs get tested as if they were merged into
This is certainly true of some workflows, but I don't know if it's true for this one. When we were testing this change, it used the workflow from the PR. I also tried re-running the nightly build, and it used the old workflow. Here's a test PR I created: #148029
EDIT: So you're totally right. It's weird, though: if you view the "workflow" file on the GitHub Actions page for the PR, you'll see the old runner names, but if you look at the runners actually being assigned, they're from the new runner groups.
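One low-tech way to confirm which machine actually picked up a job, regardless of what the rendered workflow file shows, is to print the built-in runner context from a step. This is just a debugging sketch, not something the PR adds:

```yaml
# Hypothetical debugging step: echo the runner that was actually assigned.
- name: Show assigned runner
  run: echo "Assigned runner: ${{ runner.name }} (${{ runner.os }}/${{ runner.arch }})"
```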
Since @boomanaiden154 corrected my mistake about which runners are used for old changes, I'll propose a new plan. I'm going to wait 24 hours, then disable the old runners entirely. I'll watch the CI over the weekend, and if the old runners become needed, I'll be ready to scale up. Then we can discuss when to remove the old runner setup for good.
@EricWF I won't oppose your plan; however, I would suggest being less aggressive and waiting maybe a week before we touch anything. I don't think we lose anything by waiting a bit more, but we potentially make it easier to Ctrl-Z in case things go wrong. Your decision, though!
Thanks a lot to both @boomanaiden154 and @cmtice for getting this ready and merged; I'm excited to see the new system in action!
Update the premerge testing system to use the LLVM-wide premerge infrastructure. Also remove libcxx-restart-preempted-jobs.yaml, as this should no longer be needed.