fix(errors): restore CUDA exception hierarchy to avoid slow string compilation by cpcloud · Pull Request #796 · NVIDIA/numba-cuda

cpcloud · 2026-02-18T19:55:31Z

Summary

Rebuild CUDA error classes in numba_cuda/numba/cuda/core/errors.py as CUDA-local subclasses of corresponding numba.core.errors classes.
Apply a ruff-mandated style fix in numba_cuda/numba/cuda/core/errors.py (_mod.NumbaError = NumbaError) with no behavior change.
Restore narrow CUDA-vs-core exception checks used by compile-time control flow while preserving compatibility for handlers that catch core error types.
Add regression coverage in numba_cuda/numba/cuda/tests/cudapy/test_errors.py that asserts hierarchy invariants across NumbaError descendants.
Sync pixi.lock package entries so local numba-cuda lock metadata matches the current 0.27.0 version.

Problem statement

The redirect in numba_cuda/numba/cuda/core/errors.py made CUDA error names direct aliases of numba.core.errors. That broadened exception checks in CUDA typing/overload resolution, so some compile paths (notably string-heavy cases) stopped short-circuiting and became much slower.

Alternatives considered

The options were evaluated on behavior restoration, compatibility, implementation effort, and maintenance.

Option A: Revert redirect and restore independent CUDA error classes

Mechanism: remove aliasing and reinstate pre-redirect CUDA-local class definitions.
Behavior impact: restores narrow isinstance behavior.
Compatibility impact: may drift from upstream numba.core.errors shape.
Maintenance profile: medium-high ongoing sync burden.

Option B: Keep aliasing and patch compiler callsites

Mechanism: keep shared error identity, patch callsites that rely on narrow CUDA-vs-core checks.
Behavior impact: restores targeted paths where patched.
Compatibility impact: preserves current alias model.
Maintenance profile: medium risk of future regressions as new callsites appear.

Option C: Manually mirror CUDA hierarchy as subclasses of core classes

Mechanism: explicitly define CUDA classes and inheritance mirroring numba.core.errors.
Behavior impact: restores narrow checks and subclass compatibility.
Compatibility impact: explicit and predictable.
Maintenance profile: medium-high upkeep as upstream classes evolve.

Option D: Dynamically mirror hierarchy at import time

Mechanism: generate CUDA-local subclasses for each core NumbaError descendant while preserving parent-child structure.
Behavior impact: restores narrow checks and keeps CUDA exceptions as subclasses of matching core exceptions.
Compatibility impact: preserves expected catch behavior for core exception handlers.
Maintenance profile: lower ongoing sync burden than manual hierarchy mirroring.

Comparison at a glance

Option	Restores narrow checks	Preserves core-compat catches	Initial effort	Ongoing upkeep
A: Independent CUDA classes	Yes	Partial	Low	Medium-High
B: Callsite patches	Partial	Yes	Medium	Medium
C: Manual mirrored hierarchy	Yes	Yes	Medium-High	Medium-High
D: Dynamic mirrored hierarchy	Yes	Yes	Medium	Low-Medium

Selected approach in this PR

This PR implements Option D and adds regression tests to lock in the hierarchy invariants:

issubclass(cuda_error, core_error) remains true for matching classes.
issubclass(core_error, numba.cuda.core.errors.NumbaError) remains false to preserve narrow compile-time gates.

Closes #755

greptile-apps · 2026-02-18T19:55:34Z

Automatic reviews are disabled for this repository.

copy-pr-bot · 2026-02-18T19:55:35Z

Auto-sync is disabled for ready for review pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

cpcloud · 2026-02-18T20:03:28Z

/ok to test

cpcloud · 2026-02-18T20:10:56Z

/ok to test

…compilation (NVIDIA#755) The error module redirect (491f552) replaced the CUDA exception subclass hierarchy with identity aliases to numba.core.errors classes. This broadened isinstance checks throughout the CUDA compiler, causing it to catch upstream exceptions it previously ignored and try far more compilation passes -- resulting in orders-of-magnitude slower compile times for types with many overload candidates (e.g. strings). Use dynamic diamond inheritance to create local exception subclasses: for every upstream NumbaError descendant, a local class is created that inherits from both the local parent and the upstream class. This restores the narrow isinstance semantics the compiler relies on while preserving the user-facing catch semantics that 491f552 introduced. Co-authored-by: Cursor <cursoragent@cursor.com>

Add a focused hierarchy invariants test to ensure CUDA error classes remain local subclasses of core errors while preserving narrow core-vs-CUDA isinstance behavior relied on by compile-time control flow. Co-authored-by: Cursor <cursoragent@cursor.com>

Update local numba-cuda package entries in pixi.lock to the current 0.27.0 version so the lockfile metadata stays consistent with this branch. Co-authored-by: Cursor <cursoragent@cursor.com>

Use direct attribute assignment for `NumbaError` on the redirected module to satisfy ruff while preserving existing error hierarchy behavior. Co-authored-by: Cursor <cursoragent@cursor.com>

…le-pr

gmarkall · 2026-02-27T20:10:10Z

/ok to test

rparolin added this to the numba-cuda backlog milestone Feb 18, 2026

cpcloud and others added 4 commits February 20, 2026 14:38

chore(lock): sync pixi lock numba-cuda version entries

41c4f57

Update local numba-cuda package entries in pixi.lock to the current 0.27.0 version so the lockfile metadata stays consistent with this branch. Co-authored-by: Cursor <cursoragent@cursor.com>

style(errors): apply ruff autofix in CUDA errors module

60493cf

Use direct attribute assignment for `NumbaError` on the redirected module to satisfy ruff while preserving existing error hierarchy behavior. Co-authored-by: Cursor <cursoragent@cursor.com>

cpcloud force-pushed the fix-slow-string-compile-pr branch from 832d56f to 60493cf Compare February 20, 2026 19:59

kkraus14 requested a review from gmarkall February 23, 2026 15:44

cpcloud modified the milestones: numba-cuda backlog, numba-cuda v0.28.0 Feb 24, 2026

rparolin modified the milestones: numba-cuda v0.29.0, numba-cuda v0.28.0 Feb 27, 2026

Merge remote-tracking branch 'NVIDIA/main' into fix-slow-string-compi…

21a2ef3

…le-pr

gmarkall approved these changes Feb 27, 2026

View reviewed changes

gmarkall added 4 - Waiting on CI Waiting for a CI run to finish successfully 5 - Ready to merge Testing and reviews complete, ready to merge and removed 4 - Waiting on CI Waiting for a CI run to finish successfully labels Feb 27, 2026

gmarkall merged commit 5df7dcb into NVIDIA:main Feb 27, 2026
206 of 208 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(errors): restore CUDA exception hierarchy to avoid slow string compilation#796

fix(errors): restore CUDA exception hierarchy to avoid slow string compilation#796
gmarkall merged 5 commits intoNVIDIA:mainfrom
cpcloud:fix-slow-string-compile-pr

cpcloud commented Feb 18, 2026 •

edited

Loading

Uh oh!

greptile-apps bot commented Feb 18, 2026

Uh oh!

copy-pr-bot bot commented Feb 18, 2026

Uh oh!

cpcloud commented Feb 18, 2026

Uh oh!

cpcloud commented Feb 18, 2026

Uh oh!

gmarkall commented Feb 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

cpcloud commented Feb 18, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Problem statement

Alternatives considered

Option A: Revert redirect and restore independent CUDA error classes

Option B: Keep aliasing and patch compiler callsites

Option C: Manually mirror CUDA hierarchy as subclasses of core classes

Option D: Dynamically mirror hierarchy at import time

Comparison at a glance

Selected approach in this PR

Uh oh!

greptile-apps bot commented Feb 18, 2026

Uh oh!

copy-pr-bot bot commented Feb 18, 2026

Uh oh!

cpcloud commented Feb 18, 2026

Uh oh!

cpcloud commented Feb 18, 2026

Uh oh!

gmarkall commented Feb 27, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

cpcloud commented Feb 18, 2026 •

edited

Loading