Fix mixed-IR liveness for inline overload DCE by cpcloud · Pull Request #795 · NVIDIA/numba-cuda

cpcloud · 2026-02-18T19:46:58Z

Summary

- fix use/def and DCE liveness tracking for mixed `numba.core` and vendored CUDA IR nodes created by `inline="always"` overload inlining
- preserve inlined overload dependencies so DCE no longer drops the live `static_getitem` feeding the return value in issue #718 ("[BUG] KeyError with inline="always" and some interaction with DCE")
- add a user-style regression with a module-level overload and kernel launch using `cuda.local.array`
- refresh `pixi.lock` local package entries so numba-cuda variants resolve to 0.27.0

Related: the separate np.dtype signature compatibility fix was split into #797.

Fixes #718

greptile-apps · 2026-02-18T19:47:01Z

Automatic reviews are disabled for this repository.

copy-pr-bot · 2026-02-18T19:47:01Z

Auto-sync is disabled for ready for review pull requests in this repository. Workflows must be run manually.

cpcloud · 2026-02-18T20:04:20Z

/ok to test

cpcloud · 2026-02-18T20:49:29Z

/ok to test

cpcloud · 2026-02-18T20:55:14Z

/ok to test

## Summary

- align CUDA `np.dtype` overload parameters with NumPy (`dtype`, `align`, `copy`) while avoiding unsupported `**kwargs` in typing templates
- align CUDA `np.dot` and `np.vdot` overload parameter names with NumPy (`a`, `b`, `out`) so signature-compatibility checks pass across NumPy variants
- preserve existing lowering behavior for supported dtype and BLAS-backed dot/vdot paths

Extracted from #795 to keep the issue-718 DCE fix focused.

---------

Co-authored-by: Cursor <cursoragent@cursor.com>
Co-authored-by: brandon-b-miller <brmiller@nvidia.com>
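The kind of signature-compatibility check mentioned in this summary can be illustrated with `inspect.signature`. This is only a sketch: `reference_dtype`, `overload_old`, and `overload_new` are hypothetical stand-ins, not NumPy's or numba-cuda's actual functions, and the real check in the test suite may compare more than parameter names.

```python
import inspect


def reference_dtype(dtype, align=False, copy=False):
    """Hypothetical stand-in for the public signature being matched."""


def overload_old(dtype):
    """Hypothetical overload whose parameters have drifted from the reference."""


def overload_new(dtype, align=False, copy=False):
    """Hypothetical overload whose parameter names match the reference."""


def missing_parameters(reference, candidate):
    """Return reference parameter names that the candidate does not accept."""
    ref_params = inspect.signature(reference).parameters
    cand_params = inspect.signature(candidate).parameters
    return [name for name in ref_params if name not in cand_params]


print(missing_parameters(reference_dtype, overload_old))  # ['align', 'copy']
print(missing_parameters(reference_dtype, overload_new))  # []
```

Spelling the parameters out explicitly, rather than accepting `**kwargs`, keeps the candidate's signature fully visible to a name-based comparison like this one.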

cpcloud · 2026-02-20T19:28:36Z

/ok to test

Issue NVIDIA#718 mixed IR node classes after inline overload expansion, which made use/def and liveness tracking miss live vars and allowed DCE to drop required expressions. Use compatibility var collection in analysis/DCE and add a user-level regression that exercises inline="always" with cuda.local.array.

Co-authored-by: Cursor <cursoragent@cursor.com>
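The failure mode described in this commit message can be reproduced on a toy IR. The sketch below is not Numba's IR or its DCE pass: `Var`, `CoreAssign`, `VendoredAssign`, and both collector functions are hypothetical names, chosen only to show how a use collector that recognises a single node class lets a backward liveness sweep discard a statement that is still needed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Var:
    name: str


class CoreAssign:
    """Toy assignment node standing in for one IR module's class."""

    def __init__(self, target, uses):
        self.target = target
        self.uses = list(uses)


class VendoredAssign:
    """Toy assignment node standing in for a second, vendored IR class."""

    def __init__(self, target, uses):
        self.target = target
        self.uses = list(uses)


def uses_strict(stmt):
    # Recognises only one node class, so statements built from the other
    # class appear to use no variables at all.
    return set(stmt.uses) if isinstance(stmt, CoreAssign) else set()


def uses_compat(stmt):
    # Class-agnostic: relies on the shared attribute, not the concrete type.
    return set(stmt.uses)


def dead_code_eliminate(stmts, returned, collect_uses):
    """Backward sweep that keeps only statements defining a live variable."""
    live = {returned}
    kept = []
    for stmt in reversed(stmts):
        if stmt.target in live:
            kept.append(stmt)
            live.discard(stmt.target)
            live |= collect_uses(stmt)
    kept.reverse()
    return kept


arr, item, ret = Var("arr"), Var("item"), Var("ret")
program = [
    CoreAssign(arr, []),          # e.g. a local array allocation
    CoreAssign(item, [arr]),      # e.g. a getitem on that array
    VendoredAssign(ret, [item]),  # node of the "other" class consuming item
]

broken = dead_code_eliminate(program, ret, uses_strict)
fixed = dead_code_eliminate(program, ret, uses_compat)
print([s.target.name for s in broken])  # ['ret']
print([s.target.name for s in fixed])   # ['arr', 'item', 'ret']
```

With the strict collector, the definition of `item` is removed even though `ret` depends on it, which mirrors the dropped `static_getitem` described in the PR summary.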

Use a direct list_vars call with AttributeError fallback in compat_list_vars_node instead of pre-checking attributes, which keeps the compatibility path straightforward while preserving mixed-IR behavior.

Co-authored-by: Cursor <cursoragent@cursor.com>
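The "call first, fall back on `AttributeError`" pattern named in this commit message might look roughly like the following. This is not the PR's `compat_list_vars_node`: the helper name `list_vars_with_fallback`, the toy node classes, and in particular the attribute-scanning fallback are assumptions made purely for illustration.

```python
class ToyVar:
    """Hypothetical variable type used only in this sketch."""

    def __init__(self, name):
        self.name = name


class NodeWithListVars:
    """Toy node that exposes list_vars()."""

    def __init__(self, target, value):
        self.target = target
        self.value = value

    def list_vars(self):
        return [self.target, self.value]


class NodeWithoutListVars:
    """Toy node from a different class hierarchy with no list_vars()."""

    def __init__(self, target, args):
        self.target = target
        self.args = list(args)


def list_vars_with_fallback(node):
    """Return the variables referenced by a node, whichever class it is."""
    try:
        # Try the direct call first instead of pre-checking with hasattr().
        return list(node.list_vars())
    except AttributeError:
        # Assumed fallback: scan instance attributes for variables.
        found = []
        for value in vars(node).values():
            if isinstance(value, ToyVar):
                found.append(value)
            elif isinstance(value, (list, tuple)):
                found.extend(v for v in value if isinstance(v, ToyVar))
        return found


a, b, c = ToyVar("a"), ToyVar("b"), ToyVar("c")
print([v.name for v in list_vars_with_fallback(NodeWithListVars(a, b))])
print([v.name for v in list_vars_with_fallback(NodeWithoutListVars(c, [a, b]))])
```

One trade-off of this style is that an `AttributeError` raised inside a node's own `list_vars` implementation would also be routed to the fallback, so it suits cases where that method is known to be simple.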

Update local package records in pixi.lock so all matrix variants of numba-cuda reflect version 0.27.0 during pixi-managed test/build workflows.

Co-authored-by: Cursor <cursoragent@cursor.com>

Match the CUDA np.dtype overload parameters to NumPy's public signature so signature-compatibility checks pass on environments where dtype exposes align/copy/kwargs parameters.

Co-authored-by: Cursor <cursoragent@cursor.com>

This reverts commit af57423.

cpcloud · 2026-02-20T19:36:01Z

/ok to test

rparolin added this to the numba-cuda backlog milestone Feb 18, 2026

cpcloud mentioned this pull request Feb 18, 2026

Fix np.dtype overload signature drift #797

Merged

cpcloud and others added 5 commits February 20, 2026 14:29

chore: refresh pixi lock package version entries

53a221f

fix: align numpy dtype overload signature with numpy API

17cda12

Revert "fix: align numpy dtype overload signature with numpy API"

ccf93f2

cpcloud force-pushed the fix/issue-718-inline-overload-dce branch from 3c93c51 to ccf93f2 on February 20, 2026 19:36

gmarkall modified the milestones: numba-cuda backlog, numba-cuda v0.28.0, numba-cuda v0.29.0 Feb 27, 2026

cpcloud wants to merge 5 commits into NVIDIA:main from cpcloud:fix/issue-718-inline-overload-dce

3 participants
