GH-47128: [Python] Numba-CUDA interop with NVIDIA bindings#47150
kou merged 2 commits into apache:main
Conversation
Numba and Numba-CUDA return a different type from `Context.memalloc()` depending on whether their built-in ctypes bindings or the NVIDIA CUDA Python bindings are in use - either a `ctypes.c_void_p` or a `cuda.bindings.driver.CUdeviceptr`. Whilst this inconsistency is unfortunate, it is hard to change because existing code in the wild depends on it. The issue in Arrow is that the value of the pointer cannot be obtained via `device_pointer.value` for a `CUdeviceptr`. Numba and Numba-CUDA do provide another property, `device_pointer_value`, that returns the device pointer as an `int` regardless of the kind of bindings in use, so we can switch to using this for consistency between the two. Fixes apache#47128.
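The difference can be sketched as follows. `FakeCUdeviceptr` and the minimal `MemoryPointer` below are hypothetical stand-ins, not the real Numba or `cuda.bindings` classes; they only illustrate why a single `int`-valued property is the binding-agnostic way to read the address:

```python
import ctypes


class FakeCUdeviceptr:
    """Stand-in for cuda.bindings.driver.CUdeviceptr: convertible to int,
    but not a ctypes object, so `.value` cannot be relied on."""

    def __init__(self, ptr):
        self._ptr = ptr

    def __int__(self):
        return self._ptr


class MemoryPointer:
    """Minimal stand-in for the object Context.memalloc() returns."""

    def __init__(self, device_pointer):
        # Either a ctypes.c_void_p (built-in bindings) or a
        # CUdeviceptr-like object (NVIDIA bindings).
        self.device_pointer = device_pointer

    @property
    def device_pointer_value(self):
        # Normalize both representations to a plain int.
        dp = self.device_pointer
        if isinstance(dp, ctypes.c_void_p):
            return dp.value
        return int(dp)


# Both binding styles yield the same plain int:
a = MemoryPointer(ctypes.c_void_p(0xDEAD))
b = MemoryPointer(FakeCUdeviceptr(0xDEAD))
assert a.device_pointer_value == b.device_pointer_value == 0xDEAD
```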
@github-actions crossbow submit -g cuda
Revision: 232c66e Submitted crossbow builds: ursacomputing/crossbow @ actions-6712a32a6e
It seems the cuda runners haven't been available for the last several days. I am trying to investigate, cc @assignUser
@github-actions crossbow submit -g cuda
Revision: 232c66e Submitted crossbow builds: ursacomputing/crossbow @ actions-9953d44aa0
@assignUser the cuda runners still don't seem to be working 🤔
@raulcd ah sorry, I was testing with the nanoarrow cluster; looks like there are some issues with the crossbow ones. Investigating now.
@raulcd the cuda nodes are back up!
A gentle ping on this one (perhaps for @pitrou?)
After merging your PR, Conbench analyzed the 4 benchmarking runs that have been run so far on merge-commit ddd25d1. There weren't enough matching historic benchmark results to make a call on whether there were regressions. The full Conbench report has more details.
Sorry for not looking at this earlier, I was on holiday. It's nice that the CI is green, but looking at the logs, it seems the tests are actually skipped. Can you take a look at this @gmarkall?
Yes, I will take a look at this - thanks for spotting it and mentioning!
Oh, sorry. I should have noticed it...

Rationale for this change
Testing with Numba-CUDA, which uses the NVIDIA CUDA Python bindings by default, identified that PyArrow's Numba interop is incompatible with Numba / Numba-CUDA when the NVIDIA bindings are in use. See issue #47128.
What changes are included in this PR?
The fix is to get device pointer values from their `device_pointer_value` property, which is consistent across the ctypes and NVIDIA bindings in Numba.

I also attempted to update the CI config to install Numba-CUDA. I think some of the comments in `docker-compose.yml` were a bit out of sync with changes to it, so I also updated the comments that appeared to be relevant to reflect what I had to run locally. I could have got the CI changes all wrong - happy to change these, as they're not really the critical part of this PR.

Fixes #47128.
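As a before/after sketch of the call-site change described above. `MemAlloc` is a hypothetical stand-in for the allocation object Numba returns, here modelling only the ctypes-bindings case; the real change lives in PyArrow's CUDA/Numba interop code:

```python
import ctypes


class MemAlloc:
    """Hypothetical stand-in for what Context.memalloc() returns
    (ctypes-bindings case only)."""

    def __init__(self, raw):
        self.device_pointer = ctypes.c_void_p(raw)  # binding-dependent type
        self.device_pointer_value = raw             # binding-agnostic int


def get_pointer_before(mem):
    # Old approach: works for ctypes.c_void_p, but the PR notes the raw
    # address cannot be obtained this way from a CUdeviceptr.
    return mem.device_pointer.value


def get_pointer_after(mem):
    # New approach: device_pointer_value is an int under either bindings.
    return mem.device_pointer_value


mem = MemAlloc(0x1000)
assert get_pointer_before(mem) == get_pointer_after(mem) == 0x1000
```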
Are these changes tested?
Yes, by the existing `test_cuda_numba_interop.py` and the CI changes in this PR.

Are there any user-facing changes?
No.