Skip to content

Releases: NVIDIA/numba-cuda

v0.30.0

06 Apr 21:37
v0.30.0
7928b8b

Choose a tag to compare

What's Changed

  • Test cuDF third party test test_groupby_apply_return_reindexed_series by @mroeschke in #823
  • Move cuda-pathfinder dependency to core by @ZzEeKkAa in #835
  • Fix cache invalidation logic. by @tpn in #800
  • Implement launch config infrastructure. by @tpn in #804
  • Use cuda-pathfinder to locate nvdisasm by @Jlisowskyy in #842
  • Fix CABI calling convention: skip env global and escape mangled names by @isVoid in #844
  • Bump Version to 0.30.0 by @isVoid in #848

New Contributors

Full Changelog: v0.29.0...v0.30.0

v0.29.0

17 Mar 21:46
v0.29.0
a8221c7

Choose a tag to compare

What's Changed

  • Extend dbg.value coverage to loadvar for scalar kernel parameters by @jiel-nv in #813
  • Fix FP8 uint64 cast flake on Windows by @cpcloud in #829
  • Use dbg.declare for scalar kernel parameters by @cpcloud in #828
  • Fix mixed-IR liveness for inline overload DCE by @cpcloud in #795
  • Use cuda-python for nvvm bindings by @brandon-b-miller in #818
  • fix(ci): cudaRoundMode typing failure in FP8 test by @kaeun97 in #834
  • Support cuda_bindings FastEnum by @mdboom in #837
  • Support cuda.core.GraphBuilder as a kernel-launch stream by @Andy-Jost in #836
  • fix: normalize numpy integer types to python int to prevent overflow errors by @kaeun97 in #774
  • Bump Version to 0.29.0 by @isVoid in #838

New Contributors

Full Changelog: v0.28.2...v0.29.0

v0.28.2

03 Mar 18:17
e2e23ed

Choose a tag to compare

What's Changed

Full Changelog: v0.28.1...v0.28.2

v0.28.1

02 Mar 23:16
09531d6

Choose a tag to compare

What's Changed

Full Changelog: v0.28.0...v0.28.1

v0.28.0

02 Mar 16:54
b2c6522

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.27.0...v0.28.0

v0.27.0

05 Feb 23:30
v0.27.0
ff49cc3

Choose a tag to compare

What's Changed

  • remove super args by @cpcloud in #763
  • test(refactor): clean up run_in_subprocess by @cpcloud in #762
  • Disable automatic review trigger for Greptile by @gmarkall in #743
  • Enable apt proxy caching; skip hosted Windows builds by @kkraus14 in #766
  • build(deps): bump actions/setup-python from 6.1.0 to 6.2.0 in the actions-monthly group across 1 directory by @dependabot[bot] in #768
  • Add cuda-core to oldest tests by @brandon-b-miller in #769
  • Generate line info for PHI exporters in terminator block by @jiel-nv in #756
  • Move CallConv from CUDAContext to FunctionDescriptor by @isVoid in #717
  • feat: Add documentation for debugging Numba CUDA programs with CUDA GDB and VSCode by @mmason-nvidia in #665
  • Remove unused rtapi.py by @brandon-b-miller in #773
  • fix: fix boolean return type mismatch in C ABI wrapper by @kaeun97 in #770
  • Add CUDA FP8 type + conversion bindings (E5M2/E4M3/E8M0), HW-accel detection, and comprehensive tests by @isVoid in #686
  • Bump version to 0.27.0 by @gmarkall in #776

Full Changelog: v0.26.0...v0.27.0

v0.26.0

30 Jan 14:57
v0.26.0
500b41f

Choose a tag to compare

What's Changed

  • Eliminate duplicate DWARF entries for boolean kernel parameters by @jiel-nv in #749
  • feat: accept cuda.core.Buffer and cuda.core.utils.StridedMemoryView as kernel inputs by @cpcloud in #751
  • Replace legacy wheels-build.yaml with build-wheel.yml in publish workflow [no-ci] by @kkraus14 in #760
  • MatMul test: Move from unittest to fully pytest by @maifeeulasad in #754
  • ci: move benchmarks to single function calls so that the units and results are easier to interpret by @cpcloud in #759
  • CI cleanup, extend no-ci skip logic, and add GitHub Release uploads by @kkraus14 in #761
  • bump pixi version and relock by @cpcloud in #757
  • Bump version to 0.26.0 by @kkraus14 in #764

New Contributors

Full Changelog: v0.25.0...v0.26.0

v0.25.0

28 Jan 09:29
bfa805a

Choose a tag to compare

What's Changed

  • build(deps): bump the actions-monthly group across 1 directory with 8 updates by @dependabot[bot] in #704
  • chore(dev): build pixi using rattler by @cpcloud in #713
  • [feat] Initial version of the Numba CUDA GDB pretty-printer by @mmason-nvidia in #692
  • revert: chore(dev): build pixi using rattler (#713) by @cpcloud in #719
  • Fix DISubprogram line number to point to function definition line by @jiel-nv in #695
  • chore(deps): regenerate pixi lockfile by @cpcloud in #722
  • Disable per-PR nvmath tests + follow same test practice by @leofang in #723
  • Adding pixi run test and pixi run test-par support by @rparolin in #724
  • CI: Add CUDA 13.1 testing support by @Copilot in #705
  • Use pathfinder for dynamic libraries by @brandon-b-miller in #308
  • ci: remove rapids containers from conda ci by @cpcloud in #737
  • Pass the -numba-debug flag to libnvvm by @mmason-nvidia in #681
  • Fix compatibility with NumPy 2.4: np.trapz and np.in1d removed by @kkraus14 in #739
  • feat: users can pass shared_memory_carveout to @cuda.jit by @kaeun97 in #642
  • ci: run tests in parallel by @cpcloud in #740
  • fix: Fix race condition in CUDA Simulator by @ccam80 in #690
  • fix: enable flake8-bugbear lints and fix found problems by @cpcloud in #708
  • chore(deps): add cuda-pathfinder to pixi deps by @cpcloud in #741
  • Fix: Pass correct flags to linker when debugging in the presence of LTOIR code by @mmason-nvidia in #698
  • Fix missing line info in Jupyter notebooks by @jiel-nv in #742
  • Fix kernel return type in DISubroutineType debug metadata by @jiel-nv in #745
  • Fix prologue debug line info pointing to decorator instead of def line by @jiel-nv in #746
  • Fix max block size computation in forall by @brandon-b-miller in #744
  • feat: swap out internal device array usage with StridedMemoryView by @cpcloud in #703
  • Add Python 3.14 to the wheel publishing matrix by @gmarkall in #750

New Contributors

Full Changelog: v0.24.0...v0.25.0

v0.24.0

22 Jan 12:13
c82ff48

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.23.0...v0.24.0

v0.23.0

18 Dec 13:07
1b8e3c0

Choose a tag to compare

What's Changed

New Contributors

Full Changelog: v0.22.1...v0.23.0