@renovate renovate bot commented Apr 8, 2025

This PR contains the following updates:

Package | Type | Update | Change
cupy (source) | dependencies | major | * -> >=13.4.1
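
To make the new constraint concrete, here is a minimal sketch of what `>=13.4.1` means for an installed version. It assumes plain numeric `X.Y.Z` version strings (pre-release suffixes such as `rc1` are not handled), and `parse_release` and `satisfies_minimum` are hypothetical helper names, not part of Renovate or CuPy.

```python
def parse_release(version: str) -> tuple[int, ...]:
    """Turn a plain 'X.Y.Z' string into a tuple of ints for comparison."""
    return tuple(int(part) for part in version.split("."))


def satisfies_minimum(installed: str, minimum: str = "13.4.1") -> bool:
    """Return True when `installed` is at least `minimum`."""
    return parse_release(installed) >= parse_release(minimum)
```

In a real project, a dedicated version parser such as `packaging.version.Version` is likely the more robust choice, since it also understands pre-release tags.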

Release Notes

cupy/cupy (cupy)

v13.4.1

This is the release note of v13.4.1. This is a hot-fix release addressing several issues including DLPack compatibility with existing user code. See here for the complete list of solved issues and merged PRs.

💬 Join the Matrix chat to talk with developers and users and ask quick questions!

🙌 Help us sustain the project by sponsoring CuPy!

📝 Changes

Bug Fixes

  • Revert toDlpack() default to the old unversioned one (#​9011)
  • Hot fix for numpy 2 support in some fusion paths (#​9016)
  • Fix compilation error of cupy.inf in fusion2 (#​9044)

Tests

  • CI: Pin setuptools version on Windows (#​9047)

Others

👥 Contributors

The CuPy Team would like to thank all those who contributed to this release!

@​asi1024 @​kmaehashi @​seberg

v13.4.0

This is the release note of v13.4.0. See here for the complete list of solved issues and merged PRs.

💬 Join the Matrix chat to talk with developers and users and ask quick questions!

🙌 Help us sustain the project by sponsoring CuPy!

✨ Highlights

NVIDIA CUDA 12.8 Support

CuPy now supports CUDA 12.8 and the latest NVIDIA Blackwell architecture.

AMD ROCm 6.x Support

CuPy can now be built with AMD ROCm 6.x.

Python 3.13 Support

Binary packages for Python 3.13 are now available.

🛠️ Changes without compatibility

Cython 3.0 as build requirement (#​8959)

To provide support for Python 3.13, the CuPy codebase has been updated for Cython 3. Building CuPy from source now requires Cython 3.0 or later instead of Cython 0.29.x.

📝 Changes

New Features

Enhancements

  • Support ROCm 6 (#​8608)
  • Support setuptools 74.0.0 or later (#​8649)
  • Use custom less instead of specializing thrust (#​8653)
  • Add NCCL_ERROR_REMOTE_ERROR to the set of errors from NCCL (#​8667)
  • Replace numpy.ComplexWarning with cupy.exceptions.ComplexWarning (#​8678)
  • Use weakref.finalize instead of __del__ for RandomState._generator destruction (#8680)
  • Implement dlpack v1 (#​8722)
  • Fix some NumPy 2.x CI failures (cont.) (#​8725)
  • Bump CUDA version in cuda11x-cuda-python CI (#​8743)
  • ROCm 6.2.2: Conditionally define CUDA_SUCCESS only if it's not (#​8799)
  • Raise VisibleDeprecationWarning for wavelet functions (#​8868)
  • Use a custom Min/Max instead of specializing CUB (#​8875)
  • Updating pylibraft pairwise_distance to cuvs (#​8897)
  • Support CUDA 12.8 + Blackwell GPUs (sm_100, sm_120) (#​8915)
  • Interpolate: update RBF to scipy 1.13 (#​8939)
  • Use C++17 in JIT compile (#​8941)
  • Bump library installers for CUDA 12.8 (#​8943)
  • Use CCCL 2.8.x branch + Use CUPY_CACHE_KEY in hash keys (#​8946)
  • Use NVIDIA CCCL 2.8 latest w/CUDA 12.3 fix (#​8948)
  • Broaden usage of C++17 (#​8958)
  • Port to Cython 3.0 (#​8959)
  • cupyx.scipy.distance: initialize output array with empty instead of zeros (#​8981)
  • cupyx.scipy.spatial.distance.cdist remove explicit zeroing of user-provided output array (#​8990)
  • Skip sparse.linalg.{cg, cgs, gmres} tests for scipy>=1.14 (#​8551)
  • cupyx.scipy.sparse tests for SciPy 1.14 (#​8552)
  • Fix some NumPy 2.x CI failures (cupyx) (#​8738)
  • Fix cupy.percentile for NumPy 2.x (#​8752)
  • Skip some tests incompatible with NumPy 2.2 (#​8830)
  • Disable contiguous_check for COO/dense matmul test (#​8888)
  • Raise ValueError upon attempts to create 3-dim sparse array (#​8889)
  • Skip a test for invalid scipy return value of invalid COO matmul (#​8890)
  • Fix fft.fht following bug fix in SciPy 1.15 (#​8891)
  • Support empty tuple indexing for sparse matrix (#​8892)
  • Deprecate cupyx.scipy.linalg.kron (#​8902)
  • Fix test for special.sph_harm to ignore DeprecationWarning (#​8906)
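
One of the enhancements listed above (#8680) moves object cleanup to `weakref.finalize`. The sketch below illustrates that general pattern with the standard library only. `FakeGenerator`, `destroy_handle` and `released` are hypothetical stand-ins for illustration, not CuPy's actual `RandomState` implementation.

```python
import weakref

released = []  # records which handles were cleaned up (illustration only)


def destroy_handle(handle: int) -> None:
    """Stand-in for a native 'destroy generator' call."""
    released.append(handle)


class FakeGenerator:
    """Owns a pretend native handle and releases it via weakref.finalize."""

    def __init__(self, handle: int) -> None:
        self.handle = handle
        # The callback must not reference `self`, otherwise the object
        # would be kept alive and never finalized.
        self._finalizer = weakref.finalize(self, destroy_handle, handle)

    def close(self) -> None:
        """Release the handle early; calling it again is a no-op."""
        self._finalizer()
```

A finalizer registered this way runs at most once, whether it is triggered explicitly through `close()`, by garbage collection, or at interpreter exit.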

Bug Fixes

  • Add nccl.broadcast 64-bit support (#​8566)
  • Support building CuPy with setuptools 74 (#​8577)
  • Fix order 'K' with shape given for *_like array creation (#​8605)
  • hipPointerGetAttributes returns error when pointer is unregistered in ROCm 5.7 (#​8609)
  • Guard for ROCm 6.x (#​8611)
  • Fix HIP_VERSION unit (#​8619)
  • Switch to using platform.machine() instead of platform.processor() (#​8656)
  • Properly allocate in RNG when specified dtype is neither float32/float64 (#​8658)
  • Use platform.machine() instead of platform.processor() (#​8673)
  • Fix sosfilt state output shape when ndim < 2 (#​8679)
  • Fix undefined inf/nan constant in CuPy JIT (#​8712)
  • Fix bspline kernel to avoid out of bounds error (#​8763)
  • Fix race during SoftLink initialization (#​8787)
  • fix nanargmin and nanargmax's parameter order and pass optional parameters (#​8791)
  • Fix crashes of quantile and percentile (#​8811)
  • Fix handling of pinned memory (#​8852)
  • Use /bigobj on Windows build (#​8967)
  • Fix cupyx.scipy.spatial.distance's cdist for RAPIDS 24.12 compatibility (#​8975)
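
Two of the fixes above (#8656, #8673) switch architecture detection from `platform.processor()` to `platform.machine()`. As a rough illustration of why this matters: the Python documentation notes that `platform.processor()` may return an empty string when the value cannot be determined, while `platform.machine()` reports the machine type (for example `AMD64`). The helper names and the alias table below are assumptions made for this sketch, not CuPy code.

```python
import platform

# Assumed alias table for illustration; extend as needed.
_ARCH_ALIASES = {
    "amd64": "x86_64",
    "x86_64": "x86_64",
    "arm64": "aarch64",
    "aarch64": "aarch64",
}


def normalize_arch(machine: str) -> str:
    """Map a platform.machine() value to a canonical architecture name."""
    key = machine.lower()
    return _ARCH_ALIASES.get(key, key)


def current_arch() -> str:
    """Detect the host architecture from platform.machine()."""
    return normalize_arch(platform.machine())
```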

Code Fixes

  • Upgrade pre-commit hooks to silence warnings (#​8666)
  • Resolve import loop (#​8714)
  • Resolve uncaught type warning (#​8798)
  • Switch from .A attribute to .toarray() method (#​8814)
  • Fix typo in _cretate_frame_tree (#​8944)
  • Drop unneeded bytes copy of CUPY_CACHE_KEY (#​8947)

Documentation

  • Add docs about CUDA headers (#​8595)
  • Update fft.rst (#​8617)
  • Update documentation to use pre-commit (#​8650)
  • Add tips on Windows development in Contribution Guide (#​8704)
  • Add notice about cupy.array_api removal (#​8751)
  • Add CUDA 12.8 to docs (#​8968)
  • Update list of supported versions (#​8991)

Installation

  • Update conda-build CUDA detection logic for Setuptools 72.2.0 (#​8652)
  • Use relative path of header files to generate cache key (#​8930)
  • Fix minimum CUDA version check and update comments (#​8938)
  • Bump version to v13.4.0 (#​8993)

Tests

  • Relax test_firls atol (#​8522)
  • Skip test_homomorphic in scipy>=1.14 (#​8523)
  • Skip betaincinv test with SciPy 1.14.1 (#​8553)
  • Skip special tests for SciPy 1.14 dtype rule changes (#​8554)
  • Skip special.logsumexp test for empty input (#​8555)
  • Skip cupy.scipy.stats.entropy tests for SciPy 1.14 dtype rule change (#8556)
  • Use setuptools==73.0.1 (#​8569)
  • Revert CI timeout bump (#​8571)
  • Support SciPy 1.13 and 1.14 (#​8572)
  • Missing backport for sparse_array.A removal (#​8573)
  • Skip test_log_expit SciPy 1.7 (#​8576)
  • Catch ValueError (#​8625)
  • Use testing.with_requires to skip broken tests (#​8627)
  • CI: Update micro versions of Python (#​8635)
  • Skip tests if scipy is not installed (#​8637)
  • Accept OverflowError in TestCopytoFromScalar for NumPy v2 (#​8643)
  • Skip more tests if scipy is not installed (#​8645)
  • Update precommit (#​8663)
  • Backport the changes introduced in #​8690 (#​8694)
  • CI: Fix apt repository URL for Ubuntu 22.04 (#​8715)
  • Remove ndarray.ptp from fallback tests (#​8744)
  • Temporary skip for NumPy 2.0 tests (#​8745)
  • Relax tolerance of test_hilbert for NumPy 2.0 (#​8746)
  • Bump SciPy version to 1.14 in Windows CI (#​8764)
  • Add NumPy 2.x CI for Linux (#​8768)
  • CI: support "skip-ci" label (#​8841)
  • CI: Fix FlexCI compatibility (#​8842)
  • Add NumPy 2.2 to CI (#​8855)
  • Replace flake8 with ruff (#​8859)
  • Support Optuna 4 (#​8863)
  • Add testing.shaped_linspace (#​8900)
  • Disable contiguous_check for some signal.cont2discrete tests (#​8901)
  • Fix splines tests to remove unexpected skips (#​8921)
  • Minor updates for sm120 (#​8922)
  • Add CI for CUDA 12.8 (#​8951)
  • Increase host memory in Windows CI, free GPU memory in example code (#​8969)
  • Skip some signal tests for TypeError for inputs of np.longlong dtype (#​8972)
  • Add CI for Python 3.13 and mpi4py v4 (#​8974)
  • Pass locals dict to exec (#​8985)

Others

  • Add backport reminder (#​8684)
  • Fix script name of backport reminder (#​8686)
  • Update pre-commit hooks (#​8910)
  • Fix pull request project board workflows (#​8929)
  • Regenerate coverage matrix (#​8960)

👥 Contributors

The CuPy Team would like to thank all those who contributed to this release!

@​99991 @​andfoy @​asi1024 @​Azusachan @​bernhardmgruber @​Berrysoft @​chainer-ci @​cjnolet @​dagardner-nv @​EarlMilktea @​eltociear @​ev-br @​grlee77 @​HollowMan6 @​jakirkham @​jemiryguo @​kmaehashi @​leofang @​littlewu2508 @​mohitreddy1996 @​mroeschke @​seberg @​takagi

v13.3.0

This is the release note of v13.3.0. See here for the complete list of solved issues and merged PRs.

💬 Join the Matrix chat to talk with developers and users and ask quick questions!

🙌 Help us sustain the project by sponsoring CuPy!

✨ Highlights
Updated NVIDIA CCCL

The CCCL library bundled with CuPy has been updated to eliminate the Jitify preprocess phase. Users will no longer see the one-time performance warning (Jitify is performing a one-time only warm-up to populate the persistent cache, this may take a few seconds and will be improved in a future release...) unless explicitly requesting the use of Jitify (e.g., cupy.RawModule(..., jitify=True)).
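
For readers who still want Jitify, the following is a hedged sketch of opting in through `cupy.RawModule(..., jitify=True)` as mentioned above. The kernel, the `scale_on_gpu` helper and its parameters are illustrative assumptions. Running the function requires CuPy and a CUDA-capable GPU, so the import is deferred until the function is called.

```python
# Requires CuPy and a CUDA-capable GPU to actually run `scale_on_gpu`.
KERNEL_SOURCE = r'''
extern "C" __global__
void scale(const float* x, float* y, float factor, int n) {
    int i = blockDim.x * blockIdx.x + threadIdx.x;
    if (i < n) {
        y[i] = x[i] * factor;
    }
}
'''


def scale_on_gpu(values, factor, jitify=False):
    """Compile the kernel with cupy.RawModule and run it on `values`."""
    import cupy  # imported lazily so this file loads without CuPy

    # jitify=True explicitly requests Jitify, as named in the note above.
    module = cupy.RawModule(code=KERNEL_SOURCE, jitify=jitify)
    kernel = module.get_function("scale")

    x = cupy.asarray(values, dtype=cupy.float32)
    y = cupy.empty_like(x)
    n = x.size
    threads = 128
    blocks = (n + threads - 1) // threads
    kernel((blocks,), (threads,), (x, y, cupy.float32(factor), cupy.int32(n)))
    return cupy.asnumpy(y)
```

With the default `jitify=False`, this path should not trigger the one-time warm-up warning described above. The sketch assumes a non-empty input.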

Enhanced NumPy 2.0 Compatibility

This release provides better interoperability with NumPy 2.0.

Support for CUDA 12.5 & 12.6

CuPy is now tested with CUDA 12.5 and 12.6.

RFC: Removing NumPy Fallback Mode in CuPy v14

The CuPy team is discussing whether to remove the NumPy fallback mode in CuPy v14. If you have comments or use cases that rely on this feature, feel free to join the discussion at https://github.com/cupy/cupy/issues/8497.

📝 Changes
Enhancements
  • Support CUDA 12.5 (#​8423)
  • Avoid using Jitify everywhere inside CuPy (#​8473)
  • Disable jitify for cub & Bump CCCL (#​8487)
  • Get rid of pkg_resources (#​8496)
  • Unregister cupyx.scipy.linalg.{tri,tril,triu} from uarray (reverted in #​8516) (#​8506)
  • Use .toarray() instead of .A attribute (#​8517)
  • Extend runtime header search logic to conda (#​8520)
  • Support CUDA 12.6 (#​8524)
  • Fallback to system headers for future CUDA 12.x versions (#​8529)
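
The list above includes dropping `pkg_resources` (#8496). One common standard-library replacement for version lookups is `importlib.metadata`. Whether CuPy uses exactly this call internally is an assumption, and `installed_version` is a hypothetical helper name.

```python
from importlib import metadata
from typing import Optional


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if absent."""
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None
```

For example, `installed_version("cupy-cuda12x")` returns a version string when that wheel is installed and `None` otherwise.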
Bug Fixes
  • Fix spline temp container size in make_interp_spline (#​8390)
  • MAINT: Avoid using np.compat.integer_types (#​8413)
  • Fix type dispatcher for arm64 (#​8414)
  • Fix ndarray.get() not honoring current stream when layout is not contiguous (#​8418)
  • Fix copyto for NumPy 2 compatibility (#​8435)
  • Update compiler.py to avoid the popup of the nvcc.exe console (#​8438)
  • Fix RandomState.seed() for NumPy 2 compatibility (#​8439)
  • Fix the size of temporary CUB output space to consider its alignment (#​8447)
  • Address KeyErrors from importlib_metadata (#​8465)
  • upfirdn: mode=None -> mode="constant" (#​8495)
  • Search header files from CTK wheel (#​8504)
  • Fix CUDA version condition to use headers from wheel (#​8507)
  • Do not unregister cupyx.scipy.linalg.{tri,tril,triu} from uarray (#​8516)
  • Fix ROCm 4.3 binary package build broken (#​8534)
  • Fix cudart header detection for conda (#​8535)
Documentation
  • eigsh doc correction _eigen.py (#​8383)
  • typo: coping -> copying (#​8427)
  • Add CUDA 12.5 to list of supported platform (#​8428)
  • Add comparison table for (cupyx.)scipy.sparse.*_matrix classes class methods (#​8458)
Installation
  • Patch the build system to better support conda-build (#​8464)
Tests
  • Bump NumPy/SciPy versions in cuda-example CI (#​8420)
  • Support SciPy 1.12 (#​8422)
  • Fix CUDA 11.2 CI failure on Linux (#​8437)
  • Decrease number of threads to avoid "system error: excessive memory usage is detected" (#​8462)
  • CI: skip CUDA 12.1/12.2/12.3/12.4 CI on "mini" trigger (#​8469)
  • Resolve Ruff NPY errors - fix exception imports and asfarray usage in test code (#​8471)
  • Skip some tests in aarch64 CI (#​8490)
👥 Contributors

The CuPy Team would like to thank all those who contributed to this release!

@​andfoy @​arkdong @​asi1024 @​bmerry @​EarlMilktea @​emcastillo @​hmaarrfk @​jakirkham @​johnnynunez @​kmaehashi @​leofang @​monzelr @​seberg @​swelborn @​takagi @​YanivDorGalron

v13.2.0

This is the release note of v13.2.0. See here for the complete list of solved issues and merged PRs.

💬 Join the Matrix chat to talk with developers and users and ask quick questions!

🙌 Help us sustain the project by sponsoring CuPy!

✨ Highlights
Support for NumPy 2.0 (#​8357)

CuPy can now be imported under NumPy 2.0.

Lazily preloading NCCL (#​8367)

CuPy now loads the NCCL shared library when cupy.cuda.nccl is imported, rather than at import cupy. This improves NCCL compatibility in mixed-library environments.
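
In user code, the same idea of deferring a heavy import until it is needed can be sketched as follows. This only illustrates the pattern. It is not CuPy's internal preload mechanism, and `lazy_module` and `get_nccl` are hypothetical helper names.

```python
import importlib


def lazy_module(name: str):
    """Import `name` on first use; later calls reuse the sys.modules entry."""
    return importlib.import_module(name)


def get_nccl():
    """Load cupy.cuda.nccl only when NCCL is actually needed."""
    return lazy_module("cupy.cuda.nccl")
```

Code paths that never call `get_nccl()` never touch the NCCL bindings, which is the behavior the release note describes for `import cupy`.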

📝 Changes
Enhancements
  • cupyx: cleanup use of deprecated NumPy functionality (NumPy 2.0 compatibility) (#​8325)
  • make CuPy import under NumPy 2.0 (#​8357)
  • Lazy-preload NCCL (#​8367)
Bug Fixes
  • Fix overflow indexing ndarray generated with as_strided (#​8349)
  • Fix CUB build error on win-64 (#​8358)
  • Re-enable NVTX range coloring for NVTX3. (#​8361)
Documentation
Tests
  • [v13] Use the latest NumPy v1 for head CI (#​8355)
Others
👥 Contributors

The CuPy Team would like to thank all those who contributed to this release!
@​asi1024 @​cclauss @​ev-br @​grlee77 @​kmaehashi @​leofang @​macrocosme @​romerojosh @​takagi

v13.1.0

This is the release note of v13.1.0. See here for the complete list of solved issues and merged PRs.

💬 Join the Matrix chat to talk with developers and users and ask quick questions!

🙌 Help us sustain the project by sponsoring CuPy!

✨ Highlights
Support for CUDA 12.3 & 12.4 (#​8286)

CuPy now supports CUDA 12.3 and 12.4. Binary packages are available for Linux (x86_64/aarch64) and Windows as cupy-cuda12x.

Fixed Regression on pre-Volta platforms (#​8216)

This release fixes a regression in CuPy v13.0.0 that caused some CuPy functions to fail on pre-Volta platforms (compute capability < 7.0), such as NVIDIA Tesla P100 or GeForce GTX 1080.
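
To check whether a device falls in the affected range, the compute capability comparison can be sketched as below. `is_pre_volta` is a hypothetical helper, and the major and minor numbers are assumed to be supplied by the caller.

```python
VOLTA = (7, 0)  # compute capability threshold named in the note above


def is_pre_volta(major: int, minor: int = 0) -> bool:
    """Return True for compute capabilities below 7.0."""
    return (major, minor) < VOLTA
```

For instance, the Tesla P100 (6.0) and GeForce GTX 1080 (6.1) both count as pre-Volta under this check.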

📝 Changes
New Features
  • Add cupyx.signal.{complex_cepstrum,real_cepstrum,inverse_complex_cepstrum,minimum_phase} (#​8096)
  • Add cupyx.signal.{firfilter,firfilter_zi,firfilter2} (#​8107)
  • Add cupyx.signal.freq_shift (#​8131)
  • Add cupyx.signal.channelize_poly (#​8148)
  • Add cupyx.signal.ca_cfar (#​8167)
Enhancements
  • Add incontiguous support for cutensor functions (#​8168)
  • Remove usages of numpy.float_ and numpy.complex_ (#​8181)
  • Fix expm(complex matrix) (#​8214)
  • Various Jitify improvements (#​8237)
  • Bump to cuTENSOR 2.0.1 (#​8291)
NumPy-compatibility Improvements
  • Fix scp.signal.{medfilt,medfilt2d} to raise ValueError for complex64 inputs (#​8084)
  • Fix boxcox_llf for SciPy 1.12 changes (#​8132)
  • Deprecate cupyx.scipy wavelet functions (#​8139)
Bug Fixes
  • Fix #​7981, Update _nccl_comm.py (#​8112)
  • Fix Flags not to allow setters (#​8138)
  • Prevent angular brackets from appearing in Jitify's cache filename (#​8160)
  • Set -arch in the compiler options unconditionally (#​8161)
  • Allow cupy.show_config() without CUDA (#​8192)
  • Fix jitify warmup kernel (#​8216)
  • Fix: remove unnecessary include that causes deployment issue (#​8217)
  • Fix build system for Thrust detection (#​8230)
  • Fix: always switch to the submodule dir before checking git tag/commit (#​8240)
  • Fix overflow of index calculation in random generator API (#​8246)
  • Fix Generator API parallelism (#​8247)
  • Fix CUB min/max initial values (#​8266)
  • Fix jitify warmup kernel - Cont'd (#​8270)
Documentation
  • Update conda installation guide (#​8135)
  • Fix pdist docstring in order to specify that the returned matrix is condensed (#​8187)
  • Replace license notice in cupyx.scipy.signal._spectral (#​8271)
  • Update document for CUDA 12.3 and 12.4 (#​8284)
Installation
  • Do not search for static libs (#​8143)
Tests
  • Fix cupyx.scipy.special.betainc for invalid inputs (#​8098)
  • Revert CI timeout changes (#​8137)
  • Fix invalid vectorstength tests (#​8145)
  • Fix actions versions used in workflows to avoid node 16 deprecation warning (#​8194)
  • Add CI to test cupy.show_config() pass without CUDA installed (#​8195)
  • Add import test without CUDA Toolkit (#​8231)
  • BUG: cupyx/scipy/signal: fix mpmath test (#​8262)
  • Tentatively pin SciPy to v1.12 in CI (#​8275)
  • Add support for CUDA 12.3 & 12.4 (#​8286)
👥 Contributors

The CuPy Team would like to thank all those who contributed to this release!

@​andfoy @​asi1024 @​emcastillo @​ev-br @​jemiryguo @​kmaehashi @​leofang @​takagi

v13.0.0

This is the release note of v13.0.0. See here for the complete list of solved issues and merged PRs.

This release note only covers changes made since the v13.0.0rc1 release. Check out our blog for highlights of the v13 release!

See the Upgrade Guide for the list of possible breaking changes in v13.

💬 Join the Matrix chat to talk with developers and users and ask quick questions!

🙌 Help us sustain the project by sponsoring CuPy!

📝 Changes

For all changes in v13, please refer to the release notes of the pre-releases (alpha1, beta1, rc1).

New Features
  • Add cupyx.signal.pulse_compression from cuSignal's non SciPy-compat API (#​8039)
  • Add cupyx.signal.convolve1d3o from cuSignal's non SciPy-compat API (#​8067)
  • add cupyx.signal.{pulse_doppler, cfar_alpha} (#​8069)
  • Add cupyx.signal.convolve1d2o (#​8113)
Enhancements
  • Make cupyx.signal.radartools private (#​8053)
  • Fix csrmatrix.__pow__ to raise ValueError for non-int other (#​8085)
Performance Improvements
  • Speed up cupy environment duplicate detection (#​8042)
Bug Fixes
  • Fix lfilter_zi and sosfilt_zi when any IIR coefficient is zero (#​8036)
  • Fix argmax/argmin for large reduction axis (#​8041)
  • Fix cupyx.scipy.fft.{dst,dstn} in type 2/3 (#​8082)
  • Do not use from-import (#​8114)
Code Fixes
Documentation
  • Generate signature for ufunc documentation (#​8044)
  • Use modern dlpack interface in torch interoperability document (#​8048)
Installation
  • Skip CUDA_PATH warning in Conda installation (#​8076)
  • Bump version to v13.0.0 (#​8119)
Tests
  • Bump stable branch to v13 (#​8026)
  • Remove some signal.vectorstrength xfail tests (#​8083)
  • Fix scipy.linalg not to raise DeprecationWarning for zero-size inputs (#​8086)
  • scipy.special.{btdtr,btdtri} are deprecated since SciPy (#​8094)
  • Refactor radartools tests (#​8099)
  • Fix slow test (#​8117)
👥 Contributors

The CuPy Team would like to thank all those who contributed to this release!

@andfoy @asi1024 @emcastillo @hauntsaninja @kmaehashi @takagi

v12.3.0

This is the release note of v12.3.0. See here for the complete list of solved issues and merged PRs.

This is the last planned release for the CuPy v12 series. Please start testing your workload with the v13 release candidate to get ready for the final v13 release. To install: pip install -U --pre cupy-cuda11x -f https://pip.cupy.dev/pre. See the Upgrade Guide for the list of possible breaking changes in v13.

💬 Join the Matrix chat to talk with developers and users and ask quick questions!

🙌 Help us sustain the project by sponsoring CuPy!

✨ Highlights
Support for Python 3.12

Binary packages are now available for Python 3.12.

📝 Changes
Enhancements
  • Add a property to get access to the nccl handle. (#​7824)
  • Add Linux CI for Python 3.12 (#​7949)
  • Bump supported NumPy & SciPy versions (#​8001)
  • ENH: Remove NINF, PINF, Inf,... usages (#​7805)
  • Avoid using numpy.find_common_type (#​7810)
  • Remove conversions of array with ndim > 0 to a scalar (#​7895)
  • scipy.linalg.{tri/tril/triu} are deprecated in SciPy 1.11.0 (#​7902)
  • Fix signal.medfilt complex error type for SciPy>=1.11 (#​7909)
  • Fix return type of division of csr_matrix and dense array for SciPy 1.11 (#​7912)
  • Skip TestSpmatrix on SciPy 1.11 or later (#​7918)
  • Fix test of product, cumproduct, alltrue and sometrue for deprecation (#​7936)
  • Skip fusion round_ tests (#​7937)
Bug Fixes
  • Fix csr2dense to avoid race conditions (#​7808)
  • Fix cuTENSOR contraction descriptor cache (#​7817)
  • Provide .stop() method for cupyx.distributed._Backend (#​7960)
Code Fixes
  • MNT: suppress compiler warning from cupyx.cusolver (#​7819)
  • Fix coding style (#​7846)
  • Remove deprecated ast expr in CuPy JIT (#​7944)
  • Remove unnecessary CUB files from CuPy distribution (#​7975)
Documentation
  • Add -U to pre-release installation command (#​7806)
  • Fix CUDA 12.2 for Windows notice (#​7926)
Installation
  • Fix ROCm Dockerfile not working (#​7799)
  • Avoid warning when uploading packages (#​7807)
Tests
  • Add timeout to Windows CI (#​7859)
  • CI: Remove path length limitation on Windows CI image (#​7860)
  • Fix Windows CI failures (#​7865)
  • Fix Windows + CUDA 12.2 CI (#​7910)
  • Skip test_pos_boolarray if numpy>=1.25 (#​7913)
  • Skip some LOBPCG tests failing with SciPy 1.11 (#​7931)
  • Add NumPy 1.25/1.26 & SciPy 1.11 to CI (#​7932)
  • Skip logspace test in NumPy 1.25 & 1.26 (#​7946) (#​7951)
  • Support Python 3.12, add Windows CI (#​7958)
  • Fix Windows test scripts (#​7961)
  • Skip test_parameterize_pytest_impl test for pytest 7.4.3 (#​7968)
  • Filter DeprecationWarning for distutils.dep_util used in Cython (#​7999)
  • Fix TestLOBPCG.test_maxit_None CUDA 12.2 CI failure (#​8007)
Others
👥 Contributors

The CuPy Team would like to thank all those who contributed to this release!

@​asi1024 @​emcastillo @​kmaehashi @​leofang @​mtsokol @​mvnvidia

v12.2.0

This is the release note of v12.2.0. See here for the complete list of solved issues and merged PRs.

We are running a Gitter chat for general discussions and quick questions. Feel free to join the channel to talk with developers and users!

✨ Highlights
Support for CUDA 12.2

CuPy now supports CUDA 12.2. Note that there is a known issue on CUDA 12.2 for Windows. See #​7776 for details.

Support us via GitHub Sponsors!

As part of our effort to make CuPy sustainable, we have enrolled in GitHub Sponsors to accept donations. Your support helps fund CuPy's development and offset the infrastructure costs of GPU-enabled CI platforms and the resources needed to build binary packages.

As a NumFOCUS Sponsored Project, funds sponsored through the GitHub Sponsors are collected and disbursed via NumFOCUS, a 501(c)(3) public charity in the United States, which acts as the fiscal sponsor for the project.

🛠️ Changes without compatibility
Deprecation of cupy-wheel Package

Due to a recent specification change in Pip 23.1, it has become difficult for cupy-wheel to reliably detect the installed CUDA version. As discussed in RFC #7628, we have decided to remove this package in CuPy v13. To allow existing projects that use cupy-wheel to keep working, the package remains available for v12 releases.

📝 Changes
Enhancements
  • Minor updates for cuQuantum/cuTensorNet support (#​7730)
  • Bump mypy version to 1.4.1 (#​7736)
  • Support CUDA 12.2 (#​7752)
Performance Improvements
  • Fix random module performance regression (#​7592)
Bug Fixes
  • Fix returned CUDA statuses not being checked (#​7618)
  • Fix cuSPARSE error message (#​7684)
  • Fix memory pool to try resolve fragmentation when limit is set (#​7685)
  • Fix type/exception annotations in cuSPARSE binding (#​7703)
  • Update pylibcugraph weakly connected components call (#​7704)
  • Improve detection for package installation source on Windows (#​7711)
  • Temporarily disable CUB histogram (#​7716)
  • Fix aweights type not checked in cupy.cov (#​7717)
  • Revert FP16 headers from CUDA 12.2.0 to CUDA 12.1.1 (#​7773)
Code Fixes
Documentation
  • Improve README and Installation Guide (#​7599)
  • update badges (#​7600)
  • Fix small typos in docstrings (#​7657)
  • Fix docstring of asarray (#​7695)
  • Add CUDA 12.2 to list of supported CUDA (#​7756)
  • Remove incorrect cupyx.distributed.NCCLBackend.all_gather comment (#​7765)
  • Fix Note highlight sections in README (#​7770)
  • Add notes for CUDA 12.2 on Windows support (#​7778)
Installation
  • Fix cupy-wheel package installation fails with pip 23.1+ (#​7624)
Tests
  • Bump versions of static checkers (#​7598)
  • Fix build-cuda test restore-keys not working (#​7614)
  • [v12] Require numpy<1.25 for round_ tests (#​7642)
  • Ignore pkg_resources deprecation warning on import (#​7656)
  • Skip TestLOBPCG::test_maxit_None in CUDA 12.1.1 & cuSOLVER 11.4.5 (#7670)
  • Bump CUDA minor versions used in CI (#​7683)
  • [v12] Allow specifying Docker repository for CI images (#​7690)
  • Use "/test" tag configuration from pull-request base branch (#​7706)
  • XFAIL known test failures in cuSPARSE module (#​7725)
  • Fix test_fht not to feed cupy.ndarray to scipy.fft.fhtoffset (#​7728)
  • CI: remove explicit Cython installation (#​7731)
  • Fix test_sum_duplicates_incompatibility for SciPy 1.11 (#​7768)
Others
👥 Contributors

The CuPy Team would like to thank all those who contributed to this release!

@​12rambau @​asi1024 @​emcastillo @​jnke2016 @​kmaehashi @​leofang @​pelmers @​pri1311 @​RandomY-2 @​takagi

v12.1.0

This is the release note of v12.1.0. See here for the complete list of solved issues and merged PRs.

We are running a Gitter chat for general discussions and quick questions. Feel free to join the channel to talk with developers and users!

Changes
New Features
  • Add array_api.take function (#​7513)
Enhancements
Bug Fixes
  • Fixup array/asarray call to prefer C order on plain NumPy arrays (#​7493)
  • Fix cudart errors raised by texture APIs swallowed by Cython (#​7566)
  • Dispatch ufunc methods (#​7583)
Code Fixes
Documentation
  • Update aarch64 install instructions (#7503)
  • Fix RTD build failure (#​7554)
Installation
  • Use -Xfatbin=-compress-all (#​7505)
  • Fix _depends.json not included in wheel (#​7584)
Tests
  • Bump platform versions used in actions (#​7501)
  • Fix TestBSpline::test_design_matrix_same_as_BSpline_call (#​7525)
  • Remove unused test decorators (#​7535)
  • Restore GitHub Actions cache with prefix match (#​7571)
  • Fix CUDA Python CI failure (#​7582)
Contributors

The CuPy Team would like to thank all those who contributed to this release!

@​andfoy @​arogozhnikov @​asi1024 @​kmaehashi @​leofang @​seberg @​takagi

v12.0.0

This is the release note of v12.0.0. See here for the complete list of solved issues and merged PRs.

This release note only covers changes made since the v12.0.0rc1 release. Check out our blog for highlights of the v12 release!

We are running a Gitter chat for general discussions and quick questions. Feel free to join the channel to talk with developers and users!

Highlights
Support for CUDA 12.1 & cuDNN 8.8 (#​7484 & #​7475)

CuPy now supports CUDA 12.1 and cuDNN 8.8. Binary packages are available for Linux (x86_64/aarch64) and Windows as cupy-cuda12x.

$ pip install cupy-cuda12x
Announcements
Arm packages available in PyPI

Binary packages for aarch64 (Jetson and Arm servers) can now be installed from PyPI.

$ pip install cupy-cuda102
$ pip install cupy-cuda11x
$ pip install cupy-cuda12x

Note (resolved on 2023-04-03): At the time of the release, the Arm wheel of cupy-cuda11x for Python 3.8 (cupy_cuda11x-12.0.0-cp38-cp38-manylinux2014_aarch64.whl) was not available on PyPI. Until the issue was resolved, this wheel could be installed from the CuPy index: $ pip install cupy-cuda11x -f https://pip.cupy.dev/aarch64
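
Choosing among the three package names above can be sketched as a small lookup. `wheel_for_cuda` is a hypothetical helper, and the assumption that `cupy-cuda11x` covers CUDA 11.2 and later is not stated in this release note, so treat the mapping as illustrative.

```python
from typing import Optional


def wheel_for_cuda(major: int, minor: int) -> Optional[str]:
    """Pick one of the package names listed above for a CUDA version."""
    if (major, minor) == (10, 2):
        return "cupy-cuda102"
    if major == 11 and minor >= 2:  # assumption: 11x covers CUDA 11.2+
        return "cupy-cuda11x"
    if major == 12:
        return "cupy-cuda12x"
    return None  # not covered by the three packages named above
```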

Changes

For all changes in v12, please refer to the release notes of the pre-releases (alpha1, alpha2, beta1, beta2, beta3, rc1).

Enhancements
Bug Fixes
  • Fix new strides when array is both C and F-contiguous (#​7451)
Code Fixes
  • Rename type_test to type_testing (#​7461)
Documentation
  • Add comparison table for scipy.interpolate module (#​7450)
  • Update list of supported libraries (#​7486)
Tests
Others
Contributors

The CuPy Team would like to thank all those who contributed to this release!

@​AdrianAbeyta @​asi1024 @​emcastillo @​kmaehashi @​seberg

v11.6.0

This is the release note of v11.6.0. See here for the complete list of solved issues and merged PRs.

This is the last planned release for the CuPy v11 series. Please start testing your workload with the v12 release candidate to get ready for the final v12 release. To install: pip install -U --pre cupy-cuda11x -f https://pip.cupy.dev/pre. See the Upgrade Guide for the list of possible breaking changes in v12.

We are running a Gitter chat for general discussions and quick questions. Feel free to join the channel to talk with developers and users!

Highlights
Fixed Performance Issue with CUDA 12.0

This release fixes a critical performance regression with CUDA 12.0 in which the on-disk kernel cache was ineffective, causing kernels to be recompiled in every Python process. Users on CUDA 12.0 are strongly encouraged to upgrade to this release.

Changes
Enhancements
  • Use warp size from runtime.getDeviceProperties (#​7353)
  • Update DLPack to v0.8 to support bool arrays (#​7376)
  • Mar

Configuration

📅 Schedule: Branch creation - At any time (no schedule defined), Automerge - At any time (no schedule defined).

🚦 Automerge: Disabled by config. Please merge this manually once you are satisfied.

Rebasing: Whenever PR becomes conflicted, or you tick the rebase/retry checkbox.

🔕 Ignore: Close this PR and you won't be reminded about this update again.


  • If you want to rebase/retry this PR, check this box

This PR was generated by Mend Renovate. View the repository job log.

@lucascolley lucascolley merged commit 03c8e60 into main Apr 8, 2025
11 checks passed
@lucascolley lucascolley added the dependencies Pull requests that update a dependency file label Apr 8, 2025
@renovate renovate bot deleted the renovate/cupy-13.x branch April 8, 2025 17:04