Skip to content

v0.21.0

Latest
Compare
Choose a tag to compare
@ndgrigorian ndgrigorian released this 08 Oct 20:57
· 19 commits to master since this release
878cc19

This release features the addition of new function tensor.isin, indexing of tensor.usm_ndarray with numpy.ndarray, and support for building dpctl for specific CUDA architectures.

Improvements were also made to the build time and binary size of the project, and to the build driver script, making it more convenient when building for CUDA or AMD devices.

Added

  • Added tensor.isin per future Python Array API specification version gh-2098
  • numpy.ndarrays are now permitted when indexing on tensor.usm_ndarray gh-2128

Changed

  • Made a number of constexpr variables inline or static throughout the project, especially in headers, to reduce binary size and improve build time gh-2094, gh-2107
  • DPCTL_TARGET_CUDA and DPCTL_TARGET_HIP now permit specifying the CUDA or HIP architectures gh-2096, gh-2099
  • Extended build_locally.py build driver script to permit --target-cuda and --target-hip options, which match the behavior of DPCTL_TARGET_CUDA and DPCTL_TARGET_HIP gh-2109
  • Improved tensor.asnumpy and tensor.to_numpy for size-0 arrays gh-2120
  • Permit type casting size-0 tensor.usm_ndarray to arbitrary dtype via tensor.usm_ndarray constructor's buffer keyword (i.e., using the original memory as the buffer for the new size-0 array's underlying memory) gh-2123

Fixed

  • Fixed tensor.asarray failing when given device keyword with an input array of a dtype not supported by device gh-2097
  • Fixes undefined behavior in radix sort algorithm and avoids call to sorting algorithms when calling tensor.sort and tensor.argsort on size-1 arrays, or along a size-1 axis gh-2106
  • Fixed incorrect results when calling dpt.astype on tensor.usm_ndarray constructed from a boolean view into a numpy.ndarray gh-2122
  • Fixed dpctl imported in virtual environment on Windows failing to see devices or find DLLs gh-2130
  • Fixed Cythonization failure when testing the ability to create dpctl Cython API extensions with an editable install gh-2147

Maintenance