This release features the addition of the new function tensor.isin, indexing of tensor.usm_ndarray with numpy.ndarray, and support for building dpctl for specific CUDA architectures.
Improvements were also made to the build time and binary size of the project, and to the build driver script, making it more convenient when building for CUDA or AMD devices.
Added
- Added tensor.isin per future Python Array API specification version gh-2098
- numpy.ndarrays are now permitted when indexing on tensor.usm_ndarray gh-2128
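The two additions above can be illustrated with a minimal sketch. It assumes dpctl and NumPy are installed and that at least one SYCL device is available. The function name isin_and_numpy_indexing_demo is hypothetical, and the positional call form of tensor.isin follows the Array API draft rather than anything stated in these notes.

```python
def isin_and_numpy_indexing_demo():
    # Imports are deferred so the sketch can be loaded on machines without
    # dpctl; actually calling the function needs dpctl, NumPy, and a device.
    import numpy as np
    import dpctl.tensor as dpt

    x = dpt.asarray([1, 2, 3, 4])
    test_elements = dpt.asarray([2, 4])

    # Element-wise membership test: True where an element of x occurs
    # in test_elements.
    mask = dpt.isin(x, test_elements)

    # Index a usm_ndarray with a host-side numpy.ndarray of integer positions.
    picked = x[np.asarray([0, 3])]

    # Copy both results back to the host for inspection.
    return dpt.asnumpy(mask), dpt.asnumpy(picked)
```

On a machine that meets these assumptions, the call should return a boolean mask equivalent to [False, True, False, True] and the selected values [1, 4].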
Changed
- Made a number of constexpr variables inline or static throughout the project, especially in headers, to reduce binary size and improve build time gh-2094, gh-2107
- DPCTL_TARGET_CUDA and DPCTL_TARGET_HIP now permit specifying the CUDA or HIP architectures gh-2096, gh-2099
- Extended build_locally.py build driver script to permit --target-cuda and --target-hip options, which match the behavior of DPCTL_TARGET_CUDA and DPCTL_TARGET_HIP gh-2109
- Improved tensor.asnumpy and tensor.to_numpy for size-0 arrays gh-2120
- Permit type casting size-0 tensor.usm_ndarray to arbitrary dtype via tensor.usm_ndarray constructor's buffer keyword (i.e., using the original memory as the buffer for the new size-0 array's underlying memory) gh-2123
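As a rough illustration of the new build driver options, the sketch below assembles a command line for build_locally.py. Several details are assumptions rather than facts from these notes: the script location scripts/build_locally.py, the --option=value spelling, and the example architecture names sm_80 and gfx90a. The helper build_locally_command is hypothetical and only builds the argument list; it does not run the build.

```python
import sys


def build_locally_command(target_cuda=None, target_hip=None,
                          script="scripts/build_locally.py"):
    """Return an argv list for the build driver script (illustrative only).

    target_cuda / target_hip: None to omit the option, or a string naming
    the architecture to build for (assumed --option=value syntax).
    """
    cmd = [sys.executable, script]
    options = (("--target-cuda", target_cuda), ("--target-hip", target_hip))
    for flag, value in options:
        if value is not None:
            cmd.append(f"{flag}={value}")
    return cmd


# Example: request a build for one assumed CUDA architecture.
print(build_locally_command(target_cuda="sm_80"))
```

The resulting list could be passed to subprocess.run from the repository root. Check the script's --help output for the exact option syntax before relying on it.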
Fixed
- Fixed tensor.asarray failing when given device keyword with an input array of a dtype not supported by device gh-2097
- Fixed undefined behavior in radix sort algorithm and avoided calls to sorting algorithms when calling tensor.sort and tensor.argsort on size-1 arrays, or along a size-1 axis gh-2106
- Fixed incorrect results when calling dpt.astype on tensor.usm_ndarray constructed from a boolean view into a numpy.ndarray gh-2122
- Fixed dpctl imported in virtual environment on Windows failing to see devices or find DLLs gh-2130
- Fixed Cythonization failure when testing the ability to create dpctl Cython API extensions with an editable install gh-2147
Maintenance
- Revert restricting Cython to below 3.1.0 when building dpctl for Python 3.13 gh-2118
- Add a link to tensor.DLDeviceType documentation from __dlpack_device__ docstring gh-2127
- Update pybind11 to 3.0.1 gh-2145
- Miscellaneous changes to continuous integration/delivery (CI/CD) supporting scripts gh-2043, gh-2044, gh-2065, gh-2066, gh-2068, gh-2070, gh-2088, gh-2104, gh-2151, gh-2154, gh-2155