0.3.0 — accelerated_scan.complex

@proger proger released this 25 Dec 04:21

accelerated_scan.complex now supports long, variable-length complex-valued inputs. accelerated_triton has been renamed to accelerated_scan.scalar and now supports variable-length inputs by looping over the scan in short chunks of 2048 items, similar to the warp implementation. The Triton version has been tested on GB10 (DGX Spark).
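
To illustrate the chunking idea, here is a minimal pure-Python sketch, not the library's kernel code. It assumes the scan is a first-order linear recurrence h[t] = gates[t] * h[t-1] + tokens[t], which the note above does not state; the names scan_reference and scan_chunked are hypothetical and exist only for this example. Each chunk is scanned on its own, and the last state of one chunk is carried into the next, so the input length does not need to fit in a single chunk.

```python
CHUNK = 2048  # chunk length quoted in the release note


def scan_reference(gates, tokens, h0=0):
    """Sequential scan over the whole input (assumed recurrence):
    h[t] = gates[t] * h[t-1] + tokens[t], starting from h0."""
    out = []
    h = h0
    for g, x in zip(gates, tokens):
        h = g * h + x
        out.append(h)
    return out


def scan_chunked(gates, tokens, chunk=CHUNK):
    """Hypothetical chunked scan: process `chunk` items at a time and
    carry the final state of each chunk into the next one."""
    out = []
    carry = 0  # state entering the first chunk
    for start in range(0, len(tokens), chunk):
        stop = start + chunk
        part = scan_reference(gates[start:stop], tokens[start:stop], h0=carry)
        out.extend(part)
        carry = part[-1]  # last state of this chunk seeds the next
    return out


if __name__ == "__main__":
    # Complex-valued input whose length is not a multiple of the chunk size.
    n = 5000
    gates = [complex(0.9, 0.01 * (i % 7)) for i in range(n)]
    tokens = [complex(i % 5, -(i % 3)) for i in range(n)]
    assert scan_chunked(gates, tokens) == scan_reference(gates, tokens)
```

Because the chunked loop performs the same operations in the same order as the sequential one, the two results match for any input length, including lengths that are not a multiple of the chunk size.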

Full Changelog: v0.2.0...v0.3.0