0.3.0 — accelerated_scan.complex
accelerated_scan.complex now supports long, variable-length complex-valued inputs. accelerated_triton has been renamed to accelerated_scan.scalar and now supports variable-length inputs by looping over short chunks of the scan (2048 items each), similar to the warp implementation. The Triton version has been tested on GB10 (DGX Spark).
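The chunked looping described above can be illustrated with a minimal pure-Python sketch. It is not the library's kernel code: it assumes the scan is a first-order linear recurrence `h[t] = gate[t] * h[t-1] + token[t]`, and the names `scan_reference`, `scan_chunk`, and `scan_chunked` are hypothetical. The per-chunk loop stands in for whatever the GPU kernel does within a chunk; the point is only that carrying the last state from one chunk into the next reproduces the full-length scan for any input length.

```python
CHUNK = 2048  # chunk length taken from the release note


def scan_reference(gates, tokens):
    """Sequential scan over the whole input, assuming h[t] = gate[t] * h[t-1] + token[t]."""
    h = 0j
    out = []
    for g, x in zip(gates, tokens):
        h = g * h + x
        out.append(h)
    return out


def scan_chunk(gates, tokens, carry):
    """Scan one short chunk, starting from the state carried in from the previous chunk."""
    h = carry
    out = []
    for g, x in zip(gates, tokens):
        h = g * h + x
        out.append(h)
    return out


def scan_chunked(gates, tokens, chunk=CHUNK):
    """Scan an input of arbitrary length by looping over fixed-size chunks."""
    out = []
    carry = 0j  # state before the first element
    for start in range(0, len(tokens), chunk):
        part = scan_chunk(gates[start:start + chunk], tokens[start:start + chunk], carry)
        out.extend(part)
        carry = part[-1]  # last state of this chunk seeds the next one
    return out


if __name__ == "__main__":
    n = 5000  # deliberately not a multiple of the chunk length
    gates = [complex(0.9, 0.001 * (t % 7)) for t in range(n)]
    tokens = [complex(t % 5, -(t % 3)) for t in range(n)]
    assert scan_chunked(gates, tokens) == scan_reference(gates, tokens)
    print("chunked scan matches the reference for", n, "items")
```

Because the last chunk may be shorter than 2048 items, the input length does not need to be a multiple of the chunk size in this sketch.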
Full Changelog: v0.2.0...v0.3.0