·
26 commits
to master
since this release
This patch release consists of the major updates from cuSZp2 (SC'24 paper) to cuSZp3 (SC'25 paper).
cuSZp V3.x has following designs:
- Support for high dimensionality data (1D, 2D and 3D) with dimension-aware delta encoding.
- Each dimension has three algorithms: fixed, plain, and outlier.
- fixed: Only fixed-length encoding, suitable for unsmooth scientific data or machine learning weights/tokens.
- plain: Delta-encoding + fixed-length encoding; similar with previous plain design (but with high dimension support).
- outlier: Delta-encoding + fixed-length encoding + outlier preservation: similar with previous outlier (with dimension support).
- Versatility support (including memory-efficient compression and selective decompression) will be included in other branches later.
- Still pure-GPU design with single kernel function.
cuSZp V3.x has following features:
- Ultra-fast end-to-end throughput (even higher than cuSZp V2.x).
- High compression ratio in various data patterns (always higher compression ratios than cuSZp V2.x).
- F64 and F32 data types are supported.
- Executable binary, C/C++ API, and Python API are supported.