Skip to content

v1.2.1

Latest

Choose a tag to compare

@jonathanpwang jonathanpwang released this 27 Oct 01:22
dde6cda

This release offers major performance improvements to the CUDA backend. It includes a new Virtual Pool Memory Manager (VPMM) in openvm-cuda-common that provides multi-stream memory management using CUDA driver APIs to avoid memory fragmentation. Several kernels in openvm-cuda-backend were also optimized to give significant performance gains.

Added

  • (CUDA common) New memory manager with Virtual Pool (VPMM Spec) with multi-stream support built on top of the CUDA Virtual Memory Management driver API

Changed

  • (CUDA common) Multi-arch build support
  • (CUDA backend) Quotient values kernel optimization
  • (CUDA backend) FRI reduced opening kernel optimization by removing bit reversal for better memory access patterns

What's Changed

Full Changelog: v1.2.0...v1.2.1