v1.4.2

@davidesantangelo davidesantangelo released this 08 Jan 14:33

This release brings significant performance improvements, expanded SIMD support, and better cross-platform compatibility.

New Features

AVX-512 SIMD Support

  • Added an AVX-512 search path for patterns up to 64 bytes
  • Automatic detection and utilization of AVX-512 instructions on supported CPUs
  • Graceful fallback to AVX2/SSE4.2 on older hardware
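
The detection-and-fallback order described above can be sketched roughly as follows. This is a minimal illustration, not the project's actual code: the `simd_level_t` enum and both function names are hypothetical, the check relies on the GCC/Clang `__builtin_cpu_supports` builtin on x86, and sending patterns longer than 64 bytes to a scalar path is an assumption made only for the example.

```c
#include <stddef.h>

/* Hypothetical names for illustration; not taken from the release itself. */
typedef enum { SIMD_NONE, SIMD_SSE42, SIMD_AVX2, SIMD_AVX512 } simd_level_t;

/* Probe the CPU once at startup and report the widest usable SIMD level. */
simd_level_t detect_simd_level(void)
{
#if (defined(__GNUC__) || defined(__clang__)) && \
    (defined(__x86_64__) || defined(__i386__))
    __builtin_cpu_init();
    if (__builtin_cpu_supports("avx512f") && __builtin_cpu_supports("avx512bw"))
        return SIMD_AVX512;
    if (__builtin_cpu_supports("avx2"))
        return SIMD_AVX2;
    if (__builtin_cpu_supports("sse4.2"))
        return SIMD_SSE42;
#endif
    return SIMD_NONE; /* scalar fallback on other compilers or architectures */
}

/* Pick a search path for one pattern. The 64-byte limit comes from the
 * release notes; using the scalar path for longer patterns is an assumption. */
simd_level_t choose_search_path(simd_level_t cpu, size_t pattern_len)
{
    if (pattern_len == 0 || pattern_len > 64)
        return SIMD_NONE;
    return cpu;
}
```

A caller would typically run `detect_simd_level()` once, cache the result, and pass it to `choose_search_path()` for each pattern.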

Enhanced Memory Performance

  • Added prefetching (__builtin_prefetch) in search functions for better cache utilization
  • Reduced MIN_CHUNK_SIZE to 2MB for improved parallelism on multi-core systems
  • Added compiler optimization hints (LIKELY/UNLIKELY, HOT_FUNCTION)
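
To illustrate how prefetching and the hint macros might fit together in a scan loop, here is a small sketch. The macro definitions show one common way to write `LIKELY`/`UNLIKELY` and `HOT_FUNCTION` for GCC and Clang; the `PREFETCH` macro, the 256-byte prefetch distance, and the `count_byte` function are assumptions chosen for the example, not values or code from the release.

```c
#include <stddef.h>

#if defined(__GNUC__) || defined(__clang__)
#define LIKELY(x)    __builtin_expect(!!(x), 1)
#define UNLIKELY(x)  __builtin_expect(!!(x), 0)
#define HOT_FUNCTION __attribute__((hot))
/* Read prefetch (rw = 0) with high temporal locality (3). */
#define PREFETCH(p)  __builtin_prefetch((p), 0, 3)
#else
#define LIKELY(x)    (x)
#define UNLIKELY(x)  (x)
#define HOT_FUNCTION
#define PREFETCH(p)  ((void)0)
#endif

/* Assumed prefetch distance in bytes; a real value would need tuning. */
#define PREFETCH_DISTANCE 256

/* Count occurrences of one byte, prefetching ahead once per 64-byte line. */
HOT_FUNCTION
size_t count_byte(const unsigned char *buf, size_t len, unsigned char target)
{
    size_t count = 0;
    for (size_t i = 0; i < len; i++) {
        if ((i & 63) == 0 && i + PREFETCH_DISTANCE < len)
            PREFETCH(buf + i + PREFETCH_DISTANCE);
        if (UNLIKELY(buf[i] == target)) /* matches are assumed to be rare */
            count++;
    }
    return count;
}
```

The hints only guide code layout and cache behavior; the function returns the same result with or without them.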

Thread Pool Improvements

  • Adaptive mutex using PTHREAD_MUTEX_ADAPTIVE_NP where available
  • Reduced thread stack size to 256KB for lower memory overhead
  • Added batch task submission for improved efficiency
  • Smarter thread count selection (cores - 1 for system headroom)
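
The thread pool settings above could be wired up along these lines. This is a hedged sketch using standard pthread calls; the function names are hypothetical, and the glibc check is an assumption about how "where available" might be detected (in glibc, `PTHREAD_MUTEX_ADAPTIVE_NP` is an enum constant, so `#ifdef` on the name itself would not work).

```c
#define _GNU_SOURCE
#include <pthread.h>
#include <unistd.h>

#define WORKER_STACK_SIZE (256 * 1024) /* 256KB, as stated in the notes */

/* Leave one core free for the rest of the system, but never go below 1. */
int worker_count(long online_cores)
{
    return online_cores > 1 ? (int)(online_cores - 1) : 1;
}

int default_worker_count(void)
{
    return worker_count(sysconf(_SC_NPROCESSORS_ONLN));
}

/* Prepare thread attributes with the reduced stack size. */
int init_worker_attr(pthread_attr_t *attr)
{
    int rc = pthread_attr_init(attr);
    if (rc != 0)
        return rc;
    rc = pthread_attr_setstacksize(attr, WORKER_STACK_SIZE);
    if (rc != 0)
        pthread_attr_destroy(attr);
    return rc;
}

/* Use an adaptive (spin-then-block) mutex on glibc, a default one elsewhere. */
int init_pool_mutex(pthread_mutex_t *mutex)
{
    pthread_mutexattr_t attr;
    int rc = pthread_mutexattr_init(&attr);
    if (rc != 0)
        return rc;
#if defined(__GLIBC__)
    rc = pthread_mutexattr_settype(&attr, PTHREAD_MUTEX_ADAPTIVE_NP);
#endif
    if (rc == 0)
        rc = pthread_mutex_init(mutex, &attr);
    pthread_mutexattr_destroy(&attr);
    return rc;
}
```

Each worker would then be started with `pthread_create(&tid, &attr, worker_fn, arg)`, where `worker_fn` stands in for the pool's own worker routine.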