Skip to content

Conversation

@itzmeanjan
Copy link
Owner

Closes #26

After introducing support for GFNI intrinsics, on AWS EC2 m7a.large with AMD EPYC 9R14, seeing some performance boost. Nothing too much exciting, but I'll keep it.

Before erasure-coding throughput was ~50GB/s, now it is ~60GB/s 🌟

Screenshot From 2025-09-21 14-46-44

@itzmeanjan itzmeanjan self-assigned this Sep 21, 2025
@itzmeanjan itzmeanjan merged commit 04d20d6 into main Sep 22, 2025
12 checks passed
@itzmeanjan itzmeanjan deleted the 26-optimize-gf28-multiplication-using-x86_64-gfni-intrinsics branch September 22, 2025 05:34
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Optimize GF(2^8) multiplication using x86_64 GFNI intrinsics

2 participants