Address "Optimize GF(2^8) multiplication using x86_64 GFNI intrinsics" #27

itzmeanjan · 2025-09-21T09:24:21Z

Closes #26

After adding support for GFNI intrinsics, I'm seeing a modest performance boost on an AWS EC2 m7a.large (AMD EPYC 9R14). Nothing too exciting, but I'll keep it.

Erasure-coding throughput was ~50GB/s before; it is now ~60GB/s 🌟
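
For readers unfamiliar with the arithmetic being accelerated, here is a minimal scalar sketch of GF(2^8) multiplication modulo `x^8 + x^4 + x^3 + x + 1`, the polynomial that the GFNI multiply instruction (`GF2P8MULB`) hard-codes. The names `gf256_mul` and `REDUCTION_POLY` are illustrative assumptions, not this crate's API, and the SIMD path added in this PR is not reproduced here.

```rust
/// Low eight bits of the reduction polynomial x^8 + x^4 + x^3 + x + 1 (0x11b).
/// Hypothetical name, chosen for this sketch only.
const REDUCTION_POLY: u8 = 0x1b;

/// Multiplies two GF(2^8) elements using shift-and-add,
/// reducing modulo the polynomial above after every doubling.
/// Hypothetical helper, not taken from the crate.
fn gf256_mul(mut a: u8, mut b: u8) -> u8 {
    let mut acc = 0u8;
    for _ in 0..8 {
        // Addition in GF(2^8) is XOR: add the current multiple of `a`
        // whenever the lowest remaining bit of `b` is set.
        if b & 1 == 1 {
            acc ^= a;
        }
        // Double `a` (multiply by x). If an x^8 term would appear,
        // fold it back in using the reduction polynomial.
        let carry = a & 0x80 != 0;
        a <<= 1;
        if carry {
            a ^= REDUCTION_POLY;
        }
        b >>= 1;
    }
    acc
}
```

As a sanity check, `gf256_mul(0x57, 0x83)` yields `0xc1`, matching the worked example in FIPS-197. A scalar routine like this is mainly useful as a reference to compare a vectorized implementation against, one byte lane at a time.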

itzmeanjan added 3 commits September 21, 2025 11:40

Update pre-computed GF(2**8) log and anti-log table - using AES irreducible polynomial `x^8 + x^4 + x^3 + x + 1`

39c9a88

Signed-off-by: Anjan Roy <[email protected]>

Implement gf256 SIMD multiplication using GFNI intrinsics

2740cc7

Signed-off-by: Anjan Roy <[email protected]>

Mention correct irreducible polynomial everywhere

1bce359

Signed-off-by: Anjan Roy <[email protected]>

itzmeanjan self-assigned this Sep 21, 2025

itzmeanjan added 5 commits September 22, 2025 10:01

Bump dependency version

8f4de8a

Signed-off-by: Anjan Roy <[email protected]>

Add plots showing performance on AMD machine with GFNI+AVX512

a37c9d9

Signed-off-by: Anjan Roy <[email protected]>

Reflect latest optimization result using x86_64 GFNI intrinsics, on README

41f8273

Signed-off-by: Anjan Roy <[email protected]>

Bump crate version to 0.8.5

47899f7

Signed-off-by: Anjan Roy <[email protected]>

Address clippy warnings

b70c25d

Signed-off-by: Anjan Roy <[email protected]>

itzmeanjan merged commit 04d20d6 into main Sep 22, 2025
12 checks passed

itzmeanjan deleted the 26-optimize-gf28-multiplication-using-x86_64-gfni-intrinsics branch September 22, 2025 05:34
