Skip to content

Commit e8fb4e7

Browse files
committed
Add intrinsics for the FEAT_SVE_AES2 feature introduced by the 2024 dpISA
FEAT_SVE_AES2 adds 1) SVE multi-vector Advanced Encryption Standard (AES) instructions Instructions added: AESE, AESD, AESEMC and AESDIMC For each instruction there are two variants a) Two registers variant b) Four registers variant 2) SVE multi-vector 128-bit polynomial multiply long instructions Instructions added: PMULL and PMLAL FEAT_SSVE_AES implements the same instructions but when in streaming mode.
1 parent 1db9c69 commit e8fb4e7

File tree

1 file changed

+36
-0
lines changed

1 file changed

+36
-0
lines changed

main/acle.md

Lines changed: 36 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -465,6 +465,7 @@ Armv8.4-A [[ARMARMv84]](#ARMARMv84). Support is added for the Dot Product intrin
465465

466466
* Added feature test macro for FEAT_SSVE_FEXPA.
467467
* Added feature test macro for FEAT_CSSC.
468+
* Added support for FEAT_SVE_AES2, FEAT_SSVE_AES intrinsics.
468469

469470
### References
470471

@@ -2147,6 +2148,15 @@ support for the SVE2 AES (FEAT_SVE_AES) instructions and if the associated
21472148
ACLE intrinsics are available. This implies that `__ARM_FEATURE_AES`
21482149
and `__ARM_FEATURE_SVE2` are both nonzero.
21492150

2151+
In addition, `__ARM_FEATURE_SVE2_AES2` is defined to `1` if there is hardware
2152+
support for the SVE2 AES2 (FEAT_SVE_AES2) instructions and if the associated
2153+
ACLE intrinsics are available. This implies that `__ARM_FEATURE_AES`
2154+
and `__ARM_FEATURE_SVE2` are both nonzero.
2155+
2156+
`__ARM_FEATURE_SSVE_AES2` is defined to 1 if there is hardware support for
2157+
SVE2 AES2 (FEAT_SVE_AES2) instructions in Streaming SVE mode (FEAT_SSVE_AES)
2158+
and if the associated ACLE intrinsics are available.
2159+
21502160
#### SHA2 extension
21512161

21522162
`__ARM_FEATURE_SHA2` is defined to 1 if the SHA1 & SHA2-256 Crypto
@@ -2642,6 +2652,8 @@ be found in [[BA]](#BA).
26422652
| [`__ARM_FEATURE_SVE_VECTOR_OPERATORS`](#scalable-vector-extension-sve) | Level of support for C and C++ operators on SVE predicate types | 1 |
26432653
| [`__ARM_FEATURE_SVE2`](#sve2) | SVE version 2 (FEAT_SVE2) | 1 |
26442654
| [`__ARM_FEATURE_SVE2_AES`](#aes-extension) | SVE2 support for the AES cryptographic extension (FEAT_SVE_AES) | 1 |
2655+
| [`__ARM_FEATURE_SVE2_AES2`](#aes-extension) | SVE2 support for the SVE multi-vector AES cryptographic extension (FEAT_SVE_AES2) | 1 |
2656+
| [`__ARM_FEATURE_SSVE_AES2`](#aes-extension) | SVE2 support for the SVE multi-vector AES cryptographic extension (FEAT_SSVE_AES) | 1 |
26452657
| [`__ARM_FEATURE_SVE2_BITPERM`](#bit-permute-extension) | SVE2 bit permute extension | 1 |
26462658
| [`__ARM_FEATURE_SSVE_BITPERM`](#bit-permute-extension) | SVE2 bit permute extension | 1 |
26472659
| [`__ARM_FEATURE_SSVE_FEXPA`](#streaming-sve-fexpa-extension) | Streaming SVE FEXPA extension | 1 |
@@ -9405,6 +9417,30 @@ to work with `svboolx2_t` and `svboolx4_t`. For example:
94059417
svboolx2_t svundef2_b();
94069418
```
94079419

9420+
#### AESE, AESD, AESEMC, AESDIMC
9421+
9422+
Multi-vector Advanced Encryption Standard instructions
9423+
9424+
svuint8x2_t svaese[_u8_x2] (svuint8x2_t op1, svuint64_t op2, uint64_t index);
9425+
svuint8x4_t svaese[_u8_x4] (svuint8x4_t op1, svuint64_t op2, uint64_t index);
9426+
svuint8x2_t svaesd[_u8_x2] (svuint8x2_t op1, svuint64_t op2, uint64_t index);
9427+
svuint8x4_t svaesd[_u8_x4] (svuint8x4_t op1, svuint64_t op2, uint64_t index);
9428+
svuint8x2_t svaesemc[_u8_x2] (svuint8x2_t op1, svuint64_t op2, uint64_t index);
9429+
svuint8x4_t svaesemc[_u8_x4] (svuint8x4_t op1, svuint64_t op2, uint64_t index);
9430+
svuint8x2_t svaesdimc[_u8_x2] (svuint8x2_t op1, svuint64_t op2, uint64_t index);
9431+
svuint8x4_t svaesdimc[_u8_x4] (svuint8x4_t op1, svuint64_t op2, uint64_t index);
9432+
9433+
#### PMULL, PMLAL
9434+
9435+
Multi-vector 128-bit polynomial multiply long instructions
9436+
9437+
``` c
9438+
// Variants are also available for:
9439+
// _s64x2, _f64x2
9440+
svuint64x2_t svpmull[_u64x2](svuint64_t zn, svuint64_t zm);
9441+
svuint64x2_t svpmlal[_u64x2](svuint64_t zn, svuint64_t zm);
9442+
```
9443+
94089444
#### ADDQV, FADDQV
94099445

94109446
Unsigned/FP add reduction of quadword vector segments.

0 commit comments

Comments
 (0)