Commit 8cd1aeb

krishna2803 authored and asl committed
add: more details
Signed-off-by: Krishna Pandey <[email protected]>
1 parent ba1cc34 commit 8cd1aeb

1 file changed: +3 -2 lines changed


content/posts/2025-09-10-bfloat16-in-llvm-libc.md

Lines changed: 3 additions & 2 deletions
@@ -46,13 +46,14 @@ The implementation status can be viewed at the libc `math.h` header implementati
 
 ## What was not done
 
-- The implementation relied on a generic approach, so the `__bf16` compiler intrinsic was not used.
+- The implementation used a generic approach and did not rely on the `__bf16` compiler intrinsic, as it is not available in all compiler versions. Our goal is to ensure that the type is supported by all compilers and versions supported by [LLVM libc](https://libc.llvm.org/compiler_support.html).
 - Hardware optimizations provided by Intel's [AVX-512_BF16](https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html#avx512techs=AVX512_BF16) were not utilized. These instructions only support round-to-nearest-even mode, always flush output denormals to zero, and treat input denormals as zero, which does not align with our goal. See [VCVTNE2PS2BF16 instruction description](https://www.felixcloutier.com/x86/vcvtne2ps2bf16#description).
+- ARMv9 [SVE instructions](https://developer.arm.com/documentation/ddi0602/2021-12/SVE-Instructions/) were not utilized, as they are relatively new and not yet widely supported.
 - Not all higher math functions were implemented due to time constraints.
 
 ## Future Work
 - Implement the remaining higher math functions.
-- Perform performance comparisons with other libc implementations once their `bfloat16` support is available.
+- Perform performance comparisons with other libc implementations once their `bfloat16` support is available and also with the [CORE-MATH](https://core-math.gitlabpages.inria.fr/) project.
 - Update the test suite when the `mpfr_get_bfloat16` function becomes available.
 
 ## Acknowledgements
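
Note on the "generic approach" mentioned in the diff above: the following is a minimal sketch of what a compiler-independent `float` to `bfloat16` conversion can look like, using only integer bit manipulation and no `__bf16` type or hardware intrinsics. It is not LLVM libc's actual code. The function names `float_to_bf16_rne` and `bf16_to_float` are hypothetical, and the sketch assumes `float` is IEEE 754 binary32. It implements round-to-nearest-even only, whereas the post's stated goal goes beyond a single rounding mode. Unlike the AVX-512_BF16 behavior described above, it keeps subnormal inputs and outputs instead of flushing them to zero.

```cpp
#include <cstdint>
#include <cstring>

// Hypothetical helper (not part of LLVM libc): convert a binary32 float to
// bfloat16 bits using round-to-nearest, ties-to-even. Subnormals are kept.
std::uint16_t float_to_bf16_rne(float x) {
  std::uint32_t bits;
  std::memcpy(&bits, &x, sizeof bits);  // well-defined type punning

  // NaN: keep sign and upper payload, and set the quiet bit so that
  // truncating the low 16 bits can never turn a NaN into infinity.
  if ((bits & 0x7FFFFFFFu) > 0x7F800000u)
    return static_cast<std::uint16_t>((bits >> 16) | 0x0040u);

  // Add 0x7FFF plus the lowest retained bit: values above the halfway point
  // round up, exact ties round to the even neighbor. A carry out of the
  // mantissa correctly bumps the exponent (and overflows to infinity).
  std::uint32_t lsb = (bits >> 16) & 1u;
  bits += 0x7FFFu + lsb;
  return static_cast<std::uint16_t>(bits >> 16);
}

// Hypothetical helper: widening bfloat16 bits to float is always exact,
// because bfloat16 is the upper 16 bits of the binary32 encoding.
float bf16_to_float(std::uint16_t h) {
  std::uint32_t bits = static_cast<std::uint32_t>(h) << 16;
  float x;
  std::memcpy(&x, &bits, sizeof x);
  return x;
}
```

Because everything is expressed with `std::uint32_t` arithmetic and `std::memcpy`, a sketch like this builds with any compiler that provides fixed-width integer types, which is the kind of portability the commit message is concerned with.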
