cpu: riscv: softmax: add f16 support #4491
Merged
Description
This PR adds support for the FP16 (half-precision) data type to the RISC-V Vector (RVV) Softmax primitive.
Added `compute_softmax_f16_rvv`, which uses Zvfh intrinsics. To preserve precision, the kernel performs accumulation and the exponential/logarithm computations in f32 (via widening conversions, `__riscv_vfwcvt`) and converts back to f16 only for the final output.
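The numeric strategy (f16 storage, f32 compute) can be modeled in scalar form. This is a hypothetical sketch, not the RVV kernel: Python's `struct` `'e'` format stands in for f16 storage, and native floats stand in for the f32 widened arithmetic.

```python
import math
import struct

def to_f16(x):
    # Round a float to the nearest IEEE binary16 value,
    # modeling f16 storage ('e' is the half-precision format code).
    return struct.unpack('e', struct.pack('e', x))[0]

def softmax_f16_widened(xs_f16):
    # Model of the kernel's approach: inputs/outputs are f16, but the
    # max, exp, and sum run in wider precision (f32 in the RVV kernel;
    # native floats here). Narrowing happens only on the final result.
    m = max(xs_f16)                       # running max in wide precision
    exps = [math.exp(x - m) for x in xs_f16]
    s = sum(exps)                         # accumulate in wide precision
    return [to_f16(e / s) for e in exps]  # convert back to f16 at the end

vals = [to_f16(v) for v in (1.0, 2.0, 3.0)]
out = softmax_f16_widened(vals)
```

Subtracting the max before exponentiating keeps `exp` in range; accumulating in f32 avoids the error that would build up if the sum were carried in f16.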
Checklist
General
- Do all unit and benchdnn tests (`make test` and `make test_benchdnn_*`) pass locally for each commit?

Performance improvements
Performance was evaluated on SG2044 (RISC-V) using 16 pinned cores. The comparison is between the new RVV f16 implementation and the previous behavior (fallback to the reference implementation).
(Note: only FWD_D cases were collected.)
speedup_ratio.csv
with_f16_softmax.csv
without_f16_softmax.csv
Average Speedup: ~23.40x