
Conversation

@shalinib-ibm
Contributor

Final Version

@github-actions github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Mar 25, 2025
Member

@ggerganov ggerganov left a comment


There are several stray printfs and leftover debugging code, and the indentation is inconsistent. This looks like a work in progress; it should be cleaned up before we consider merging.

@shalinib-ibm shalinib-ibm marked this pull request as draft March 27, 2025 09:15
@shalinib-ibm
Contributor Author

> There are several stray printfs and leftover debugging code, and the indentation is inconsistent. This looks like a work in progress; it should be cleaned up before we consider merging.

Thank you @ggerganov. Moved it to draft. Will publish the final version soon.

This patch upstreams llamafile's CPU matrix multiplication
kernels for ppc64le, using MMA builtins for the BF16 data type.

This change yields 9x to 40x gains in total speed S t/s
(i.e., all tokens / total time) across the various batch
sizes tested with the llama-batched-bench benchmark.
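For reference, the S t/s figure reported by llama-batched-bench is simply all tokens (prompt plus generated) divided by total wall-clock time. A minimal sketch of the metric (the sample numbers below are hypothetical, not taken from this PR):

```python
def total_speed_tps(n_prompt_tokens: int, n_gen_tokens: int, total_time_s: float) -> float:
    """S t/s: all tokens (prompt + generated) divided by total elapsed time."""
    return (n_prompt_tokens + n_gen_tokens) / total_time_s

# Hypothetical run: 512 prompt tokens + 128 generated tokens in 4.0 s
print(total_speed_tps(512, 128, 4.0))  # 160.0
```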

The patch was tested with the Meta-Llama-3-8B
and Mistral-7B models (BF16 models generated with
llama-quantize from the corresponding FP32 models) on an
IBM POWER10 machine.

Signed-off-by: Shalini Salomi Bodapati <[email protected]>
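As a rough sketch of the test setup described above, the model preparation and benchmark steps might look like the following (file names and flag values are illustrative assumptions, not taken from the PR):

```shell
# Convert an FP32 GGUF model to BF16 (BF16 is a supported llama-quantize target type)
./llama-quantize model-f32.gguf model-bf16.gguf BF16

# Measure total speed S t/s across several batch sizes:
# -npp prompt tokens, -ntg generated tokens, -npl parallel sequence counts
./llama-batched-bench -m model-bf16.gguf -npp 128 -ntg 128 -npl 1,2,4,8
```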
