vulkan: Add bfloat16 support #12554

jeffbolznv · 2025-03-24T21:16:11Z

This adds bfloat16 matrix multiply support based on VK_KHR_shader_bfloat16. The extension is required for coopmat multiply support, but matrix-vector multiply trivially promotes bf16 to fp32 and doesn't require the extension. The copy/get_rows shaders also don't require the extension.

It's probably possible to fall back to non-coopmat and promote to fp32 when the extension isn't supported, but this change doesn't do that.

The coopmat support also requires a glslc that supports the extension, which currently requires a custom build.

jeffbolznv · 2025-03-24T21:18:52Z

The tooling for Vulkan bfloat16 is not all merged yet - see KhronosGroup/SPIRV-Tools#6057 and KhronosGroup/glslang#3905. So if anybody wants to try this locally, you'd need to build a custom glslc. I'll update when it's all merged

NVIDIA will release a Vulkan developer driver with support for this extension, hopefully tomorrow.

ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp

jeffbolznv · 2025-04-01T20:29:07Z

I've rebased this, and added a fallback (promote to fp32) when bf16 coopmat support isn't available. Still waiting on the tooling to be merged, so still Draft for now.

This adds bfloat16 matrix multiply support based on VK_KHR_shader_bfloat16. The extension is required for coopmat multiply support, but matrix-vector multiply trivially promotes bf16 to fp32 and doesn't require the extension. The copy/get_rows shaders also don't require the extension. It's probably possible to fall back to non-coopmat and promote to fp32 when the extension isn't supported, but this change doesn't do that. The coopmat support also requires a glslc that supports the extension, which currently requires a custom build.

…pport Compile a variant of the scalar mul_mm shader that will promote the bf16 values to float, and use that when either the bf16 extension or the coopmat extensions aren't available.

jeffbolznv · 2025-04-22T17:03:35Z

glslc support has landed. I've rebased this change again and it's ready for review.

0cc4m

LGTM

jeffbolznv requested a review from 0cc4m March 24, 2025 21:16

github-actions bot added Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning labels Mar 24, 2025

jeffbolznv commented Mar 24, 2025

View reviewed changes

ggml/src/ggml-vulkan/vulkan-shaders/vulkan-shaders-gen.cpp Outdated Show resolved Hide resolved

jeffbolznv force-pushed the bfloat16_rebase branch from 9ffb02b to 9cd10f8 Compare April 1, 2025 19:35

jeffbolznv force-pushed the bfloat16_rebase branch from 02160db to 21e8793 Compare April 1, 2025 20:29

jeffbolznv force-pushed the bfloat16_rebase branch from f4d8c68 to 7c47fd0 Compare April 12, 2025 05:11

jeffbolznv added 4 commits April 22, 2025 10:11

vulkan: Support bf16 tensors without the bf16 extension or coopmat su…

60b5d31

…pport Compile a variant of the scalar mul_mm shader that will promote the bf16 values to float, and use that when either the bf16 extension or the coopmat extensions aren't available.

vulkan: bfloat16 fixes (really works without bfloat16 support now)

2917cad

vulkan: fix spirv-val failure and reenable -O

83771ec

jeffbolznv force-pushed the bfloat16_rebase branch from 7c47fd0 to 83771ec Compare April 22, 2025 17:02

jeffbolznv changed the title ~~Draft: vulkan: Add bfloat16 support~~ vulkan: Add bfloat16 support Apr 22, 2025

0cc4m approved these changes May 1, 2025

View reviewed changes

0cc4m merged commit 79f26e9 into ggml-org:master May 1, 2025
48 checks passed

0cc4m mentioned this pull request Jul 7, 2025

Compile bug: cannot compile get_rows_iq1_m #14542

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

vulkan: Add bfloat16 support #12554

vulkan: Add bfloat16 support #12554

Uh oh!

jeffbolznv commented Mar 24, 2025

Uh oh!

jeffbolznv commented Mar 24, 2025

Uh oh!

Uh oh!

jeffbolznv commented Apr 1, 2025

Uh oh!

jeffbolznv commented Apr 22, 2025

Uh oh!

0cc4m left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

vulkan: Add bfloat16 support #12554

vulkan: Add bfloat16 support #12554

Uh oh!

Conversation

jeffbolznv commented Mar 24, 2025

Uh oh!

jeffbolznv commented Mar 24, 2025

Uh oh!

Uh oh!

jeffbolznv commented Apr 1, 2025

Uh oh!

jeffbolznv commented Apr 22, 2025

Uh oh!

0cc4m left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants