Commit 6cccaef
committed
vulkan: Update topk_moe fusion to handle gpt's late softmax
Based on ggml-org#16649.1 parent 73a48c9 commit 6cccaef
File tree
2 files changed
+251
-113
lines changed- ggml/src/ggml-vulkan
- vulkan-shaders
2 files changed
+251
-113
lines changed
0 commit comments