* Ci (#11) (#12)
* Fix cl (#7)
* Rename build-amd.yml to build-amd.yml.disabled
* Rename winget.yml to winget.yml.disabled
* Rename server.yml to server.yml.disabled
* Rename build.yml to build.yml.disabled
* Update release.yml
* Rename build-cmake-pkg.yml to build-cmake-pkg.yml.disabled
* Rename build-linux-cross.yml to build-linux-cross.yml.disabled
* Rename build-riscv-native.yml.disabled to build-riscv-native.yml
* Rename docker.yml.disabled to docker.yml
* Rename update-ops-docs.yml to update-ops-docs.yml.disabled
* Remove macOS-arm64 job from release workflow
Removed macOS-arm64 job and its associated steps from the release workflow.
* CUDA: Fix bug in topk-moe for gpt-oss (ggml-org#16821)
* CUDA: Fix bug in topk-moe for gpt-oss
When using ggml_can_fuse_subgraph, the output nodes passed to it are wrong. As a result, `test-backend-ops` still fuses the nodes (because they are not used elsewhere in the graph),
but the fusion does not happen in the actual gpt-oss graph.
* fix for qwen3 too
* change ifndef to ifdef
* vulkan: Call ggml_vk_buffer_write_2d from ggml_vk_buffer_copy (ggml-org#16793)
This lets the copy to the destination device use the host-visible
vidmem optimization.
---------
Co-authored-by: Aman Gupta
Co-authored-by: Jeff Bolz