Skip to content
Discussion options

You must be logged in to vote

Yes, the arm binaries do have metal enabled.

Why are you only offloading 5 layers though? I see you used --gpulayers 5 but why? With unified memory you should do something more like --gpulayers 999

Let's try with a simple launch command

./koboldcpp-mac-arm64 --model models/Gemma-3-12b-it-MAX-HORROR-D_AU-Q6_K-imat.gguf --gpulayers 999

Tell me if this works

Replies: 4 comments 6 replies

Comment options

You must be logged in to vote
2 replies
@crystal-coding-time
Comment options

@LostRuins
Comment options

Answer selected by crystal-coding-time
Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
4 replies
@crystal-coding-time
Comment options

@crystal-coding-time
Comment options

@crystal-coding-time
Comment options

@LostRuins
Comment options

Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants