[newb] llama.cpp crashes with gpt-oss 20B model on Tesla V100, but works fine with Llama 3.1 8B #15119

Answered by slaren
antonkratz asked this question in Q&A

This looks like a failure to allocate virtual memory. It may work if you build with -DGGML_CUDA_NO_VMM=ON.
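For reference, a rebuild with that option might look roughly like the sketch below. It assumes a CMake-based CUDA build run from the root of a llama.cpp checkout; the build directory name `build-no-vmm` is a hypothetical choice, and the guard simply prints the command when run outside a source tree. The comment on what the option does reflects my understanding of it and is not a statement from the answer.

```shell
# Run from the root of a llama.cpp checkout (assumed location; adjust as needed).
# GGML_CUDA=ON enables the CUDA backend; GGML_CUDA_NO_VMM=ON is the option
# suggested above, understood here to build without CUDA virtual memory management.
CMAKE_FLAGS="-DGGML_CUDA=ON -DGGML_CUDA_NO_VMM=ON"
BUILD_DIR="build-no-vmm"   # hypothetical name, kept separate from any existing build

if [ -f CMakeLists.txt ] && command -v cmake >/dev/null 2>&1; then
    # Configure into a fresh directory so cached options from an earlier
    # configure step do not carry over.
    cmake -B "$BUILD_DIR" $CMAKE_FLAGS
    cmake --build "$BUILD_DIR" --config Release
else
    echo "Not in a CMake source tree; would run: cmake -B $BUILD_DIR $CMAKE_FLAGS"
fi
```

Using a fresh build directory avoids reusing a CMake cache in which the option is still off. Whether this resolves the crash on a given V100 setup is not guaranteed; the answer only says it may work.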

Answer selected by antonkratz