Skip to content

Conversation

@jeffbolznv
Copy link
Collaborator

Given the increase in shared memory usage from the new quant formats, I think it'll be helpful to know the shared memory size when people report issues.

@jeffbolznv jeffbolznv requested a review from 0cc4m February 6, 2025 20:24
@github-actions github-actions bot added Vulkan Issues specific to the Vulkan backend ggml changes relating to the ggml tensor library for machine learning labels Feb 6, 2025
@0cc4m
Copy link
Collaborator

0cc4m commented Feb 6, 2025

maxComputeSharedMemorySize, alongside other properties and limits, is static for devices and available on http://vulkan.gpuinfo.org/ . Wouldn't that be enough?

@jeffbolznv
Copy link
Collaborator Author

There have been times where I couldn't find a device name in that database, or where it's not obvious which driver I should be looking for in case they report different limits. It's much nicer to just see the true value.

@0cc4m 0cc4m merged commit c026ba3 into ggml-org:master Feb 7, 2025
42 of 46 checks passed
tinglou pushed a commit to tinglou/llama.cpp that referenced this pull request Feb 13, 2025
orca-zhang pushed a commit to orca-zhang/llama.cpp that referenced this pull request Feb 26, 2025
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Feb 26, 2025
mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants