@alex-jw-brooks
Contributor

This PR updates the granite vision docs with a few changes, outlined below:

  • Updates the docs from the 3.1 preview to the newly released 3.2 2b model!
  • Updates the image grid pinpoints, which changed slightly from the 3.1 preview
  • Adds explicit guidance on quantization: the LLM can be quantized, but the visual encoder can't, because siglip has tensor dims that aren't divisible by 32. This could be a point of confusion for some
  • Changes the example image to the llama.cpp banner, since the current link is no longer valid and this model is well suited to things like doc understanding anyway
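
For anyone wondering why the divisibility matters, here is a minimal sketch of the constraint described in the quantization bullet. It assumes a block size of 32 and that the row length is the last dimension of the shape. The helper name, tensor names, and shapes are hypothetical and for illustration only; they are not read from the actual model files.

```python
# Assumed block size, taken from the "divisible by 32" note in the description.
BLOCK_SIZE = 32


def can_block_quantize(shape, block_size=BLOCK_SIZE):
    """Return True if the row length (last dim) splits evenly into blocks."""
    return shape[-1] % block_size == 0


# Hypothetical tensor names and shapes, for illustration only.
tensors = {
    "llm.ffn_up.weight": (8192, 2048),
    "vision.ffn_up.weight": (1152, 4304),
}

for name, shape in tensors.items():
    verdict = "quantizable" if can_block_quantize(shape) else "keep unquantized"
    print(f"{name}: {shape} -> {verdict}")
```

With these made-up shapes, the first tensor passes the check and the second does not, since 4304 leaves a remainder of 16 when divided by 32.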

@danbev can you please take a look when you have a moment? 🙂

@alex-jw-brooks force-pushed the granite_vision_doc_updates branch from 2534d2d to 07dab1b on February 28, 2025 07:40
@alex-jw-brooks force-pushed the granite_vision_doc_updates branch from 07dab1b to c716fb8 on February 28, 2025 07:42
@ericcurtin merged commit 84d5f4b into ggml-org:master on Feb 28, 2025
2 checks passed
mglambda pushed a commit to mglambda/llama.cpp that referenced this pull request Mar 8, 2025
arthw pushed a commit to arthw/llama.cpp that referenced this pull request Mar 19, 2025
mostlyuseful pushed a commit to mostlyuseful/llama.cpp that referenced this pull request May 12, 2025
3 participants