@vladislavdonchev vladislavdonchev commented Feb 28, 2025

Fellas, I gotta say - I grossly underestimated this task...

Work is still in progress in our fork here (comment here if you want to review the code, so we can update the PR):
https://github.com/Independent-AI-Labs/llama.cpp/commits/debug/build/

[Image: qwen25vl-arch-diagram]

Qwen2.5 VL is a total overhaul of the Qwen VLM and introduces a bunch of concepts that require special handling.
I've documented my approach in this paper:
https://github.com/Independent-AI-Labs/local-super-agents/blob/main/res/docs/papers/Implementing%20Qwen2.5VL.pdf

Comments & ideas are welcome!

@github-actions bot added the "examples" and "python" (python script changes) labels on Feb 28, 2025
@vladislavdonchev changed the title from "gguf-py: Support for Qwen2.5 VL - DRAFT (#11483)" to "clip / gguf-py: Support for Qwen2.5 VL - DRAFT (#11483)" on Mar 2, 2025
@vladislavdonchev changed the title from "clip / gguf-py: Support for Qwen2.5 VL - DRAFT (#11483)" to "clip.cpp / gguf-py: Support for Qwen2.5 VL - DRAFT (#11483)" on Mar 2, 2025
@vladislavdonchev changed the title from "clip.cpp / gguf-py: Support for Qwen2.5 VL - DRAFT (#11483)" to "clip.cpp / gguf-py: Support for Qwen2.5 VL - WIP / REVIEW NEEDED (#11483)" on Mar 3, 2025
@vladislavdonchev
Author

This will be reopened as a follow-up to the vision API refactor: #11292
