- 
                Notifications
    You must be signed in to change notification settings 
- Fork 13.5k
[model] add support for qwen3vl series #16780
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
          
     Merged
      
      
    
      
        
          +1,125
        
        
          −97
        
        
          
        
      
    
  
  
     Merged
                    Changes from 21 commits
      Commits
    
    
            Show all changes
          
          
            24 commits
          
        
        Select commit
          Hold shift + click to select a range
      
      1e4fd19
              
                support qwen3vl series.
              
              
                JJJYmmm f84bd67
              
                bugfix: fix the arch check for qwen3vl-moe.
              
              
                JJJYmmm 0443a09
              
                use build_ffn
              
              
                JJJYmmm 3271877
              
                optimize deepstack structure
              
              
                JJJYmmm f321b9f
              
                optimize deepstack feature saving
              
              
                JJJYmmm b0169b7
              
                Merge branch 'ggml-org:master' into add_qwen3vl
              
              
                JJJYmmm 794481e
              
                Revert "optimize deepstack feature saving" for temporal fix
              
              
                JJJYmmm 0b37fff
              
                code clean
              
              
                JJJYmmm 2be9279
              
                use fused qkv in clip
              
              
                JJJYmmm 2a66926
              
                clean up / rm is_deepstack_layers for simplification
              
              
                ngxson 52e3dac
              
                Merge branch 'master' into add_qwen3vl
              
              
                ngxson 96037a8
              
                add test model
              
              
                ngxson 473ee90
              
                move test model to "big" section
              
              
                ngxson e9a167b
              
                fix imrope check
              
              
                JJJYmmm 69e26ff
              
                remove trailing whitespace
              
              
                JJJYmmm 0518b0a
              
                fix rope fail
              
              
                JJJYmmm 6a0191a
              
                metal : add imrope support
              
              
                ggerganov 10ce7fb
              
                add imrope support for sycl
              
              
                JJJYmmm cbca610
              
                vulkan: add imrope w/o check
              
              
                JJJYmmm 950c764
              
                fix vulkan
              
              
                JJJYmmm 19a458f
              
                Merge branch 'master' into add_qwen3vl
              
              
                JJJYmmm 0bed5d8
              
                webgpu: add imrope w/o check
              
              
                JJJYmmm b338010
              
                Update gguf-py/gguf/tensor_mapping.py
              
              
                JJJYmmm 7d9c149
              
                fix tensor mapping
              
              
                JJJYmmm File filter
Filter by extension
Conversations
          Failed to load comments.   
        
        
          
      Loading
        
  Jump to
        
          Jump to file
        
      
      
          Failed to load files.   
        
        
          
      Loading
        
  Diff view
Diff view
          Some comments aren't visible on the classic Files Changed page.
        
There are no files selected for viewing
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
              
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
              
  
    
      This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
      Learn more about bidirectional Unicode characters
    
  
  
    
              
              
      
      Oops, something went wrong.
        
    
  
  Add this suggestion to a batch that can be applied as a single commit.
  This suggestion is invalid because no changes were made to the code.
  Suggestions cannot be applied while the pull request is closed.
  Suggestions cannot be applied while viewing a subset of changes.
  Only one suggestion per line can be applied in a batch.
  Add this suggestion to a batch that can be applied as a single commit.
  Applying suggestions on deleted lines is not supported.
  You must change the existing code in this line in order to create a valid suggestion.
  Outdated suggestions cannot be applied.
  This suggestion has been applied or marked resolved.
  Suggestions cannot be applied from pending reviews.
  Suggestions cannot be applied on multi-line comments.
  Suggestions cannot be applied while the pull request is queued to merge.
  Suggestion cannot be applied right now. Please check back later.
  
    
  
    
Uh oh!
There was an error while loading. Please reload this page.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(No actions is needed, just a side note here)
The
is_deepstack_layersmetadata is no longer being used inclip.cpp, as I want to make the code more simple to maintain. We now use the same logic as MoE in llama.cpp, where if the tensor is not present, it will benullptr, and this will trigger the code branch for deepstack layersBu we will still keep this metadata in GGUF for future use