-
Notifications
You must be signed in to change notification settings - Fork 679
Qwen 2.5 VL #2868
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft
albert-inflection
wants to merge
64
commits into
meta-pytorch:main
Choose a base branch
from
InflectionAI:albert/qwen2.5-vl
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Draft
Qwen 2.5 VL #2868
Changes from 1 commit
Commits
Show all changes
64 commits
Select commit
Hold shift + click to select a range
ede1463
qwen 2.5 vl code skeleton
albert-inflection 3269545
model builder progress
albert-inflection 6b013ec
more model building progress
albert-inflection 5cb7421
airplane update
albert-inflection 6d09f1f
WIP transform + rope
1c5dd67
image transform progress
8992e50
image transform progress
albert-inflection 74614b2
Qwen2_5_VLImageTransform complete
lawrence-inflection 59fe9cd
remove context.md from tracking
lawrence-inflection f9cdb83
Qwen2_5_VLTransform implemented
lawrence-inflection 3032d75
module progress
c634a4b
batch size in ViT forward
423a268
rehaul modules, start from near HF
d3d4bd2
Rope + Window attn attempt 1
ad39ebb
_positional_embeddings.py implementation
lawrencefeng17 0193832
progress on _component_builders.py for decoder
lawrencefeng17 caa77ff
upstream cleanup
f1a235e
more cleanup
a2eacc9
merge temp branch onto albert/qwen2.5-vl
lawrencefeng17 16902fa
refactored Qwen25VLRotaryPositionalEmbeddings; passed test cases
lawrencefeng17 d4fb9c2
refactored Qwen25VLRotaryPositionalEmbeddings; added summary context.md
lawrencefeng17 f2c3a0e
feat: Qwen25VLEarlyFusionModel wrapper class
lawrencefeng17 896b070
rebase
3db79f9
clean up mlps
7024fdc
clean up encoder builder
20728a0
fix: removed raise condition; decoder bias fix
lawrencefeng17 bb3b4a6
checkpointing + edits
045f71b
init
b959286
convert weights final
7bf0a09
model builder slight fix
06ce596
fixes: minor changes, early end-to-end testing
lawrencefeng17 e8ab57c
fix: completely rewrote mrope
lawrencefeng17 4e44c1f
fix: minor fixes to mrope
lawrencefeng17 00e79f8
transform edits
257cbcf
feat: mrope cache implemented for decoder (#2)
lawrence-inflection 801efb4
encoder forward pass edits
3df44cf
bug fixes, training works now
albert-inflection cc52ebb
tested and fixed _transform
lawrence-inflection 5ab217b
weight saving fix + import
albert-inflection 4928249
Lawrence/qwen2.5 vl/encoder tests
lawrence-inflection 47a9e19
feat: added other qwen variants in model builders
lawrencefeng17 a8b00df
custom collation + init edits
albert-inflection e63202a
fix: removed default args to transform
lawrencefeng17 50314d3
nits
albert-inflection f6e75d3
7B config
albert-inflection b2b74bc
config nit
albert-inflection 767b025
added test cases in torchtune style
lawrencefeng17 e03eb9c
cleanup
albert-inflection a82e72c
rm uv.lock
albert-inflection 47c60c5
trainable params
albert-inflection df68e52
updated model builders
albert-inflection e98578c
rename rope
albert-inflection 346987b
cleanup
lawrencefeng17 9438ca8
fix
lawrencefeng17 23e0640
cleanup:
lawrencefeng17 1ff7ffa
3B recipe and model builder edit
albert-inflection e7c8b85
32B config and modelbuilder changes'
albert-inflection d5ff0e9
72B config
albert-inflection 43f1cbe
nit diffs
c09279c
fix padding token
d782bff
recipe reg
albert-inflection 5cac20b
fixed linter errors
lawrencefeng17 49698b2
linter fixes
lawrencefeng17 ee8ad1c
fixes to pass linter and all unit tests
lawrencefeng17 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Current workaround for passing window_index into the positional embedding module.