-
Notifications
You must be signed in to change notification settings - Fork 16
Load model from torchstore into vLLM #55
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 13 commits
Commits
Show all changes
37 commits
Select commit
Hold shift + click to select a range
46153ea
initial testing
a0fc785
more testing
9b6fa9f
init works
f26d829
use instruct in path
8f25f61
somewhat working
006f27e
kinda working, memory/timeout issue
10cce6b
store and load working!
1e4205c
clean up logging
d8de194
sharded working
aa916eb
it's working? but _get_tensor_parallel_sharding_strategy is hacky:
32f1683
it's working
a39444d
some cleanups
55c6a49
more clean up
52bbf3b
clean ups
082b138
get rid of if else logic
44caf68
mapping
e69dbcd
mostly working
08ba23e
mostly working 2
c5dd764
mostly working 3
4743217
single test passes
dd36d73
single and fsdp works with calculated sharding
ac6a212
convert from script to test
b944a2e
cleaning things up
8bb9710
more cleaning up
8d029f5
move sharding to helper
a3355f5
move sharding to helper 2
6fed9b6
refactor
6e36dd3
use sharding class in policy and test
a78be1b
renames
300fe86
use test fixture
6003b12
use helper in test
ec07ba9
remove extra comments
e0a1797
remove extra load
5af98a1
clean up prints
00c4a03
requested changes
d0fb772
requested changes 2
bdd2507
use remote dir
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.