-
Notifications
You must be signed in to change notification settings - Fork 222
[tx] WIP: Add MaxText backend support #788
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Draft
OhadRubin
wants to merge
19
commits into
NovaSky-AI:main
Choose a base branch
from
OhadRubin:maxtext_backend
base: main
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
+2,263
−637
Draft
Changes from all commits
Commits
Show all changes
19 commits
Select commit
Hold shift + click to select a range
119bc92
Refactor TinkerEngine to use backend architecture
OhadRubin 77ea8b2
Enhance TinkerEngine and backend integration
OhadRubin 2bab2cf
Add MaxTextBackend support to TinkerEngine
OhadRubin f25653a
Add maxtext dependency and improve engine configuration
OhadRubin 228c5fb
Add convert_maxtext_lora_to_hf function
OhadRubin 9177aed
Update skyrl-tx/tx/tinker/backends/maxtext.py
OhadRubin 6842b1f
Update skyrl-tx/tx/tinker/api.py
OhadRubin 7d75ce6
Update skyrl-tx/tx/tinker/backends/native.py
OhadRubin bdd8039
Update skyrl-tx/tx/tinker/backends/native.py
OhadRubin 282d79a
Pin maxtext dependency to specific commit
OhadRubin e6a7013
Merge branch 'maxtext_backend' of https://github.com/OhadRubin/SkyRL …
OhadRubin b9510f4
Remove duplicate pad_batch, import from utils
OhadRubin b1e1508
Remove hardcoded path fallback in _get_maxtext_base_config_path
OhadRubin 3bde089
Fix MaxText backend method signatures to match AbstractBackend
OhadRubin 56fdd40
Enhance TinkerEngine and backend functionality
OhadRubin 725f3cb
Implement LoRA weight reset functionality in MaxText backend
OhadRubin 9984637
Add TTL storage structure and update NativeBackend for eager sharding
OhadRubin 8cc0546
Add sampler checkpoint eviction logic and enhance MaxText backend
OhadRubin 58d52d5
Refactor retry logic in ExternalInferenceClient for improved error ha…
OhadRubin File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,7 @@ | ||
| """Tinker engine backends.""" | ||
|
|
||
| from tx.tinker.backends.backend import AbstractBackend | ||
| from tx.tinker.backends.native import NativeBackend | ||
| from tx.tinker.backends.maxtext import MaxTextBackend, parse_maxtext_config | ||
|
|
||
| __all__ = ["AbstractBackend", "NativeBackend", "MaxTextBackend", "parse_maxtext_config"] |
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The
requires-pythonhas been changed to==3.12.*, which is very restrictive. This will prevent users on other Python 3.12 patch versions or future minor versions from using the library. Unless there's a strong reason for this exact version, consider using a more flexible specifier like>=3.12.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
maxtext is annoying and requires python 3.12, actually, i haven't tested it with python 3.13