-
Notifications
You must be signed in to change notification settings - Fork 1.2k
feat: optimization technique related validations. #4921
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from 4 commits
Commits
Show all changes
28 commits
Select commit
Hold shift + click to select a range
7ec16e6
Enable quantization and compilation in the same optimization job via …
cf70f59
Require EULA acceptance when using a gated 1p draft model via ModelBu…
fcb5092
add accept_draft_model_eula to JumpStartModel when deployment config …
9489b8d
add map of valid optimization combinations
5512c26
Add ModelBuilder support for JumpStart-provided draft models.
c94a78b
Tweak draft model EULA validations and messaging. Remove redundant de…
d10c475
Add "Auto" speculative decoding ModelProvider option; add validations…
8fb27a0
Fix JumpStartModel.AdditionalModelDataSource model access config assi…
779f6d6
move the accept eula configurations into deploy flow
gwang111 aef3a90
Merge branch 'master' into QuicksilverV2
gwang111 b7b15b8
move the accept eula configurations into deploy flow
gwang111 748ea4b
Use correct bucket for SM/JS draft models and minor formatting/valida…
a7feb54
Remove obsolete docstring.
694b4f2
remove references to accept_draft_model_eula
gwang111 7b6aef1
renaming of eula fn and error msg
gwang111 ce47be5
Merge branch 'master' into QuicksilverV2
gwang111 1f75072
fix: pin testing deps (#4925)
benieric 277e0b1
Revert "change: add TGI 2.4.0 image uri (#4922)" (#4926)
Captainia 8f0083b
fix naming and messaging
gwang111 8b73f34
ModelBuilder speculative decoding UTs and minor fixes.
c06aef0
Merge branch 'master' into QuicksilverV2
gwang111 09a54dc
Fix set union.
3b147cd
add UTs for JumpStart deployment
gwang111 65cb5b3
fix formatting issues
gwang111 4d1e12b
address validation comments
gwang111 bf706ad
fix doc strings
gwang111 f121eb0
Add TRTLLM compilation + speculative decoding validation.
9148e70
address nits
gwang111 File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.