feat: support scaling config for knative deployment types#179
Merged
oleksii-donets merged 18 commits intodevelopmentfrom Mar 6, 2026
Merged
feat: support scaling config for knative deployment types#179oleksii-donets merged 18 commits intodevelopmentfrom
oleksii-donets merged 18 commits intodevelopmentfrom
Conversation
Move scaling from base deployment/request/response classes into concrete child models and DTOs so MCP, Adapter, Interceptor, and Inference carry explicit scaling contracts. Made-with: Cursor
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Collaborator
|
Can you explain please what are the benefits of not keeping scaling in base deployment classes? |
Pasichniuk
reviewed
Mar 2, 2026
...ain/java/com/epam/aidial/deployment/manager/service/manifest/InferenceManifestGenerator.java
Outdated
Show resolved
Hide resolved
Pasichniuk
reviewed
Mar 2, 2026
src/main/java/com/epam/aidial/deployment/manager/service/manifest/KnativeManifestGenerator.java
Outdated
Show resolved
Hide resolved
Pasichniuk
reviewed
Mar 2, 2026
src/main/java/com/epam/aidial/deployment/manager/service/manifest/KnativeManifestGenerator.java
Outdated
Show resolved
Hide resolved
Pasichniuk
reviewed
Mar 2, 2026
src/main/java/com/epam/aidial/deployment/manager/service/manifest/KnativeManifestGenerator.java
Outdated
Show resolved
Hide resolved
Pasichniuk
reviewed
Mar 2, 2026
src/main/resources/db/migration/H2/V1.47__AddScalingToKnativeDeploymentTables.sql
Outdated
Show resolved
Hide resolved
Pasichniuk
reviewed
Mar 2, 2026
src/main/java/com/epam/aidial/deployment/manager/web/validation/ScalingValidator.java
Show resolved
Hide resolved
Pasichniuk
reviewed
Mar 2, 2026
...ain/java/com/epam/aidial/deployment/manager/service/deployment/KnativeDeploymentManager.java
Outdated
Show resolved
Hide resolved
Made-with: Cursor
Contributor
Author
It does not apply to NIM deployments yet |
Replace nested null-check + type-check with early return pattern in InferenceManifestGenerator and KnativeManifestGenerator. Made-with: Cursor
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Pasichniuk
reviewed
Mar 4, 2026
src/main/java/com/epam/aidial/deployment/manager/service/deployment/DeploymentService.java
Outdated
Show resolved
Hide resolved
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Move the scaling property from child-specific deployment entities, models, DTOs, and DB tables into the base deployment class hierarchy. This eliminates instanceof checks and switch statements for accessing scaling, as requested in PR #179 review. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
…ration Instead of adding scaling to child tables and then moving it, modify V1.47 to add scaling directly to the base deployment table and migrate existing inference_deployment scaling data. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Pasichniuk
reviewed
Mar 6, 2026
src/main/resources/db/migration/H2/V1.47__AddScalingToDeploymentTable.sql
Show resolved
Hide resolved
Pasichniuk
reviewed
Mar 6, 2026
...n/java/com/epam/aidial/deployment/manager/dao/entity/deployment/AdapterDeploymentEntity.java
Outdated
Show resolved
Hide resolved
Pasichniuk
reviewed
Mar 6, 2026
…d scaling to export/import test - Remove redundant @Getter, @Setter, @EqualsAndHashCode from Adapter and Interceptor classes that have no fields (inherited from parent @DaTa) - Add Scaling configuration to config export/import functional test Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Resolve merge conflicts in InferenceManifestGenerator and KnativeManifestGenerator: integrate ProgressDeadlineCalculator with the new Scaling-based approach. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Pasichniuk
reviewed
Mar 6, 2026
src/main/java/com/epam/aidial/deployment/manager/model/deployment/CreateDeployment.java
Outdated
Show resolved
Hide resolved
These fields were never used in production and are fully replaced by the Scaling object. Removed from entities, models, DTOs, validators, mappers, services, and all test resources. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Pasichniuk
reviewed
Mar 6, 2026
src/main/resources/db/migration/H2/V1.47__AddScalingToDeploymentTable.sql
Show resolved
Hide resolved
Add DROP COLUMN statements to V1.47 migration for H2, PostgreSQL, and MS SQL Server. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
Contributor
Dependency ReviewThe following issues were found:
License Issuessettings.gradle
OpenSSF Scorecard
Scanned Files
|
Pasichniuk
approved these changes
Mar 6, 2026
Collaborator
|
Please consider marking PR as a breaking change due to API changes & update description |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Applicable issues
Description of changes
deploymentDB table, so all deployment types share the same scaling contractinference_deploymentchild table to the basedeploymenttable (V1.47)initialScale/minScale/maxScalefields from entities, models, DTOs, validators, mappers, services, and all test resources — replaced entirely by theScalingobjectinitial_scale,min_scale,max_scalecolumns from thedeploymenttable (V1.47 migration)ValidScaleConfigurationannotation andScaleConfigurationValidator(no longer needed)ProgressDeadlineCalculatorwith the new scaling approach in manifest generatorsChecklist
By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.