Skip to content

Commit 13da4c1

Browse files
authored
patch service config identifier (#509)
1 parent 70d0e77 commit 13da4c1

File tree

2 files changed

+6
-2
lines changed

2 files changed

+6
-2
lines changed

charts/model-engine/templates/inference_framework_config.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
apiVersion: v1
22
kind: ConfigMap
33
metadata:
4-
name: model-engine-inference-framework-latest-config
4+
name: {{ include "modelEngine.fullname" . }}-inference-framework-latest-config
55
labels:
66
product: common
77
team: infra

model-engine/model_engine_server/domain/use_cases/llm_model_endpoint_use_cases.py

Lines changed: 5 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -251,7 +251,11 @@
251251
NUM_DOWNSTREAM_REQUEST_RETRIES = 80 # has to be high enough so that the retries take the 5 minutes
252252
DOWNSTREAM_REQUEST_TIMEOUT_SECONDS = 5 * 60 # 5 minutes
253253

254-
LATEST_INFERENCE_FRAMEWORK_CONFIG_MAP_NAME = "model-engine-inference-framework-latest-config"
254+
SERVICE_NAME = "model-engine"
255+
SERVICE_IDENTIFIER = os.getenv("SERVICE_IDENTIFIER")
256+
if SERVICE_IDENTIFIER:
257+
SERVICE_NAME += f"-{SERVICE_IDENTIFIER}"
258+
LATEST_INFERENCE_FRAMEWORK_CONFIG_MAP_NAME = f"{SERVICE_NAME}-inference-framework-latest-config"
255259

256260

257261
def count_tokens(input: str, model_name: str, tokenizer_repository: TokenizerRepository) -> int:

0 commit comments

Comments
 (0)