Skip to content

Commit 8c4fd6d

Browse files
Add timeout parameter to inference endpoint settings and update request examples
1 parent 71f0186 commit 8c4fd6d

File tree

6 files changed

+23
-5
lines changed

6 files changed

+23
-5
lines changed

specification/_json_spec/inference.put_openshift_ai.json

Lines changed: 7 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -31,6 +31,13 @@
3131
"body": {
3232
"description": "The inference endpoint's task and service settings",
3333
"required": true
34+
},
35+
"params": {
36+
"timeout": {
37+
"type": "time",
38+
"description": "Specifies the amount of time to wait for the inference endpoint to be created.",
39+
"default": "30s"
40+
}
3441
}
3542
}
3643
}

specification/inference/put_openshift_ai/examples/request/PutOpenShiftAiRequestExample1.yaml

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,7 @@
11
summary: A text embedding task
2-
description: Run `PUT _inference/text_embedding/openshift-ai-text-embedding` to create an inference endpoint that performs a `text_embedding` task.
2+
description:
3+
Run `PUT _inference/text_embedding/openshift-ai-text-embedding` to create an inference endpoint
4+
that performs a `text_embedding` task.
35
method_request: 'PUT _inference/text_embedding/openshift-ai-text-embedding'
46
# type: "request"
57
value: |-

specification/inference/put_openshift_ai/examples/request/PutOpenShiftAiRequestExample2.yaml

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,7 @@
11
summary: A completion task
2-
description: Run `PUT _inference/completion/openshift-ai-completion` to create an inference endpoint that performs a `completion` task.
2+
description:
3+
Run `PUT _inference/completion/openshift-ai-completion` to create an inference endpoint
4+
that performs a `completion` task.
35
method_request: 'PUT _inference/completion/openshift-ai-completion'
46
# type: "request"
57
value: |-

specification/inference/put_openshift_ai/examples/request/PutOpenShiftAiRequestExample3.yaml

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,7 @@
11
summary: A chat completion task
2-
description: Run `PUT _inference/chat_completion/openshift-ai-chat-completion` to create an inference endpoint that performs a `chat_completion` task.
2+
description:
3+
Run `PUT _inference/chat_completion/openshift-ai-chat-completion` to create an inference endpoint
4+
that performs a `chat_completion` task.
35
method_request: 'PUT _inference/chat_completion/openshift-ai-chat-completion'
46
# type: "request"
57
value: |-

specification/inference/put_openshift_ai/examples/request/PutOpenShiftAiRequestExample4.yaml

Lines changed: 3 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,7 @@
11
summary: A rerank task
2-
description: Run `PUT _inference/rerank/openshift-ai-rerank` to create an inference endpoint that performs a `rerank` task.
2+
description:
3+
Run `PUT _inference/rerank/openshift-ai-rerank` to create an inference endpoint
4+
that performs a `rerank` task.
35
method_request: 'PUT _inference/rerank/openshift-ai-rerank'
46
# type: "request"
57
value: |-

specification/inference/put_openshift_ai/examples/request/PutOpenShiftAiRequestExample5.yaml

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,8 @@
11
summary: A rerank task with custom `task_settings` and omitted `model_id`
2-
description: Run `PUT _inference/rerank/openshift-ai-rerank` to create an inference endpoint that performs a `rerank` task, specifying custom `task_settings` and omitting the `model_id` if deployed model doesn't require it.
2+
description:
3+
Run `PUT _inference/rerank/openshift-ai-rerank` to create an inference endpoint
4+
that performs a `rerank` task, specifying custom `task_settings` and omitting the `model_id`
5+
if deployed model doesn't require it.
36
method_request: 'PUT _inference/rerank/openshift-ai-rerank'
47
# type: "request"
58
value: |-

0 commit comments

Comments
 (0)