Skip to content

Commit 1c66d84

Browse files
authored
Update start-trained-model-deployment.asciidoc (#118887) (#118892)
Updating with changes in #115041
1 parent d9f98b8 commit 1c66d84

File tree

1 file changed

+4
-4
lines changed

1 file changed

+4
-4
lines changed

docs/reference/ml/trained-models/apis/start-trained-model-deployment.asciidoc

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -138,8 +138,8 @@ normal priority deployments.
138138
Controls how many inference requests are allowed in the queue at a time.
139139
Every machine learning node in the cluster where the model can be allocated
140140
has a queue of this size; when the number of requests exceeds the total value,
141-
new requests are rejected with a 429 error. Defaults to 1024. Max allowed value
142-
is 1000000.
141+
new requests are rejected with a 429 error. Defaults to 10000. Max allowed value
142+
is 100000.
143143

144144
`threads_per_allocation`::
145145
(Optional, integer)
@@ -173,7 +173,7 @@ The API returns the following results:
173173
"model_bytes": 265632637,
174174
"threads_per_allocation" : 1,
175175
"number_of_allocations" : 1,
176-
"queue_capacity" : 1024,
176+
"queue_capacity" : 10000,
177177
"priority": "normal"
178178
},
179179
"routing_table": {
@@ -229,4 +229,4 @@ POST _ml/trained_models/my_model/deployment/_start?deployment_id=my_model_for_se
229229
}
230230
}
231231
--------------------------------------------------
232-
// TEST[skip:TBD]
232+
// TEST[skip:TBD]

0 commit comments

Comments
 (0)