File tree Expand file tree Collapse file tree 1 file changed +4
-4
lines changed
docs/reference/ml/trained-models/apis Expand file tree Collapse file tree 1 file changed +4
-4
lines changed Original file line number Diff line number Diff line change @@ -138,8 +138,8 @@ normal priority deployments.
138138Controls how many inference requests are allowed in the queue at a time.
139139Every machine learning node in the cluster where the model can be allocated
140140has a queue of this size; when the number of requests exceeds the total value,
141- new requests are rejected with a 429 error. Defaults to 1024 . Max allowed value
142- is 1000000 .
141+ new requests are rejected with a 429 error. Defaults to 10000 . Max allowed value
142+ is 100000 .
143143
144144`threads_per_allocation`::
145145(Optional, integer)
@@ -173,7 +173,7 @@ The API returns the following results:
173173 "model_bytes": 265632637,
174174 "threads_per_allocation" : 1,
175175 "number_of_allocations" : 1,
176- "queue_capacity" : 1024 ,
176+ "queue_capacity" : 10000 ,
177177 "priority": "normal"
178178 },
179179 "routing_table": {
@@ -229,4 +229,4 @@ POST _ml/trained_models/my_model/deployment/_start?deployment_id=my_model_for_se
229229 }
230230}
231231--------------------------------------------------
232- // TEST[skip:TBD]
232+ // TEST[skip:TBD]
You can’t perform that action at this time.
0 commit comments