deploy-manage/autoscaling/trained-model-autoscaling.md
+4 -24 (4 additions & 24 deletions)
@@ -81,7 +81,7 @@ Refer to the tables in the [Model deployment resource matrix](#model-deployment-
In {{serverless-full}}, Search projects are given access to more processing resources, while Security and Observability projects have lower limits. This difference is reflected in the UI configuration: Search projects have higher resource limits compared to Security and Observability projects to accommodate their more complex operations.
-On {{serverless-short}}, adaptive allocations are automatically enabled for all project types. However, the "Adaptive resources" control is not displayed in {{kib}} for Observability and Security projects.
+On {{serverless-short}}, adaptive allocations are automatically enabled for all project types.
## Model deployment resource matrix [model-deployment-resource-matrix]
@@ -152,16 +152,6 @@ In case of ingest-optimized deployments, we maximize the number of model allocat
:::
-:::{tab-item} {{serverless-short}}
-
-| Level | Allocations | Threads | VCUs |
-| --- | --- | --- | --- |
-| Low | Exactly 2 | 1 | 16 |
-| Medium | Exactly 32 | 1 | 256 |
-| High | 512 for Search<br> No static allocations for Security and Observability<br> | 1 | 4096 for Search<br> No static allocations for Security and Observability<br> |
-
-:::
-
::::
### Search optimized
@@ -188,9 +178,9 @@ In case of search-optimized deployments, we maximize the number of threads. The
| Level | Allocations | Threads | VCUs |
| --- | --- | --- | --- |
-| Low | 0 to 1 dynamically |Always 2 | 0 to 16 dynamically |
-| Medium |1 to 2 (if threads=16), dynamically | Maximum (for example, 16) | 8 to 256 dynamically |
-| High |1 to 32 (if threads=16), dynamically<br> 1 to 128 for Security and Observability<br> |Maximum (for example, 16) | 8 to 4096 for Search<br> 8 to 1024 for Security and Observability<br> |
+| Low | 0 to 1 dynamically | 2 | 0 to 16 dynamically |
+| Medium |0 to 2 dynamically for Search and Observability<br> 1 to 2 dynamically for Security | 4 | 0 to 256 dynamically for Search and Observability<br> 8 to 256 dynamically for Security|
+| High |0 to 32 dynamically for Search and Observability<br> 1 to 128 dynamically for Security<br> |8 | 0 to 4096 dynamically for Search<br> 0 to 1024 dynamically for Observability<br>8 to 1024 dynamically for Security|
:::
@@ -212,14 +202,4 @@ In case of search-optimized deployments, we maximize the number of threads. The
:::
-:::{tab-item} {{serverless-short}}
-
-| Level | Allocations | Threads | VCUs |
-| --- | --- | --- | --- |
-| Low | 1 statically | Always 2 | 16 |
-| Medium | 2 statically (if threads=16) | Maximum (for example, 16) | 256 |
-| High | 32 statically (if threads=16) for Search<br> No static allocations for Security and Observability<br> | Maximum (for example, 16) | 4096 for Search<br> No static allocations for Security and Observability<br> |