You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: deploy-manage/tools/snapshot-and-restore/s3-repository.md
+3Lines changed: 3 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -125,6 +125,9 @@ The following list contains the available S3 client settings. Those that must be
125
125
`max_retries`
126
126
: The number of retries to use when an S3 request fails. The default value is `3`.
127
127
128
+
`connection_max_idle_time`
129
+
: ([time value](elasticsearch://reference/elasticsearch/rest-apis/api-conventions.md#time-units)) The timeout after which {{es}} will close an idle connection. The default value is 60 seconds.
130
+
128
131
`path_style_access`
129
132
: Whether to force the use of the path style access pattern. If `true`, the path style access pattern will be used. If `false`, the access pattern will be automatically determined by the AWS Java SDK (See [AWS documentation](https://docs.aws.amazon.com/AWSJavaSDK/latest/javadoc/com/amazonaws/services/s3/AmazonS3Builder.html#setPathStyleAccessEnabled-java.lang.Boolean-) for details). Defaults to `false`.
@@ -107,37 +107,132 @@ By default, documents are split into sentences and grouped in sections up to 250
107
107
108
108
Several strategies are available for chunking:
109
109
110
-
`sentence`
111
-
: The `sentence` strategy splits the input text at sentence boundaries. Each chunk contains one or more complete sentences ensuring that the integrity of sentence-level context is preserved, except when a sentence causes a chunk to exceed a word count of `max_chunk_size`, in which case it will be split across chunks. The `sentence_overlap` option defines the number of sentences from the previous chunk to include in the current chunk which is either `0` or `1`.
110
+
#### `sentence`
112
111
113
-
`word`
114
-
: The `word` strategy splits the input text on individual words up to the `max_chunk_size` limit. The `overlap` option is the number of words from the previous chunk to include in the current chunk.
112
+
The `sentence` strategy splits the input text at sentence boundaries. Each chunk contains one or more complete sentences ensuring that the integrity of sentence-level context is preserved, except when a sentence causes a chunk to exceed a word count of `max_chunk_size`, in which case it will be split across chunks. The `sentence_overlap` option defines the number of sentences from the previous chunk to include in the current chunk which is either `0` or `1`.
115
113
116
-
`recursive`{applies_to}`stack: ga 9.1`
117
-
: The `recursive` strategy splits the input text based on a configurable list of separator patterns (for example, newlines or Markdown headers). The chunker applies these separators in order, recursively splitting any chunk that exceeds the `max_chunk_size` word limit. If no separator produces a small enough chunk, the strategy falls back to sentence-level splitting.
114
+
The following example creates an {{infer}} endpoint with the `elasticsearch` service that deploys the ELSER model and configures the chunking behavior with the `sentence` strategy.
118
115
119
-
`none` {applies_to}`stack: ga 9.1`
120
-
121
-
: The `none` strategy disables chunking and processes the entire input text as a single block, without any splitting or overlap. When using this strategy, you can instead [pre-chunk](https://www.elastic.co/docs/reference/elasticsearch/mapping-reference/semantic-text#auto-text-chunking) the input by providing an array of strings, where each element acts as a separate chunk to be sent directly to the inference service without further chunking.
116
+
```console
117
+
PUT _inference/sparse_embedding/sentence_chunks
118
+
{
119
+
"service": "elasticsearch",
120
+
"service_settings": {
121
+
"model_id": ".elser_model_2",
122
+
"num_allocations": 1,
123
+
"num_threads": 1
124
+
},
125
+
"chunking_settings": {
126
+
"strategy": "sentence",
127
+
"max_chunk_size": 100,
128
+
"sentence_overlap": 0
129
+
}
130
+
}
131
+
```
122
132
123
133
The default chunking strategy is `sentence`.
124
134
125
-
#### Example of configuring the chunking behavior
135
+
#### `word`
136
+
137
+
The `word` strategy splits the input text on individual words up to the `max_chunk_size` limit. The `overlap` option is the number of words from the previous chunk to include in the current chunk.
126
138
127
-
The following example creates an {{infer}} endpoint with the `elasticsearch` service that deploys the ELSER model by default and configures the chunking behavior.
139
+
The following example creates an {{infer}} endpoint with the `elasticsearch` service that deploys the ELSER model and configures the chunking behavior with the `word` strategy, setting a maximum of 120 words per chunk and an overlap of 40 words between chunks.
128
140
129
141
```console
130
-
PUT _inference/sparse_embedding/small_chunk_size
142
+
PUT _inference/sparse_embedding/word_chunks
131
143
{
132
144
"service": "elasticsearch",
133
145
"service_settings": {
146
+
"model_id": ".elser_model_2",
134
147
"num_allocations": 1,
135
148
"num_threads": 1
136
149
},
137
150
"chunking_settings": {
138
-
"strategy": "sentence",
139
-
"max_chunk_size": 100,
140
-
"sentence_overlap": 0
151
+
"strategy": "word",
152
+
"max_chunk_size": 120,
153
+
"overlap": 40
154
+
}
155
+
}
156
+
```
157
+
158
+
#### `recursive`
159
+
160
+
```{applies_to}
161
+
stack: ga 9.1`
162
+
```
163
+
164
+
The `recursive` strategy splits the input text based on a configurable list of separator patterns (for example, newlines or Markdown headers). The chunker applies these separators in order, recursively splitting any chunk that exceeds the `max_chunk_size` word limit. If no separator produces a small enough chunk, the strategy falls back to sentence-level splitting.
165
+
166
+
##### Markdown separator group
167
+
168
+
The following example creates an {{infer}} endpoint with the `elasticsearch` service that deploys the ELSER model and configures chunking with the `recursive` strategy using the markdown separator group and a maximum of 200 words per chunk.
169
+
170
+
```console
171
+
PUT _inference/sparse_embedding/recursive_markdown_chunks
172
+
{
173
+
"service": "elasticsearch",
174
+
"service_settings": {
175
+
"model_id": ".elser_model_2",
176
+
"num_allocations": 1,
177
+
"num_threads": 1
178
+
},
179
+
"chunking_settings": {
180
+
"strategy": "recursive",
181
+
"max_chunk_size": 200,
182
+
"separator_group": "markdown"
183
+
}
184
+
}
185
+
```
186
+
187
+
##### Custom separator group
188
+
189
+
The following example creates an {{infer}} endpoint with the `elasticsearch` service that deploys the ELSER model and configures chunking with the `recursive` strategy. It uses a custom list of separators to split plaintext into chunks of up to 180 words.
190
+
191
+
192
+
```console
193
+
PUT _inference/sparse_embedding/recursive_custom_chunks
194
+
{
195
+
"service": "elasticsearch",
196
+
"service_settings": {
197
+
"model_id": ".elser_model_2",
198
+
"num_allocations": 1,
199
+
"num_threads": 1
200
+
},
201
+
"chunking_settings": {
202
+
"strategy": "recursive",
203
+
"max_chunk_size": 180,
204
+
"separators": [
205
+
"^(#{1,6})\\s",
206
+
"\\n\\n",
207
+
"\\n[-*]\\s",
208
+
"\\n\\d+\\.\\s",
209
+
"\\n"
210
+
]
211
+
}
212
+
}
213
+
```
214
+
215
+
#### `none`
216
+
217
+
```{applies_to}
218
+
stack: ga 9.1`
219
+
```
220
+
221
+
The `none` strategy disables chunking and processes the entire input text as a single block, without any splitting or overlap. When using this strategy, you can instead [pre-chunk](https://www.elastic.co/docs/reference/elasticsearch/mapping-reference/semantic-text#auto-text-chunking) the input by providing an array of strings, where each element acts as a separate chunk to be sent directly to the inference service without further chunking.
222
+
223
+
The following example creates an {{infer}} endpoint with the `elasticsearch` service that deploys the ELSER model and disables chunking by setting the strategy to `none`.
*[Index privileges](/deploy-manage/users-roles/cluster-or-deployment-auth/elasticsearch-privileges.md#privileges-list-indices): `create_doc` and `create_index`
*[Index privileges](elasticsearch://reference/elasticsearch/security-privileges.md#privileges-list-indices): `create_doc` and `create_index`
23
23
24
24
* Familiarity with [time series data stream concepts](time-series-data-stream-tsds.md) and [{{es}} index and search basics](/solutions/search/get-started.md)
Copy file name to clipboardExpand all lines: release-notes/elastic-security/index.md
+10Lines changed: 10 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -27,6 +27,16 @@ To check for security updates, go to [Security announcements for the Elastic sta
27
27
28
28
% *
29
29
30
+
## 9.1.4 [elastic-security-9.1.4-release-notes]
31
+
32
+
### Features and enhancements [elastic-security-9.1.4-features-enhancements]
33
+
* Adds more Linux diagnostic process `ptrace` events.
34
+
35
+
### Fixes [elastic-security-9.1.4-fixes]
36
+
* Fixes a bug where the toggle column functionality only functioned on the **Alerts** page [#234278]({{kib-pull}}234278).
37
+
* Fixes a bug where Linux capabilities were included in {{elastic-endpoint}} network events despite being disabled.
38
+
* Makes the delivery of {{elastic-endpoint}} command line commands more robust. In rare cases, commands could previously fail due to interprocess communication issues.
0 commit comments