Commit 11c965e

Merge branch '9.1' into backport/9.1/pr-131767
2 parents 9160690 + adc1186 commit 11c965e

94 files changed, +3146 -529 lines changed

build-tools-internal/src/main/java/org/elasticsearch/gradle/internal/RestrictedBuildApiService.java

Lines changed: 0 additions & 3 deletions
@@ -59,9 +59,6 @@ private static ListMultimap<Class<?>, String> createLegacyRestTestBasePluginUsag
 map.put(LegacyRestTestBasePlugin.class, ":x-pack:plugin:deprecation:qa:early-deprecation-rest");
 map.put(LegacyRestTestBasePlugin.class, ":x-pack:plugin:deprecation:qa:rest");
 map.put(LegacyRestTestBasePlugin.class, ":x-pack:plugin:downsample:qa:with-security");
-map.put(LegacyRestTestBasePlugin.class, ":x-pack:plugin:enrich:qa:rest");
-map.put(LegacyRestTestBasePlugin.class, ":x-pack:plugin:enrich:qa:rest-with-advanced-security");
-map.put(LegacyRestTestBasePlugin.class, ":x-pack:plugin:enrich:qa:rest-with-security");
 map.put(LegacyRestTestBasePlugin.class, ":x-pack:plugin:ent-search:qa:rest");
 map.put(LegacyRestTestBasePlugin.class, ":x-pack:plugin:eql:qa:ccs-rolling-upgrade");
 map.put(LegacyRestTestBasePlugin.class, ":x-pack:plugin:eql:qa:correctness");

docs/changelog/130279.yaml

Lines changed: 5 additions & 0 deletions
@@ -0,0 +1,5 @@
+pr: 130279
+summary: Fix missing removal of query cancellation callback in QueryPhase
+area: Search
+type: bug
+issues: [130071]

docs/changelog/131658.yaml

Lines changed: 5 additions & 0 deletions
@@ -0,0 +1,5 @@
+pr: 131658
+summary: Fix `aggregate_metric_double` sorting and `mv_expand` issues
+area: ES|QL
+type: bug
+issues: []

docs/changelog/131680.yaml

Lines changed: 5 additions & 0 deletions
@@ -0,0 +1,5 @@
+pr: 131680
+summary: Grant server module read/write entitlements for deprecated path setting "path.shared_data"
+area: Infra/Core
+type: bug
+issues: []

docs/changelog/131817.yaml

Lines changed: 5 additions & 0 deletions
@@ -0,0 +1,5 @@
+pr: 131817
+summary: Change equals and hashcode for `ConstantNullBlock`
+area: ES|QL
+type: bug
+issues: []

docs/changelog/131917.yaml

Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+pr: 131917
+summary: Fix NPE on empty to_lower/to_upper call
+area: ES|QL
+type: bug
+issues:
+ - 131913

docs/changelog/131990.yaml

Lines changed: 6 additions & 0 deletions
@@ -0,0 +1,6 @@
+pr: 131990
+summary: Prevent the trained model deployment memory estimation from double-counting
+  allocations
+area: Machine Learning
+type: bug
+issues: []
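
Review note: the six changelog entries added above share one shape (`pr`, `summary`, `area`, `type`, `issues`). The following is a minimal Python sketch of a consistency check over that shape. The required-key set is inferred only from the entries visible in this commit, and `check_entry` is a hypothetical helper, not part of the Elasticsearch build tooling.

```python
# Keys observed in every changelog entry in this commit.
# This is an inference from the diff, not an official schema.
REQUIRED_KEYS = {"pr", "summary", "area", "type", "issues"}


def check_entry(entry: dict) -> list[str]:
    """Return a list of problems found in one changelog entry (empty if none)."""
    missing = sorted(REQUIRED_KEYS - set(entry))
    problems = [f"missing key: {key}" for key in missing]
    if "pr" in entry and not isinstance(entry["pr"], int):
        problems.append("pr should be an integer")
    if "issues" in entry and not isinstance(entry["issues"], list):
        problems.append("issues should be a list")
    return problems


# Entry transcribed by hand from docs/changelog/131917.yaml above.
entry_131917 = {
    "pr": 131917,
    "summary": "Fix NPE on empty to_lower/to_upper call",
    "area": "ES|QL",
    "type": "bug",
    "issues": [131913],
}
print(check_entry(entry_131917))
```

The entry is written as a dict rather than parsed from YAML so the sketch needs nothing beyond the standard library.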

docs/reference/elasticsearch/mapping-reference/semantic-text.md

Lines changed: 45 additions & 11 deletions
@@ -2,6 +2,9 @@
 navigation_title: "Semantic text"
 mapped_pages:
 - https://www.elastic.co/guide/en/elasticsearch/reference/current/semantic-text.html
+applies_to:
+  stack: ga 9.0
+  serverless: ga
 ---
 
 # Semantic text field type [semantic-text]
@@ -29,7 +32,8 @@ service.
 Using `semantic_text`, you won’t need to specify how to generate embeddings for
 your data, or how to index it. The {{infer}} endpoint automatically determines
 the embedding generation, indexing, and query to use.
-Newly created indices with `semantic_text` fields using dense embeddings will be
+
+{applies_to}`stack: ga 9.1` Newly created indices with `semantic_text` fields using dense embeddings will be
 [quantized](/reference/elasticsearch/mapping-reference/dense-vector.md#dense-vector-quantization)
 to `bbq_hnsw` automatically.
 
@@ -111,13 +115,13 @@ the [Create {{infer}} API](https://www.elastic.co/docs/api/doc/elasticsearch/ope
 to create the endpoint. If not specified, the {{infer}} endpoint defined by
 `inference_id` will be used at both index and query time.
 
-`index_options`
+`index_options` {applies_to}`stack: ga 9.1`
 : (Optional, object) Specifies the index options to override default values
 for the field. Currently, `dense_vector` index options are supported.
 For text embeddings, `index_options` may match any allowed
 [dense_vector index options](/reference/elasticsearch/mapping-reference/dense-vector.md#dense-vector-index-options).
 
-`chunking_settings`
+`chunking_settings` {applies_to}`stack: ga 9.1`
 : (Optional, object) Settings for chunking text into smaller passages.
 If specified, these will override the chunking settings set in the {{infer-cap}}
 endpoint associated with `inference_id`.
@@ -127,8 +131,8 @@ To completely disable chunking, use the `none` chunking strategy.
 
 **Valid values for `chunking_settings`**:
 
-`type`
-: Indicates the type of chunking strategy to use. Valid values are `none`, `word` or
+`strategy`
+: Indicates the strategy of chunking strategy to use. Valid values are `none`, `word` or
 `sentence`. Required.
 
 `max_chunk_size`
@@ -144,7 +148,8 @@ To completely disable chunking, use the `none` chunking strategy.
 or `1`. Required for `sentence` type chunking settings
 
 ::::{warning}
-When using the `none` chunking strategy, if the input exceeds the maximum token limit of the underlying model, some
+When using the `none` chunking strategy, if the input exceeds the maximum token
+limit of the underlying model, some
 services (such as OpenAI) may return an
 error. In contrast, the `elastic` and `elasticsearch` services will
 automatically truncate the input to fit within the
@@ -181,6 +186,15 @@ For more details on chunking and how to configure chunking settings,
 see [Configuring chunking](https://www.elastic.co/docs/api/doc/elasticsearch/group/endpoint-inference)
 in the Inference API documentation.
 
+Refer
+to [this tutorial](docs-content://solutions/search/semantic-search/semantic-search-semantic-text.md)
+to learn more about semantic search using `semantic_text`.
+
+### Pre-chunking [pre-chunking]
+```{applies_to}
+stack: ga 9.1
+```
+
 You can pre-chunk the input by sending it to Elasticsearch as an array of
 strings.
 Example:
@@ -227,10 +241,6 @@ PUT test-index/_doc/1
 * Others (such as `elastic` and `elasticsearch`) will automatically truncate
 the input.
 
-Refer
-to [this tutorial](docs-content://solutions/search/semantic-search/semantic-search-semantic-text.md)
-to learn more about semantic search using `semantic_text`.
-
 ## Extracting relevant fragments from semantic text [semantic-text-highlighting]
 
 You can extract the most relevant fragments from a semantic text field by using
@@ -294,8 +304,14 @@ specified. It enables you to quickstart your semantic search by providing
 automatic {{infer}} and a dedicated query so you don’t need to provide further
 details.
 
+### Customizing using `semantic_text` parameters [custom-by-parameters]
+```{applies_to}
+stack: ga 9.1
+```
+
 If you want to override those defaults and customize the embeddings that
-`semantic_text` indexes, you can do so by modifying [parameters](#semantic-text-params):
+`semantic_text` indexes, you can do so by
+modifying [parameters](#semantic-text-params):
 
 - Use `index_options` to specify alternate index options such as specific
 `dense_vector` quantization methods
@@ -326,6 +342,24 @@ PUT my-index-000004
 }
 ```
 
+### Customizing using ingest pipelines [custom-by-pipelines]
+```{applies_to}
+stack: ga 9.0
+```
+
+In case you want to customize data indexing, use the
+[`sparse_vector`](/reference/elasticsearch/mapping-reference/sparse-vector.md)
+or [`dense_vector`](/reference/elasticsearch/mapping-reference/dense-vector.md)
+field types and create an ingest pipeline with an
+[{{infer}} processor](/reference/enrich-processor/inference-processor.md) to
+generate the embeddings.
+[This tutorial](docs-content://solutions/search/semantic-search/semantic-search-inference.md)
+walks you through the process. In these cases - when you use `sparse_vector` or
+`dense_vector` field types instead of the `semantic_text` field type to
+customize indexing - using the
+[`semantic_query`](/reference/query-languages/query-dsl/query-dsl-semantic-query.md)
+is not supported for querying the field data.
+
 ## Updates to `semantic_text` fields [update-script]
 
 For indices containing `semantic_text` fields, updates that use scripts have the
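
Review note: the `semantic-text.md` changes above rename the chunking key from `type` to `strategy` and give pre-chunking its own section. A minimal Python sketch of how the two fit together follows: it builds a mapping body that disables chunking with the `none` strategy, then a document whose field value is an array of strings. The field name `my_semantic_field`, the endpoint name `my-inference-endpoint`, and both helper functions are placeholders invented for illustration. The sketch only checks the `strategy` value; other strategies need further settings (such as `max_chunk_size`) that it does not validate.

```python
import json

# Strategy values listed in the documentation diff above.
VALID_STRATEGIES = {"none", "word", "sentence"}


def semantic_text_mapping(field: str, inference_id: str, chunking_settings: dict) -> dict:
    """Build an index mapping body for one semantic_text field."""
    strategy = chunking_settings.get("strategy")
    if strategy not in VALID_STRATEGIES:
        raise ValueError(f"unsupported chunking strategy: {strategy!r}")
    return {
        "mappings": {
            "properties": {
                field: {
                    "type": "semantic_text",
                    "inference_id": inference_id,
                    "chunking_settings": dict(chunking_settings),
                }
            }
        }
    }


def pre_chunk(text: str) -> list[str]:
    """Naive pre-chunker: each non-empty, blank-line-separated paragraph is one chunk."""
    return [part.strip() for part in text.split("\n\n") if part.strip()]


mapping = semantic_text_mapping(
    "my_semantic_field", "my-inference-endpoint", {"strategy": "none"}
)
document = {"my_semantic_field": pre_chunk("first passage\n\nsecond passage")}

print(json.dumps(mapping, indent=2))
print(json.dumps(document))
```

With chunking disabled, each array element is sent to the model as-is, so the warning above about inputs exceeding the model's token limit applies to every chunk individually.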

docs/reference/elasticsearch/rest-apis/reciprocal-rank-fusion.md

Lines changed: 1 addition & 1 deletion
@@ -758,7 +758,7 @@ Highlighting on vector fields, using either the `knn` retriever or a `knn` query
 ::::
 
 
-A more specific example of highlighting in RRF can also be found in the [retrievers examples](docs-content://solutions/search/retrievers-examples.md#retrievers-examples-highlighting-retriever-results) page.
+A more specific example of highlighting in RRF can also be found in the [retrievers examples](retrievers/retrievers-examples.md#retrievers-examples-highlighting-retriever-results) page.
 
 
 ## Inner hits in RRF [_inner_hits_in_rrf]

docs/reference/elasticsearch/rest-apis/retrievers.md

Lines changed: 3 additions & 3 deletions
@@ -11,7 +11,7 @@ applies_to:
 A retriever is a specification to describe top documents returned from a search. A retriever replaces other elements of the [search API](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-search) that also return top documents such as [`query`](/reference/query-languages/querydsl.md) and [`knn`](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-search#search-api-knn). A retriever may have child retrievers where a retriever with two or more children is considered a compound retriever. This allows for complex behavior to be depicted in a tree-like structure, called the retriever tree, which clarifies the order of operations that occur during a search.
 
 ::::{tip}
-Refer to [*Retrievers*](docs-content://solutions/search/retrievers-overview.md) for a high level overview of the retrievers abstraction. Refer to [Retrievers examples](docs-content://solutions/search/retrievers-examples.md) for additional examples.
+Refer to [*Retrievers*](docs-content://solutions/search/retrievers-overview.md) for a high level overview of the retrievers abstraction. Refer to [Retrievers examples](retrievers/retrievers-examples.md) for additional examples.
 
 ::::
 
@@ -238,5 +238,5 @@ Note, however, that wildcard field patterns will only resolve to fields that eit
 
 ### Examples
 
-- [RRF with the multi-field query format](docs-content://solutions/search/retrievers-examples.md#retrievers-examples-rrf-multi-field-query-format)
-- [Linear retriever with the multi-field query format](docs-content://solutions/search/retrievers-examples.md#retrievers-examples-linear-multi-field-query-format)
+- [RRF with the multi-field query format](retrievers/retrievers-examples.md#retrievers-examples-rrf-multi-field-query-format)
+- [Linear retriever with the multi-field query format](retrievers/retrievers-examples.md#retrievers-examples-linear-multi-field-query-format)
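
Review note: the four link edits in `reciprocal-rank-fusion.md` and `retrievers.md` apply one substitution: the cross-repository target `docs-content://solutions/search/retrievers-examples.md` becomes the relative path `retrievers/retrievers-examples.md`, with any `#anchor` kept. A minimal Python sketch of that substitution follows. The commit does not say how the edits were made, and `relink` is a hypothetical helper.

```python
OLD_TARGET = "docs-content://solutions/search/retrievers-examples.md"
NEW_TARGET = "retrievers/retrievers-examples.md"


def relink(markdown: str) -> str:
    """Swap the old link target for the relative one, leaving anchors untouched."""
    # Matching on "](" restricts the change to Markdown link targets.
    return markdown.replace(f"]({OLD_TARGET}", f"]({NEW_TARGET}")


before = (
    "- [RRF with the multi-field query format]"
    "(docs-content://solutions/search/retrievers-examples.md"
    "#retrievers-examples-rrf-multi-field-query-format)"
)
print(relink(before))
```

Links to other `docs-content://` pages, such as the retrievers overview, do not match the old target and pass through unchanged, which agrees with the tip block in the first hunk.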
