Merged
Changes from 23 commits
34 commits
854bc26
Added inner hits builder to semantic query
Mikep86 Aug 12, 2024
733cae8
Pass inner hit builder to nested query builder
Mikep86 Aug 12, 2024
ada0b3a
Added InnerChunkBuilder
Mikep86 Aug 12, 2024
a8fac5f
Update InnerChunkBuilder to not inherit from InnerHitBuilder
Mikep86 Aug 12, 2024
557dc9a
Hard-code name in InnerChunkBuilder
Mikep86 Aug 12, 2024
8e94854
Updated semantic query builder tests
Mikep86 Aug 12, 2024
a860e69
Added YAML tests
Mikep86 Aug 13, 2024
dac2bd4
Resolved TODOs
Mikep86 Aug 13, 2024
dd67452
Update docs/changelog/111834.yaml
Mikep86 Aug 13, 2024
9d5fa1d
Fixed changelog
Mikep86 Aug 13, 2024
639adad
Set inner chunk builder name based on field name
Mikep86 Aug 13, 2024
4b8a62b
Add YAML test for querying multiple semantic text fields with inner c…
Mikep86 Aug 13, 2024
9311cc1
Fix YAML tests
Mikep86 Aug 13, 2024
df127b9
Rename inner_chunks to chunks
Mikep86 Aug 13, 2024
202314c
Fail the semantic query request if the transport version is not compa…
Mikep86 Aug 13, 2024
17e8edb
YAML test updates
Mikep86 Aug 13, 2024
91add83
Exclude embeddings from inner hit _source output
Mikep86 Aug 14, 2024
ae898fd
Updated YAML tests to check that embeddings are not in inner hits _so…
Mikep86 Aug 14, 2024
23d3344
Updated semantic query documentation
Mikep86 Aug 14, 2024
43b0a7f
Fix link
Mikep86 Aug 14, 2024
91f21f9
Merge branch 'main' into semantic-query_inner-hits
Mikep86 Aug 14, 2024
a5a03d9
Docs adjustments
Mikep86 Aug 14, 2024
1982eda
Fix headings
Mikep86 Aug 14, 2024
b0244f1
Merge branch 'main' into semantic-query_inner-hits
Mikep86 Aug 14, 2024
a5ee5d8
Merge branch 'main' into semantic-query_inner-hits
Mikep86 Sep 24, 2024
e28e72f
Added cluster feature for semantic text inner hits support
Mikep86 Sep 24, 2024
ee95981
Merge branch 'main' into semantic-query_inner-hits
Mikep86 Sep 24, 2024
8c73841
Rename chunks param to inner_hits
Mikep86 Sep 25, 2024
b779e29
Update documentation to address feedback and rename chunks to inner_hits
Mikep86 Sep 25, 2024
9f42742
Added reason for skipping doc tests
Mikep86 Sep 25, 2024
a19fd6e
Added "Query semantic text field in object with inner hits" YAML test
Mikep86 Sep 25, 2024
bb95eee
Merge branch 'main' into semantic-query_inner-hits
Mikep86 Sep 25, 2024
f62649d
Merge branch 'main' into semantic-query_inner-hits
Mikep86 Sep 25, 2024
3cbd7a5
PR feedback
Mikep86 Sep 25, 2024
5 changes: 5 additions & 0 deletions docs/changelog/111834.yaml
@@ -0,0 +1,5 @@
pr: 111834
summary: Add inner hits support to semantic query
area: Search
type: enhancement
issues: []
206 changes: 203 additions & 3 deletions docs/reference/query-dsl/semantic-query.asciidoc
@@ -40,9 +40,209 @@ The `semantic_text` field to perform the query on.
(Required, string)
The query text to be searched for on the field.

`chunks`::
(Optional, object)
The passage ranking configuration.
See <<semantic-query-passage-ranking, Passage ranking with the `semantic` query>> for more information.
+
.Properties of `chunks`
[%collapsible%open]
Member

I don't think we usually collapse query DSL properties, based on looking at a few other pages. It's probably not a huge deal but inconsistent with pages like knn and text_expansion in the same grouping. Those pages also aren't using the .Properties syntax. I'll defer to docs experts on which way is "right" 🙂

Contributor Author

I like the look of the collapsible blocks better personally, that's why I went with it :)

@leemthompo Any guidance you can offer here?

====
`from`::
(Optional, integer)
The offset from the first chunk to fetch.
Used to paginate through the chunks.
Defaults to `0`.

`size`::
(Optional, integer)
The maximum number of chunks to return.
Defaults to `3`.
====

Refer to <<semantic-search-semantic-text,this tutorial>> to learn more about semantic search using `semantic_text` and `semantic` query.

[discrete]
[[semantic-query-passage-ranking]]
==== Passage ranking with the `semantic` query
The `chunks` parameter can be used for _passage ranking_, which allows you to determine which chunk(s) in the document best match the query.
For example, if you have a document that covers varying topics:

[source,console]
------------------------------------------------------------
POST my-index/_doc/lake_tahoe
{
  "inference_field": [
    "Lake Tahoe is the largest alpine lake in North America",
    "When hiking in the area, please be on alert for bears"
  ]
}
------------------------------------------------------------
// TEST[skip:TBD]

You can use passage ranking to find the chunk that best matches your query:

[source,console]
------------------------------------------------------------
GET my-index/_search
{
  "query": {
    "semantic": {
      "field": "inference_field",
      "query": "mountain lake",
      "chunks": { }
    }
  }
}
------------------------------------------------------------
// TEST[skip:TBD]

[source,console-result]
------------------------------------------------------------
{
  "took": 67,
  "timed_out": false,
  "_shards": {
    "total": 1,
    "successful": 1,
    "skipped": 0,
    "failed": 0
  },
  "hits": {
    "total": {
      "value": 1,
      "relation": "eq"
    },
    "max_score": 10.844536,
    "hits": [
      {
        "_index": "my-index",
        "_id": "lake_tahoe",
        "_score": 10.844536,
        "_source": {
          ...
        },
        "inner_hits": { <1>
          "inference_field": {
            "hits": {
              "total": {
                "value": 2,
                "relation": "eq"
              },
              "max_score": 10.844536,
              "hits": [
                {
                  "_index": "my-index",
                  "_id": "lake_tahoe",
                  "_nested": {
                    "field": "inference_field.inference.chunks",
                    "offset": 0
                  },
                  "_score": 10.844536,
                  "_source": {
                    "text": "Lake Tahoe is the largest alpine lake in North America"
                  }
                },
                {
                  "_index": "my-index",
                  "_id": "lake_tahoe",
                  "_nested": {
                    "field": "inference_field.inference.chunks",
                    "offset": 1
                  },
                  "_score": 3.2726858,
                  "_source": {
                    "text": "When hiking in the area, please be on alert for bears"
                  }
                }
              ]
            }
          }
        }
      }
    ]
  }
}
------------------------------------------------------------
<1> Ranked passages will be returned using the <<inner-hits,`inner_hits` response format>>, with `<inner_hits_name>` set to the `semantic_text` field name.

By default, the top three matching chunks will be returned.
You can use the `size` parameter to control the number of chunks returned and the `from` parameter to page through the matching chunks:

[source,console]
------------------------------------------------------------
GET my-index/_search
{
"query": {
"semantic": {
"field": "inference_field",
"query": "mountain lake",
"chunks": {
"from": 1,
"size": 1
}
}
}
}
------------------------------------------------------------
// TEST[skip:TBD]

[source,console-result]
------------------------------------------------------------
{
  "took": 42,
  "timed_out": false,
  "_shards": {
    "total": 1,
    "successful": 1,
    "skipped": 0,
    "failed": 0
  },
  "hits": {
    "total": {
      "value": 1,
      "relation": "eq"
    },
    "max_score": 10.844536,
    "hits": [
      {
        "_index": "my-index",
        "_id": "lake_tahoe",
        "_score": 10.844536,
        "_source": {
          ...
        },
        "inner_hits": {
          "inference_field": {
            "hits": {
              "total": {
                "value": 2,
                "relation": "eq"
              },
              "max_score": 10.844536,
              "hits": [
                {
                  "_index": "my-index",
                  "_id": "lake_tahoe",
                  "_nested": {
                    "field": "inference_field.inference.chunks",
                    "offset": 1
                  },
                  "_score": 3.2726858,
                  "_source": {
                    "text": "When hiking in the area, please be on alert for bears"
                  }
                }
              ]
            }
          }
        }
      }
    ]
  }
}
------------------------------------------------------------
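
The `from`/`size` behavior shown in the two responses above can be illustrated outside Elasticsearch. The following is a minimal, self-contained Java sketch and not code from this PR: `ChunkPaging`, `ScoredChunk`, and `page` are hypothetical names, and the scores are copied from the example response. It assumes chunks are ranked by descending score and that `from` and `size` then select a window of that ranking.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; none of these names exist in Elasticsearch.
final class ChunkPaging {

    /** A chunk of text together with its relevance score. */
    record ScoredChunk(String text, double score) {}

    /**
     * Ranks chunks by descending score and returns the window
     * [from, from + size) of that ranking, clamped to the list bounds.
     */
    static List<ScoredChunk> page(List<ScoredChunk> chunks, int from, int size) {
        if (from < 0 || size < 0) {
            throw new IllegalArgumentException("from and size must be non-negative");
        }
        List<ScoredChunk> ranked = new ArrayList<>(chunks);
        ranked.sort((a, b) -> Double.compare(b.score(), a.score()));
        int start = Math.min(from, ranked.size());
        // Use long arithmetic so a very large size cannot overflow.
        int end = (int) Math.min((long) start + size, ranked.size());
        return List.copyOf(ranked.subList(start, end));
    }

    public static void main(String[] args) {
        List<ScoredChunk> chunks = List.of(
            new ScoredChunk("When hiking in the area, please be on alert for bears", 3.2726858),
            new ScoredChunk("Lake Tahoe is the largest alpine lake in North America", 10.844536)
        );
        // Documented defaults (from = 0, size = 3): both chunks, best match first.
        System.out.println(page(chunks, 0, 3));
        // from = 1, size = 1: only the second-ranked chunk.
        System.out.println(page(chunks, 1, 1));
    }
}
```

With `from: 1` and `size: 1` the sketch returns only the lower-scoring chunk, matching the second response above, where `total.value` still reports both matching chunks.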

[discrete]
[[hybrid-search-semantic]]
==== Hybrid search with the `semantic` query
@@ -121,7 +321,7 @@ GET my-index/_search

[discrete]
[[advanced-search]]
- === Advanced search on `semantic_text` fields
+ ==== Advanced search on `semantic_text` fields

The `semantic` query uses default settings for searching on `semantic_text` fields for ease of use.
If you want to fine-tune a search on a `semantic_text` field, you need to know the task type used by the `inference_id` configured in `semantic_text`.
@@ -135,7 +335,7 @@ on a `semantic_text` field, it is not supported to use the `semantic_query` on a

[discrete]
[[search-sparse-inference]]
- ==== Search with `sparse_embedding` inference
+ ===== Search with `sparse_embedding` inference

When the {infer} endpoint uses a `sparse_embedding` model, you can use a <<query-dsl-sparse-vector-query,`sparse_vector` query>> on a <<semantic-text,`semantic_text`>> field in the following way:

@@ -164,7 +364,7 @@ You can customize the `sparse_vector` query to include specific settings, like <

[discrete]
[[search-text-inferece]]
- ==== Search with `text_embedding` inference
+ ===== Search with `text_embedding` inference

When the {infer} endpoint uses a `text_embedding` model, you can use a <<query-dsl-knn-query,`knn` query>> on a `semantic_text` field in the following way:

@@ -189,6 +189,7 @@ static TransportVersion def(int id) {
public static final TransportVersion ESQL_ORIGINAL_INDICES = def(8_719_00_0);
public static final TransportVersion ML_INFERENCE_EIS_INTEGRATION_ADDED = def(8_720_00_0);
public static final TransportVersion INGEST_PIPELINE_EXCEPTION_ADDED = def(8_721_00_0);
+ public static final TransportVersion SEMANTIC_QUERY_INNER_HITS = def(8_722_00_0);
/*
* STOP! READ THIS FIRST! No, really,
* ____ _____ ___ ____ _ ____ _____ _ ____ _____ _ _ ___ ____ _____ ___ ____ ____ _____ _
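
The commit list above includes a change that fails the semantic query request when the transport version does not support the new option, and `SEMANTIC_QUERY_INNER_HITS` is the constant added for that purpose. Where the check lives is not shown in this part of the diff, so the following is only a self-contained sketch of the general gating pattern. `TransportVersionGateSketch`, `Version`, and `checkInnerHitsSupported` are hypothetical stand-ins, not Elasticsearch classes, and the error message is invented for illustration.

```java
// Hypothetical illustration only; the real check in the PR is not shown here.
final class TransportVersionGateSketch {

    /** Minimal stand-in for a transport version: an integer id that grows over time. */
    record Version(int id) {
        boolean onOrAfter(Version other) {
            return id >= other.id;
        }
    }

    // Same numeric id as the constant added in the hunk above.
    static final Version SEMANTIC_QUERY_INNER_HITS = new Version(8_722_00_0);

    /**
     * Rejects a request that asks for inner hits when the receiving node
     * predates the version that introduced them. Requests that do not use
     * the option pass regardless of the remote version.
     */
    static void checkInnerHitsSupported(Version remote, boolean innerHitsRequested) {
        if (innerHitsRequested && remote.onOrAfter(SEMANTIC_QUERY_INNER_HITS) == false) {
            throw new IllegalArgumentException(
                "semantic query inner hits are not supported by transport version " + remote.id()
            );
        }
    }
}
```

Gating only when the option is actually requested keeps existing `semantic` queries working in a mixed-version cluster.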
@@ -49,9 +49,9 @@ public final class InnerHitBuilder implements Writeable, ToXContentObject {
public static final ParseField COLLAPSE_FIELD = new ParseField("collapse");
public static final ParseField FIELD_FIELD = new ParseField("field");

+ public static final int DEFAULT_FROM = 0;
+ public static final int DEFAULT_SIZE = 3;
private static final boolean DEFAULT_IGNORE_UNAMPPED = false;
- private static final int DEFAULT_FROM = 0;
- private static final int DEFAULT_SIZE = 3;
private static final boolean DEFAULT_VERSION = false;
private static final boolean DEFAULT_SEQ_NO_AND_PRIMARY_TERM = false;
private static final boolean DEFAULT_EXPLAIN = false;
@@ -38,6 +38,7 @@
import org.elasticsearch.index.mapper.ValueFetcher;
import org.elasticsearch.index.mapper.vectors.DenseVectorFieldMapper;
import org.elasticsearch.index.mapper.vectors.SparseVectorFieldMapper;
+ import org.elasticsearch.index.query.InnerHitBuilder;
import org.elasticsearch.index.query.MatchNoneQueryBuilder;
import org.elasticsearch.index.query.NestedQueryBuilder;
import org.elasticsearch.index.query.QueryBuilder;
@@ -52,6 +53,7 @@
import org.elasticsearch.xcontent.XContentParserConfiguration;
import org.elasticsearch.xpack.core.ml.inference.results.MlTextEmbeddingResults;
import org.elasticsearch.xpack.core.ml.inference.results.TextExpansionResults;
+ import org.elasticsearch.xpack.inference.queries.InnerChunkBuilder;

import java.io.IOException;
import java.util.ArrayList;
@@ -383,7 +385,12 @@ public IndexFieldData.Builder fielddataBuilder(FieldDataContext fieldDataContext
throw new IllegalArgumentException("[semantic_text] fields do not support sorting, scripting or aggregating");
}

- public QueryBuilder semanticQuery(InferenceResults inferenceResults, float boost, String queryName) {
+ public QueryBuilder semanticQuery(
+     InferenceResults inferenceResults,
+     float boost,
+     String queryName,
+     InnerChunkBuilder innerChunkBuilder
+ ) {
String nestedFieldPath = getChunksFieldName(name());
String inferenceResultsFieldName = getEmbeddingsFieldName(name());
QueryBuilder childQueryBuilder;
@@ -459,7 +466,10 @@ public QueryBuilder semanticQuery(InferenceResults inferenceResults, float boost
};
}

- return new NestedQueryBuilder(nestedFieldPath, childQueryBuilder, ScoreMode.Max).boost(boost).queryName(queryName);
+ InnerHitBuilder innerHitBuilder = innerChunkBuilder != null ? innerChunkBuilder.toInnerHitBuilder() : null;
+ return new NestedQueryBuilder(nestedFieldPath, childQueryBuilder, ScoreMode.Max).boost(boost)
+     .queryName(queryName)
+     .innerHit(innerHitBuilder);
}
}
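
The hunk above converts an optional `InnerChunkBuilder` into an `InnerHitBuilder`, passing `null` through when no chunk options were given. `InnerChunkBuilder` itself is not shown in this part of the diff, so the sketch below only illustrates the pattern with hypothetical stand-in types (`ChunkOptions`, `InnerHitOptions`, `resolve`). It assumes the documented defaults (`from` = `0`, `size` = `3`) and that the inner hits are named after the `semantic_text` field; passing the field name to the conversion method is a simplification and differs from the no-argument `toInnerHitBuilder()` call in the diff.

```java
// Hypothetical illustration only; these are not the classes from the PR.
final class InnerChunkOptionsSketch {

    // Defaults as documented for the new parameter and exposed by InnerHitBuilder.
    static final int DEFAULT_FROM = 0;
    static final int DEFAULT_SIZE = 3;

    /** Fully resolved inner hit settings (stand-in for InnerHitBuilder). */
    record InnerHitOptions(String name, int from, int size) {}

    /** User-supplied chunk settings; a null component means "not specified". */
    record ChunkOptions(Integer from, Integer size) {

        /** Fills in defaults and names the inner hits after the field. */
        InnerHitOptions toInnerHitOptions(String fieldName) {
            return new InnerHitOptions(
                fieldName,
                from != null ? from : DEFAULT_FROM,
                size != null ? size : DEFAULT_SIZE
            );
        }
    }

    /** Mirrors the null check in the hunk: no chunk options means no inner hits. */
    static InnerHitOptions resolve(ChunkOptions options, String fieldName) {
        return options != null ? options.toInnerHitOptions(fieldName) : null;
    }
}
```

Returning `null` rather than a default object keeps the nested query unchanged for callers that never ask for chunks, which matches how the diff passes a possibly-null builder to `innerHit`.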
