pawankartik-elastic
diff --git a/‎docs/redirects.yml‎
Lines changed: 14 additions & 0 deletions b/‎docs/redirects.yml‎
Lines changed: 14 additions & 0 deletions
diff --git a/‎docs/reference/elasticsearch/mapping-reference/semantic-text-how-tos.md‎
Lines changed: 22 additions & 0 deletions b/‎docs/reference/elasticsearch/mapping-reference/semantic-text-how-tos.md‎
Lines changed: 22 additions & 0 deletions
diff --git a/‎docs/reference/elasticsearch/mapping-reference/semantic-text-ingestions.md‎
Lines changed: 150 additions & 0 deletions b/‎docs/reference/elasticsearch/mapping-reference/semantic-text-ingestions.md‎
Lines changed: 150 additions & 0 deletions
diff --git a/‎docs/reference/elasticsearch/mapping-reference/semantic-text-reference.md‎
Lines changed: 182 additions & 0 deletions b/‎docs/reference/elasticsearch/mapping-reference/semantic-text-reference.md‎
Lines changed: 182 additions & 0 deletions
@@ -112,3 +112,17 @@ redirects:
     many:
       - to: 'reference/elasticsearch/mapping-reference/match-only-text.md'
         anchors: { 'match-only-text-field-type', 'match-only-text-params' }
+
+  # semantic_text documentation restructuring - anchors moved to new pages
+  'reference/elasticsearch/mapping-reference/semantic-text.md':
+    to: 'reference/elasticsearch/mapping-reference/semantic-text.md'
+    anchors: {} # pass-through unlisted anchors in the `many` ruleset
+    many:
+      - to: 'reference/elasticsearch/mapping-reference/semantic-text-setup-configuration.md'
+        anchors: { 'using-elser-on-eis', 'default-and-preconfigured-endpoints', 'using-custom-endpoint', 'dedicated-endpoints-for-ingestion-and-search', 'configure-inference-endpoints' }
+      - to: 'reference/elasticsearch/mapping-reference/semantic-text-ingestions.md'
+        anchors: { 'pre-chunking', 'use-copy-to-with-semantic-text', 'updates-and-partial-updates', 'scripted-updates' }
+      - to: 'reference/elasticsearch/mapping-reference/semantic-text-search-retrieval.md'
+        anchors: { 'querying-semantic-text-fields', 'retrieving-indexed-chunks', 'returning-semantic-field-embeddings', 'returning-semantic-field-embeddings-in-_source', 'reindex-while-preserving-embeddings', 'troubleshooting-semantic-text-fields', 'returning-semantic-field-embeddings-using-fields', 'highlighting-fragments', 'cross-cluster-search' }
+      - to: 'reference/elasticsearch/mapping-reference/semantic-text-reference.md'
+        anchors: { 'semantic-text-params', 'configuring-inference-endpoints', 'chunking-behavior', 'limitations', 'document-count-discrepancy' }
@@ -0,0 +1,22 @@
+---
+navigation_title: "How-to guides"
+mapped_pages:
+  - https://www.elastic.co/guide/en/elasticsearch/reference/current/semantic-text.html
+applies_to:
+  stack: ga 9.0
+  serverless: ga
+---
+
+# How-to guides for the `semantic_text` field type
+
+These guides provide procedure descriptions and examples for common tasks when working with `semantic_text` fields:
+
+- [Set up and configure `semantic_text` fields](./semantic-text-setup-configuration.md): Learn how to configure {{infer}} endpoints, including default and preconfigured options, ELSER on EIS, custom endpoints, and dedicated endpoints for ingestion and search operations.
+
+- [Ingest data with `semantic_text` fields](./semantic-text-ingestions.md): Learn how to index pre-chunked content, use `copy_to` and multi-fields to collect values from multiple fields, and perform updates and partial updates to optimize ingestion costs.
+
+- [Search and retrieve `semantic_text` fields](./semantic-text-search-retrieval.md): Learn how to query `semantic_text` fields, retrieve indexed chunks, return field embeddings, and highlight the most relevant fragments from search results.
+
+
+
+
@@ -0,0 +1,150 @@
+---
+navigation_title: "Ingest data"
+mapped_pages:
+  - https://www.elastic.co/guide/en/elasticsearch/reference/current/semantic-text.html
+applies_to:
+  stack: ga 9.0
+  serverless: ga
+---
+
+# Ingest data with `semantic_text` fields [set-up-configuration-semantic-text]
+
+This page provides instructions for ingesting data into `semantic_text` fields. Learn how to index pre-chunked content, use `copy_to` and multi-fields to collect values from multiple fields, and perform updates and partial updates to optimize ingestion costs.
+
+## Index pre-chunked content [pre-chunking]
+```{applies_to}
+stack: ga 9.1
+```
+
+To index pre-chunked content, provide your text as an array of strings. Each element in the array represents a single chunk that will be sent directly to the {{infer}} service without further chunking.
+
+:::::{stepper}
+
+::::{step} Disable automatic chunking
+
+Disable automatic chunking in your index mapping by setting `chunking_settings.strategy` to `none`:
+
+```console
+PUT test-index
+{
+  "mappings": {
+    "properties": {
+      "my_semantic_field": {
+        "type": "semantic_text",
+        "chunking_settings": {
+          "strategy": "none"    <1>
+        }
+      }
+    }
+  }
+}
+```
+
+1. Disables automatic chunking on `my_semantic_field`.
+
+::::
+
+::::{step} Index documents
+
+Index documents with pre-chunked text as an array:
+
+```console
+PUT test-index/_doc/1
+{
+    "my_semantic_field": ["my first chunk", "my second chunk", ...]    <1>
+    ...
+}
+```
+
+1. The text is pre-chunked and provided as an array of strings. Each element represents a single chunk.
+
+::::
+
+:::::
+
+:::{important}
+When providing pre-chunked input:
+
+- Ensure that you set the chunking strategy to `none` to avoid additional processing.
+- Size each chunk carefully, staying within the token limit of the {{infer}} service and the underlying model.
+- If a chunk exceeds the model's token limit, the behavior depends on the service:
+  - Some services (such as OpenAI) will return an error.
+  - Others (such as `elastic` and `elasticsearch`) will automatically truncate the input.
+:::
+
+## Use `copy_to` and multi-fields with `semantic_text` [use-copy-to-with-semantic-text]
+
+You can use a single `semantic_text` field to collect values from multiple fields for semantic search. The `semantic_text` field type can serve as the target of [copy_to fields](/reference/elasticsearch/mapping-reference/copy-to.md), be part of a [multi-field](/reference/elasticsearch/mapping-reference/multi-fields.md) structure, or contain [multi-fields](/reference/elasticsearch/mapping-reference/multi-fields.md) internally.
+
+### Use copy_to
+
+Use `copy_to` to copy values from source fields to a `semantic_text` field:
+
+```console
+PUT test-index
+{
+    "mappings": {
+        "properties": {
+            "source_field": {
+                "type": "text",
+                "copy_to": "infer_field"
+            },
+            "infer_field": {
+                "type": "semantic_text",
+                "inference_id": ".elser-2-elasticsearch"
+            }
+        }
+    }
+}
+```
+% TEST[skip:Requires {{infer}} endpoint]
+
+### Use multi-fields
+
+Declare `semantic_text` as a multi-field:
+
+```console
+PUT test-index
+{
+    "mappings": {
+        "properties": {
+            "source_field": {
+                "type": "text",
+                "fields": {
+                    "infer_field": {
+                        "type": "semantic_text",
+                        "inference_id": ".elser-2-elasticsearch"
+                    }
+                }
+            }
+        }
+    }
+}
+```
+% TEST[skip:Requires {{infer}} endpoint]
+
+## Updates and partial updates [updates-and-partial-updates]
+
+When updating documents that contain `semantic_text` fields, it's important to understand how {{infer}} is triggered:
+
+Full document updates
+:   Full document updates re-run {{infer}} on all `semantic_text` fields, even if their values did not change. This ensures that embeddings remain consistent with the current document state but can increase ingestion costs.
+
+Partial updates using the Bulk API
+:   Partial updates submitted through the [Bulk API](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-bulk) reuse existing embeddings when you omit `semantic_text` fields. {{infer}} does not run for omitted fields, which can significantly reduce processing time and cost.
+
+Partial updates using the Update API
+:   Partial updates submitted through the [Update API](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-update) re-run {{infer}} on all `semantic_text` fields, even when you omit them from the `doc` object. Embeddings are re-generated regardless of whether field values changed.
+
+To preserve existing embeddings and avoid unnecessary {{infer}} costs:
+
+ * Use partial updates with the Bulk API.
+ * Omit any `semantic_text` fields that did not change from the `doc` object in your request.
+
+### Scripted updates [scripted-updates]
+
+For indices containing `semantic_text` fields, updates that use scripts have the
+following behavior:
+
+- ✅ **Supported:** [Update API](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-update)
+- ❌ **Not supported:** [Bulk API](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-bulk-1). Scripted updates will fail even if the script targets non-`semantic_text` fields.
@@ -0,0 +1,182 @@
+---
+navigation_title: "Reference"
+mapped_pages:
+  - https://www.elastic.co/guide/en/elasticsearch/reference/current/semantic-text.html
+applies_to:
+  stack: ga 9.0
+  serverless: ga
+---
+
+# Semantic text field type reference  [semantic-text]
+
+This page provides reference content for the `semantic_text` field type, including parameter descriptions, {{infer}} endpoint configuration options, chunking behavior, update operations, querying options, and limitations.
+
+## Parameters for `semantic_text` [semantic-text-params]
+
+The `semantic_text` field type uses default indexing settings based on the [{{infer}} endpoint](#configuring-inference-endpoints) specified, enabling you to get started without providing additional configuration details. You can override these defaults by customizing the parameters described below.
+
+`inference_id`
+:   (Optional, string) {{infer-cap}} endpoint that will be used to generate
+embeddings for the field. If `search_inference_id` is specified, the {{infer}}
+endpoint will only be used at index time. Learn more about [configuring this parameter](#configuring-inference-endpoints).
+
+**Updating the `inference_id` parameter**
+
+::::{applies-switch}
+
+:::{applies-item} { "stack": "ga 9.0" }
+This parameter cannot be updated.
+:::
+
+:::{applies-item} { "stack": "ga 9.3" }
+
+You can update this parameter by using
+the [Update mapping API](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-indices-put-mapping).
+You can update the {{infer}} endpoint if no values have been indexed or if the new endpoint is compatible with the current one.
+
+::::{important}
+When updating an `inference_id` it is important to ensure the new {{infer}} endpoint produces embeddings compatible with those already indexed. This typically means using the same underlying model.
+::::
+
+:::
+
+::::
+
+`search_inference_id`
+:   (Optional, string) The {{infer}} endpoint that will be used to generate
+embeddings at query time. Use the [Create {{infer}} API](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-inference-put) to create the endpoint. If not specified, the {{infer}} endpoint defined by
+`inference_id` will be used at both index and query time. 
+    
+    You can update this parameter by using
+the [Update mapping API](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-indices-put-mapping).     
+    
+    Learn how to [use dedicated endpoints for ingestion and search](./semantic-text-setup-configuration.md#dedicated-endpoints-for-ingestion-and-search).
+
+`index_options` {applies_to}`stack: ga 9.1`
+:   (Optional, object) Specifies the index options to override default values
+for the field. Currently, `dense_vector` and `sparse_vector` index options are supported. For text embeddings, `index_options` may match any allowed.
+
+- [dense_vector index options](/reference/elasticsearch/mapping-reference/dense-vector.md#dense-vector-index-options)
+
+- [sparse_vector index options](/reference/elasticsearch/mapping-reference/sparse-vector.md#sparse-vectors-params) {applies_to}`stack: ga 9.2`
+
+`chunking_settings` {applies_to}`stack: ga 9.1`
+:   (Optional, object) Settings for chunking text into smaller passages.
+If specified, these will override the chunking settings set in the {{infer-cap}}
+endpoint associated with `inference_id`.
+
+    If chunking settings are updated, they will not be applied to existing documents
+until they are reindexed.  Defaults to the optimal chunking settings for [Elastic Rerank](docs-content://explore-analyze/machine-learning/nlp/ml-nlp-rerank.md).
+
+    To completely disable chunking, use the `none` chunking strategy.
+
+    ::::{important}
+    When using the `none` chunking strategy, if the input exceeds the maximum token limit of the underlying model,
+    some services (such as OpenAI) may return an error. In contrast, the `elastic` and `elasticsearch` services will
+    automatically truncate the input to fit within the model's limit.
+     ::::
+
+### Customizing semantic_text indexing
+
+The following example shows how to configure `inference_id`, `index_options` and `chunking_settings` for a `semantic_text` field type:
+
+```console
+PUT my-index-000004
+{
+  "mappings": {
+    "properties": {
+      "inference_field": {
+        "type": "semantic_text",
+        "inference_id": "my-text-embedding-endpoint", <1>
+        "index_options": { <2>
+          "dense_vector": {
+            "type": "int4_flat"
+          }
+        },
+        "chunking_settings": { <3>
+          "type": "none"
+        }
+      }
+    }
+  }
+}
+```
+% TEST[skip:Requires {{infer}} endpoint]
+
+1. The `inference_id` of the {{infer}} endpoint to use for generating embeddings.
+2. Overrides default index options by specifying `int4_flat` quantization for dense vector embeddings.
+3. Disables automatic chunking by setting the chunking strategy to `none`.
+
+::::{note}
+{applies_to}`stack: ga 9.1`  Newly created indices with `semantic_text` fields using dense embeddings will be
+[quantized](/reference/elasticsearch/mapping-reference/dense-vector.md#dense-vector-quantization)
+to `bbq_hnsw` automatically as long as they have a minimum of 64 dimensions.
+::::
+
+## {{infer-cap}} endpoints [configuring-inference-endpoints]
+
+The `semantic_text` field type specifies an {{infer}} endpoint identifier (`inference_id`) that is used to generate embeddings.
+
+The following {{infer}} endpoint configurations are available:
+
+- [Default and preconfigured endpoints](./semantic-text-setup-configuration.md#default-and-preconfigured-endpoints): Use `semantic_text` without creating an {{infer}} endpoint manually. 
+
+- [ELSER on EIS](./semantic-text-setup-configuration.md#using-elser-on-eis): Use the ELSER model through the Elastic {{infer-cap}} Service. 
+
+- [Custom endpoints](./semantic-text-setup-configuration.md#using-custom-endpoint): Create your own {{infer}} endpoint using the [Create {{infer}} API](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-inference-put) to use custom models or third-party services.
+
+If you use a [custom {{infer}} endpoint](./semantic-text-setup-configuration.md#using-custom-endpoint) through your ML node and not through Elastic {{infer-cap}} Service (EIS), the recommended method is to [use dedicated endpoints for ingestion and search](./semantic-text-setup-configuration.md#dedicated-endpoints-for-ingestion-and-search). 
+
+{applies_to}`stack: ga 9.1.0` If you use EIS, you don't have to set up dedicated endpoints.
+
+::::{warning}
+Removing an {{infer}} endpoint will cause ingestion of documents and semantic
+queries to fail on indices that define `semantic_text` fields with that
+{{infer}} endpoint as their `inference_id`. Trying
+to [delete an {{infer}} endpoint](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-inference-delete)
+that is used on a `semantic_text` field will result in an error.
+::::
+
+## Chunking [chunking-behavior]
+
+{{infer-cap}} endpoints have a limit on the amount of text they can process. To
+allow for large amounts of text to be used in semantic search, `semantic_text`
+automatically generates smaller passages if needed, called chunks.
+
+Each chunk refers to a passage of the text and the corresponding embedding
+generated from it. When querying, the individual passages will be automatically
+searched for each document, and the most relevant passage will be used to
+compute a score.
+
+Chunks are stored as start and end character offsets rather than as separate
+text strings. These offsets point to the exact location of each chunk within the
+original input text.
+
+You can [pre-chunk content](./semantic-text-ingestions.md#pre-chunking) by providing text as arrays before indexing.
+
+Refer to the [{{infer-cap}} API documentation](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-inference-put#operation-inference-put-body-application-json-chunking_settings) for values for `chunking_settings` and to [Configuring chunking](docs-content://explore-analyze/elastic-inference/inference-api.md#infer-chunking-config) to learn about different chunking strategies.
+
+
+## Limitations [limitations]
+
+`semantic_text` field types have the following limitations:
+
+* `semantic_text` fields are not currently supported as elements
+  of [nested fields](/reference/elasticsearch/mapping-reference/nested.md).
+* `semantic_text` fields can't currently be set as part
+  of [dynamic templates](docs-content://manage-data/data-store/mapping/dynamic-templates.md).
+* `semantic_text` fields are not supported in indices created prior to 8.11.0.
+* `semantic_text` fields do not support [Cross-Cluster Search (CCS)](docs-content://solutions/search/cross-cluster-search.md) when [`ccs_minimize_roundtrips`](docs-content://solutions/search/cross-cluster-search.md#ccs-network-delays) is set to `false`.
+* `semantic_text` fields do not support [Cross-Cluster Search (CCS)](docs-content://solutions/search/cross-cluster-search.md) in [ES|QL](/reference/query-languages/esql.md).
+* `semantic_text` fields do not support [Cross-Cluster Replication (CCR)](docs-content://deploy-manage/tools/cross-cluster-replication.md).
+
+
+## Document count discrepancy in `_cat/indices` [document-count-discrepancy]
+
+When an index contains a `semantic_text` field, the `docs.count` value returned by the [`_cat/indices`](https://www.elastic.co/docs/api/doc/elasticsearch/operation/operation-cat-indices) API may be higher than the number of documents you indexed. 
+This occurs because `semantic_text` stores embeddings in [nested documents](/reference/elasticsearch/mapping-reference/nested.md), one per chunk. The `_cat/indices` API counts all documents in the Lucene index, including these hidden nested documents.
+
+To count only top-level documents, excluding the nested documents that store embeddings, use one of the following APIs:
+
+* `GET /<index>/_count`
+* `GET _cat/count/<index>`