Skip to content

Commit 5989017

Browse files
authored
Update search-indexer-overview.md
Improving wording to avoid confusion regarding image extraction processing.
1 parent b32a199 commit 5989017

File tree

1 file changed

+7
-2
lines changed

1 file changed

+7
-2
lines changed

articles/search/search-indexer-overview.md

Lines changed: 7 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.service: azure-ai-search
1010
ms.custom:
1111
- ignite-2023
1212
ms.topic: conceptual
13-
ms.date: 12/19/2024
13+
ms.date: 04/09/2025
1414
---
1515

1616
# Indexers in Azure AI Search
@@ -81,7 +81,9 @@ For each document it receives, an indexer implements or coordinates multiple ste
8181

8282
### Stage 1: Document cracking
8383

84-
Document cracking is the process of opening files and extracting content. Text-based content can be extracted from files on a service, rows in a table, or items in container or collection. If you add a skillset and [image skills](cognitive-search-concept-image-scenarios.md), document cracking can also extract images and queue them for image processing.
84+
Document cracking is the process of opening files and extracting content. Text-based content can be extracted from files on a service, rows in a table, or items in container or collection.
85+
86+
You can also enable image extraction during document cracking for an [extra fee](https://azure.microsoft.com/en-us/pricing/details/search/). This is disabled by default and can be enabled via the `imageAction` property in the [indexer parameters configuration](/rest/api/searchservice/indexers/create-or-update).
8587

8688
Depending on the data source, the indexer will try different operations to extract potentially indexable content:
8789

@@ -91,6 +93,9 @@ Depending on the data source, the indexer will try different operations to extra
9193

9294
+ When the document is a record in [Azure Cosmos DB](search-howto-index-cosmosdb.md), the indexer will extract non-binary content from fields and subfields from the Azure Cosmos DB document.
9395

96+
Note that the document cracking process can also be triggered later during the optional [skillset execution](cognitive-search-concept-intro.md) stage, using skillsets, for data transformation. Adding a skillset with [image skills](cognitive-search-concept-image-scenarios.md) allows document cracking to extract images and queue them for processing.
97+
98+
9499
### Stage 2: Field mappings
95100

96101
An indexer extracts text from a source field and sends it to a destination field in an index or knowledge store. When field names and data types coincide, the path is clear. However, you might want different names or types in the output, in which case you need to tell the indexer how to map the field.

0 commit comments

Comments
 (0)