Skip to content

Commit 3c1fa26

Browse files
committed
Modifications to the intro
1 parent f99cdda commit 3c1fa26

File tree

2 files changed

+10
-5
lines changed

2 files changed

+10
-5
lines changed

articles/search/cognitive-search-skill-document-extraction.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -15,16 +15,17 @@ ms.date: 05/27/2025
1515

1616
# Document Extraction cognitive skill
1717

18-
The **Document Extraction** skill extracts content from a file within the enrichment pipeline. Default content extraction or retrieval is built into the indexer pipeline. However, by using the Document Extraction skill, you can set parameters for more control over image extraction.
18+
The **Document Extraction** skill extracts content from a file within the enrichment pipeline. By default, content extraction or retrieval is built into the indexer pipeline. However, by using the Document Extraction skill, you can control how parameters are set, and extracted content is represented in the enrichment tree.
1919

20-
For skills that operate over images, such as [GenAI Prompt skill](cognitive-search-skill-genai-prompt.md) for image verbalization or [Optical Character Recognition (OCR)](cognitive-search-skill-ocr.md), using the Document Extraction skill in your pipeline allows you to downstream manage skill inputs by name and path. The following tutorials demonstrate several scenarios:
20+
Document Extraction is more affordable than other data chunking approaches. The following tutorials demonstrate skill usage for different scenarios:
2121

2222
+ [Tutorial: Index mixed content using multimodal embeddings and the Document Extraction skill](tutorial-multimodal-indexing-with-embedding-and-doc-extraction.md)
2323

2424
+ [Tutorial: Index mixed content using image verbalizations and the Document Extraction skill](tutorial-multimodal-indexing-with-image-verbalization-and-doc-extraction.md)
2525

2626
> [!NOTE]
2727
> This skill isn't bound to Azure AI services and has no Azure AI services key requirement.
28+
>
2829
> This skill extracts text and images. Text extraction is free. Image extraction is [billable by Azure AI Search](https://azure.microsoft.com/pricing/details/search/). On a free search service, the cost of 20 transactions per indexer per day is absorbed so that you can complete quickstarts, tutorials, and small projects at no charge. For basic and higher tiers, image extraction is billable.
2930
>
3031

articles/search/cognitive-search-skill-document-intelligence-layout.md

Lines changed: 7 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -18,16 +18,20 @@ ms.date: 05/27/2025
1818

1919
[!INCLUDE [Feature preview](./includes/previews/preview-generic.md)]
2020

21-
The **Document Layout** skill analyzes a document to extract regions of interest and their inter-relationships to produce a syntactical representation of the document in Markdown or Text format. This skill uses the [Document Intelligence layout model](/azure/ai-services/document-intelligence/concept-layout) provided in [Azure AI Document Intelligence](/azure/ai-services/document-intelligence/overview).
21+
The **Document Layout** skill analyzes a document to extract regions of interest and their inter-relationships to produce a syntactical representation of the document in Markdown or Text format. You can use it to extract text and images. Image extraction includes location metadata that preserves image position within the document. Image proximity to related content is better for Retrieval Augmented Generation (RAG) workloads and [multimodal search](multimodal-search-overview.md).
2222

23-
This article is the reference documentation for the Document Layout skill. For usage information, see [Structure-aware chunking and vectorization](search-how-to-semantic-chunking.md). It's common to use this skill on content such as PDFs that have sections and multimodal content. The following tutorials demonstrate several scenarios:
23+
This article is the reference documentation for the Document Layout skill. For usage information, see [Structure-aware chunking and vectorization](search-how-to-semantic-chunking.md).
24+
25+
It's common to use this skill on content such as PDFs that have structure and images. The following tutorials demonstrate several scenarios:
2426

2527
+ [Tutorial: Index mixed content using image verbalizations and the Document Layout skill](tutorial-multimodal-index-image-verbalization-skill.md)
2628

2729
+ [Tutorial: Index mixed content using multimodal embeddings and the Document Layout skill](tutorial-multimodal-index-embeddings-skill.md)
2830

2931
> [!NOTE]
30-
> This skill is bound to Azure AI services and requires [a billable resource](cognitive-search-attach-cognitive-services.md) for transactions that exceed 20 documents per indexer per day. Execution of built-in skills is charged at the existing [Azure AI services Standard price](https://azure.microsoft.com/pricing/details/cognitive-services/).
32+
> This skill uses the [Document Intelligence layout model](/azure/ai-services/document-intelligence/concept-layout) provided in [Azure AI Document Intelligence](/azure/ai-services/document-intelligence/overview).
33+
>
34+
> This skill is bound to a [billable Azure AI multi-service resource](cognitive-search-attach-cognitive-services.md) for transactions that exceed 20 documents per indexer per day. Execution of built-in skills is charged at the existing [Azure AI services Standard price](https://azure.microsoft.com/pricing/details/cognitive-services/).
3135
>
3236
3337
## Supported regions

0 commit comments

Comments
 (0)