Skip to content

Commit c9da546

Browse files
committed
H2 edits
1 parent 13052a3 commit c9da546

File tree

1 file changed

+12
-12
lines changed

1 file changed

+12
-12
lines changed

articles/search/cognitive-search-skill-document-intelligence-layout.md

Lines changed: 12 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -34,6 +34,12 @@ It's common to use this skill on content such as PDFs that have structure and im
3434
> This skill is bound to a [billable Azure AI multi-service resource](cognitive-search-attach-cognitive-services.md) for transactions that exceed 20 documents per indexer per day. Execution of built-in skills is charged at the existing [Azure AI services Standard price](https://azure.microsoft.com/pricing/details/cognitive-services/).
3535
>
3636
37+
## Limitations
38+
39+
During the public preview, this skill has the following restrictions:
40+
41+
+ The skill isn't suitable for large documents requiring more than 5 minutes of processing in the AI Document Intelligence layout model. The skill times out, but charges still apply to the AI Services multi-services resource if it attaches to the skillset for billing purposes. Ensure documents are optimized to stay within processing limits to avoid unnecessary costs.
42+
3743
## Supported regions
3844

3945
The Document Layout skill calls the [Document Intelligence Public preview version 2024-07-31-preview](/rest/api/aiservices/operation-groups?view=rest-aiservices-v4.0%20(2024-07-31-preview)&preserve-view=true).
@@ -61,6 +67,10 @@ This skill recognizes the following file formats.
6167
+ .PPTX
6268
+ .HTML
6369

70+
## Supported languages
71+
72+
Refer to [Azure AI Document Intelligence layout model supported languages](/azure/ai-services/document-intelligence/language-support/ocr?view=doc-intel-3.1.0&tabs=read-print%2Clayout-print%2Cgeneral#layout&preserve-view=true) for printed text.
73+
6474
## Supported parameters
6575

6676
Several parameters are version-specific. The skills parameter table notes the API version in which a parameter was introduced so that you know how to configure the skill. To use version-specific features such as image and location metadata extraction in [2025-05-01-preview REST API](/rest/api/searchservice/skillsets/create?view=rest-searchservice-2025-05-01-preview&preserve-view=true), you can use the Azure portal, or target 2025-05-01-preview, or check an Azure SDK change log to see if it supports the new parameters.
@@ -77,16 +87,6 @@ Microsoft.Skills.Util.DocumentIntelligenceLayoutSkill
7787
+ Even if the file size for analyzing documents is 500 MB for [Azure AI Document Intelligence paid (S0) tier](https://azure.microsoft.com/pricing/details/cognitive-services/) and 4 MB for [Azure AI Document Intelligence free (F0) tier](https://azure.microsoft.com/pricing/details/cognitive-services/), indexing is subject to the [indexer limits](search-limits-quotas-capacity.md#indexer-limits) of your search service tier.
7888
+ Image dimensions must be between 50 pixels x 50 pixels or 10,000 pixels x 10,000 pixels.
7989
+ If your PDFs are password-locked, remove the lock before running the indexer.
80-
81-
## Supported languages
82-
83-
Refer to [Azure AI Document Intelligence layout model supported languages](/azure/ai-services/document-intelligence/language-support/ocr?view=doc-intel-3.1.0&tabs=read-print%2Clayout-print%2Cgeneral#layout&preserve-view=true) for printed text.
84-
85-
## Limitations
86-
87-
During the public preview, this skill has the following restrictions:
88-
89-
+ The skill isn't suitable for large documents requiring more than 5 minutes of processing in the AI Document Intelligence layout model. The skill times out, but charges still apply to the AI Services multi-services resource if it attaches to the skillset for billing purposes. Ensure documents are optimized to stay within processing limits to avoid unnecessary costs.
9090

9191
## Skill parameters
9292

@@ -145,7 +145,7 @@ The file reference object can be generated in one of following ways:
145145
| `text_sections` | Only applies if `outputFormat` is set to `text`. A collection of text chunk objects, which represent the text within the bounds of a page (factoring in any more chunking configured), *inclusive* of any section headers themselves. The text chunk object includes `locationMetadata` if applicable.|
146146
| `normalized_images` | Only applies if `outputFormat` is set to `text` and `extractionOptions` includes `images`. A collection of images that were extracted from the document, including `locationMetadata` if applicable.|
147147

148-
## Sample definition for markdown output mode
148+
### Sample definition for markdown output mode
149149

150150
```json
151151
{
@@ -173,7 +173,7 @@ The file reference object can be generated in one of following ways:
173173
}
174174
```
175175

176-
## Sample output for markdown output mode
176+
### Sample output for markdown output mode
177177

178178
```json
179179
{

0 commit comments

Comments
 (0)