Skip to content

Commit c90369f

Browse files
committed
update quickstart
1 parent 1c1ccf0 commit c90369f

File tree

4 files changed

+33
-32
lines changed

4 files changed

+33
-32
lines changed

articles/ai-services/document-intelligence/prebuilt/layout.md

Lines changed: 13 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ author: laujan
66
manager: nitinme
77
ms.service: azure-ai-document-intelligence
88
ms.topic: conceptual
9-
ms.date: 01/16/2025
9+
ms.date: 03/17/2025
1010
ms.author: lajanuar
1111
---
1212

@@ -103,7 +103,7 @@ After you retrieve you key and endpoint, you can use the following development o
103103
### [Document Intelligence Studio](#tab/studio)
104104

105105
* [Studio](https://documentintelligence.ai.azure.com/studio)
106-
* [How-to guide](../studio-overview.md#authentication-in-studio)
106+
* [How-to guide](../quickstarts/get-started-studio.md#authentication-in-studio)
107107

108108
---
109109

@@ -222,7 +222,7 @@ The new machine-learning based page object detection extracts logical roles like
222222

223223
The document layout model in Document Intelligence extracts print and handwritten style text as `lines` and `words`. The `styles` collection ../includes any handwritten style for lines if detected along with the spans pointing to the associated text. This feature applies to [supported handwritten languages](../language-support/prebuilt.md).
224224

225-
For Microsoft Word, Excel, PowerPoint, and HTML, Document Intelligence v4.0 2024-11-30 (GA) Layout model extract all embedded text as is. Texts are extracted as words and paragraphs. Embedded images aren't supported.
225+
For Microsoft Word, Excel, PowerPoint, and HTML, Document Intelligence v4.0 `2024-11-30` (GA) Layout model extract all embedded text as is. Texts are extracted as words and paragraphs. Embedded images aren't supported.
226226

227227
#### [Sample code](#tab/sample-code)
228228

@@ -343,8 +343,8 @@ Here are a few factors to consider when using the Document Intelligence bale ext
343343

344344
> [!NOTE]
345345
>
346-
> * Table analysis is not supported if the input file is XLSX.
347-
> * For *2024-11-30 (GA)*, the bounding regions for figures and tables cover only the core content and exclude associated caption and footnotes.
346+
> * Table analysis isn't supported if the input file is XLSX.
347+
> * For `2024-11-30` (GA), the bounding regions for figures and tables cover only the core content and exclude associated caption and footnotes.
348348
349349
#### [Sample code](#tab/sample-code)
350350

@@ -399,7 +399,8 @@ if result.tables:
399399
The Layout API can output the extracted text in markdown format. Use the `outputContentFormat=markdown` to specify the output format in markdown. The markdown content is output as part of the `content` section.
400400

401401
> [!NOTE]
402-
> For v4.0 *2024-11-30 (GA)*, the representation of tables is changed to HTML tables to enable rendering of merged cells, multi-row headers, etc. Another related change is to use Unicode checkbox characters ☒ and ☐ for selection marks instead of :selected: and :unselected:. Note that this means that the content of selection mark fields will contain :selected: even though their spans refer to Unicode characters in the top-level span.
402+
>
403+
> For v4.0 `2024-11-30` (GA), the representation of tables is changed to HTML tables to enable rendering of merged cells, multi-row headers, etc. Another related change is to use Unicode checkbox characters ☒ and ☐ for selection marks instead of `:selected:` and `:unselected:`. This update means that the content of selection mark fields contains `:selected:` even though their spans refer to Unicode characters in the top-level span.
403404
404405
#### [Sample code](#tab/sample-code)
405406

@@ -472,7 +473,7 @@ When *output=figures* is specified during the initial analyze operation, the ser
472473
`FigureId` is included in each figure object, following an undocumented convention of `{pageNumber}.{figureIndex}` where `figureIndex` resets to one per page.
473474

474475
> [!NOTE]
475-
> For v4.0 *2024-11-30 (GA)*, the bounding regions for figures and tables cover only the core content and exclude associated caption and footnotes.
476+
> For v4.0 `2024-11-30` (GA), the bounding regions for figures and tables cover only the core content and exclude associated caption and footnotes.
476477
477478
#### [Sample code](#tab/sample-code)
478479

@@ -637,7 +638,7 @@ Supported file formats:
637638

638639
* For custom extraction model training, the total size of training data is 50 MB for template model and `1` GB for the neural model.
639640

640-
* For custom classification model training, the total size of training data is `1` GB with a maximum of 10,000 pages. For 2024-11-30 (GA), the total size of training data is `2` GB with a maximum of 10,000 pages.
641+
* For custom classification model training, the total size of training data is `1` GB with a maximum of 10,000 pages. For `2024-11-30` (GA), the total size of training data is `2` GB with a maximum of 10,000 pages.
641642

642643
:::moniker-end
643644

@@ -688,7 +689,7 @@ After you retrieve you key and endpoint, you can use the following development o
688689
### [Document Intelligence Studio](#tab/studio)
689690

690691
* [Studio](https://documentintelligence.ai.azure.com/studio)
691-
* [How-to guide](../studio-overview.md#authentication-in-studio)
692+
* [How-to guide](../quickstarts/get-started-studio.md#authentication-in-studio)
692693

693694
---
694695

@@ -901,7 +902,7 @@ The new machine-learning based page object detection extracts logical roles like
901902

902903
The document layout model in Document Intelligence extracts print and handwritten style text as `lines` and `words`. The `styles` collection ../includes any handwritten style for lines if detected along with the spans pointing to the associated text. This feature applies to [supported handwritten languages](../language-support/prebuilt.md).
903904

904-
For Microsoft Word, Excel, PowerPoint, and HTML, Document Intelligence v4.0 2024-11-30 (GA) Layout model extract all embedded text as is. Texts are extracted as words and paragraphs. Embedded images aren't supported.
905+
For Microsoft Word, Excel, PowerPoint, and HTML, Document Intelligence v4.0 `2024-11-30` (GA) Layout model extract all embedded text as is. Texts are extracted as words and paragraphs. Embedded images aren't supported.
905906

906907
:::moniker-end
907908

@@ -1078,8 +1079,8 @@ Here are a few factors to consider when using the Document Intelligence bale ext
10781079

10791080
> [!NOTE]
10801081
>
1081-
> * Table analysis is not supported if the input file is XLSX.
1082-
> * Document Intelligence v4.0 2024-11-30 (GA) supports bounding regions for figures and tables that cover only the core content and exclude associated caption and footnotes.
1082+
> * Table analysis isn't supported if the input file is XLSX.
1083+
> * Document Intelligence v4.0 `2024-11-30` (GA) supports bounding regions for figures and tables that cover only the core content and exclude associated caption and footnotes.
10831084
10841085
:::moniker-end
10851086

0 commit comments

Comments
 (0)