@@ -222,7 +222,7 @@ The new machine-learning based page object detection extracts logical roles like
 
 The document layout model in Document Intelligence extracts print and handwritten style text as `lines` and `words`. The `styles` collection ../includes any handwritten style for lines if detected along with the spans pointing to the associated text. This feature applies to [supported handwritten languages](../language-support/prebuilt.md).
 
-For Microsoft Word, Excel, PowerPoint, and HTML, Document Intelligence v4.0 2024-11-30 (GA) Layout model extract all embedded text as is. Texts are extracted as words and paragraphs. Embedded images aren't supported.
+For Microsoft Word, Excel, PowerPoint, and HTML, Document Intelligence v4.0 `2024-11-30` (GA) Layout model extract all embedded text as is. Texts are extracted as words and paragraphs. Embedded images aren't supported.
 
 #### [Sample code](#tab/sample-code)
 
@@ -343,8 +343,8 @@ Here are a few factors to consider when using the Document Intelligence bale ext
 
 > [!NOTE]
 >
-> * Table analysis is not supported if the input file is XLSX.
-> * For *2024-11-30 (GA)*, the bounding regions for figures and tables cover only the core content and exclude associated caption and footnotes.
+> * Table analysis isn't supported if the input file is XLSX.
+> * For `2024-11-30` (GA), the bounding regions for figures and tables cover only the core content and exclude associated caption and footnotes.
 
 #### [Sample code](#tab/sample-code)
 
@@ -399,7 +399,8 @@ if result.tables:
 The Layout API can output the extracted text in markdown format. Use the `outputContentFormat=markdown` to specify the output format in markdown. The markdown content is output as part of the `content` section.
 
 > [!NOTE]
-> For v4.0 *2024-11-30 (GA)*, the representation of tables is changed to HTML tables to enable rendering of merged cells, multi-row headers, etc. Another related change is to use Unicode checkbox characters ☒ and ☐ for selection marks instead of :selected: and :unselected:. Note that this means that the content of selection mark fields will contain :selected: even though their spans refer to Unicode characters in the top-level span.
+>
+> For v4.0 `2024-11-30` (GA), the representation of tables is changed to HTML tables to enable rendering of merged cells, multi-row headers, etc. Another related change is to use Unicode checkbox characters ☒ and ☐ for selection marks instead of `:selected:` and `:unselected:`. This update means that the content of selection mark fields contains `:selected:` even though their spans refer to Unicode characters in the top-level span.
 
 #### [Sample code](#tab/sample-code)
 
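The note in this hunk says that markdown output for `2024-11-30` (GA) uses ☒ and ☐ where earlier output used `:selected:` and `:unselected:`. A consumer that handles both styles might normalize the older tokens to the Unicode characters before comparing content. The following is a minimal sketch under that assumption; `normalize_selection_marks` is a hypothetical helper, not part of any Document Intelligence SDK.

```python
# Hypothetical helper: map the older text tokens to the Unicode
# checkbox characters described in the note above.
SELECTED = "\u2612"    # ☒ (ballot box with X)
UNSELECTED = "\u2610"  # ☐ (ballot box)


def normalize_selection_marks(markdown: str) -> str:
    """Replace `:selected:` / `:unselected:` tokens with ☒ / ☐."""
    # The two tokens do not overlap (":unselected:" has no colon
    # directly before "selected"), so the order of replacement is safe.
    return (
        markdown
        .replace(":unselected:", UNSELECTED)
        .replace(":selected:", SELECTED)
    )


if __name__ == "__main__":
    print(normalize_selection_marks("Agree :selected: Disagree :unselected:"))
```

Text that already uses the Unicode characters passes through unchanged, so the helper can be applied to output from either version.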
@@ -472,7 +473,7 @@ When *output=figures* is specified during the initial analyze operation, the ser
 `FigureId` is included in each figure object, following an undocumented convention of `{pageNumber}.{figureIndex}` where `figureIndex` resets to one per page.
 
 > [!NOTE]
-> For v4.0 *2024-11-30 (GA)*, the bounding regions for figures and tables cover only the core content and exclude associated caption and footnotes.
+> For v4.0 `2024-11-30` (GA), the bounding regions for figures and tables cover only the core content and exclude associated caption and footnotes.
 
 #### [Sample code](#tab/sample-code)
 
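The context line in this hunk describes `FigureId` as following a `{pageNumber}.{figureIndex}` convention. As a rough illustration of how a caller might split such an ID, here is a sketch; `FigureRef` and `parse_figure_id` are hypothetical names, and because the text calls the convention undocumented, the parser rejects anything that doesn't match it rather than guessing.

```python
from typing import NamedTuple


class FigureRef(NamedTuple):
    """Hypothetical container for the two parts of a figure ID."""
    page_number: int
    figure_index: int


def parse_figure_id(figure_id: str) -> FigureRef:
    """Split an ID of the form '{pageNumber}.{figureIndex}'."""
    page, sep, index = figure_id.partition(".")
    # Require exactly two non-empty, all-digit parts.
    if not sep or not page.isdigit() or not index.isdigit():
        raise ValueError(f"unexpected figure ID format: {figure_id!r}")
    return FigureRef(int(page), int(index))


if __name__ == "__main__":
    ref = parse_figure_id("2.1")
    print(ref.page_number, ref.figure_index)
```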
@@ -637,7 +638,7 @@ Supported file formats:
 
 * For custom extraction model training, the total size of training data is 50 MB for template model and `1` GB for the neural model.
 
-* For custom classification model training, the total size of training data is `1` GB with a maximum of 10,000 pages. For 2024-11-30 (GA), the total size of training data is `2` GB with a maximum of 10,000 pages.
+* For custom classification model training, the total size of training data is `1` GB with a maximum of 10,000 pages. For `2024-11-30` (GA), the total size of training data is `2` GB with a maximum of 10,000 pages.
 
 :::moniker-end
 
@@ -688,7 +689,7 @@ After you retrieve you key and endpoint, you can use the following development o
@@ -901,7 +902,7 @@ The new machine-learning based page object detection extracts logical roles like
 
 The document layout model in Document Intelligence extracts print and handwritten style text as `lines` and `words`. The `styles` collection ../includes any handwritten style for lines if detected along with the spans pointing to the associated text. This feature applies to [supported handwritten languages](../language-support/prebuilt.md).
 
-For Microsoft Word, Excel, PowerPoint, and HTML, Document Intelligence v4.0 2024-11-30 (GA) Layout model extract all embedded text as is. Texts are extracted as words and paragraphs. Embedded images aren't supported.
+For Microsoft Word, Excel, PowerPoint, and HTML, Document Intelligence v4.0 `2024-11-30` (GA) Layout model extract all embedded text as is. Texts are extracted as words and paragraphs. Embedded images aren't supported.
 
 :::moniker-end
 
@@ -1078,8 +1079,8 @@ Here are a few factors to consider when using the Document Intelligence bale ext
 
 > [!NOTE]
 >
-> * Table analysis is not supported if the input file is XLSX.
-> * Document Intelligence v4.0 2024-11-30 (GA) supports bounding regions for figures and tables that cover only the core content and exclude associated caption and footnotes.
+> * Table analysis isn't supported if the input file is XLSX.
+> * Document Intelligence v4.0 `2024-11-30` (GA) supports bounding regions for figures and tables that cover only the core content and exclude associated caption and footnotes.