You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Azure AI Content Understanding's document analysis capabilities help you transform unstructured document data into structured, machine-readable information. By precisely identifying and extracting document elements while preserving their structural relationships, you can build powerful document processing workflows for a wide range of applications.
25
+
Azure AI Content Understanding's analysis capabilities help you transform unstructured data into structured, machine-readable information. By precisely identifying and extracting elements while preserving their structural relationships, you can build powerful processing workflows for a wide range of applications.
26
26
27
-
This article explains the document analysis features that enable you to extract meaningful content from your documents, preserve document structures, and unlock the full potential of your document data.
27
+
The `contents` object with kind "document" supports output for a range of different input files including document, image, text, and structured file. These outputs enable you to extract meaningful content from your files, preserve document structures, and unlock the full potential of your data.
28
28
29
-
This document provides examples for **document file types** including `.pdf`, `.tiff`, `.jpg`, `.png`, `.bmp`, `.heif`, `.docx`, `.xlsx`, `.pptx`, `.txt`, `.html`, `.md`, `.rtf`, `.eml`, `.msg`, and `.xml` files. For complete details about supported file types, file size limits, and other constraints, see [service quotas and limits](../service-limits.md#analyzers).
29
+
**Document content kind** includes output for input files like:
30
+
-**Documents**: PDFs, Word documents, PowerPoint presentations, Excel spreadsheets
For complete details about supported file types, file size limits, and other constraints, see [service quotas and limits](../service-limits.md#analyzers).
30
37
31
38
## JSON response structure
32
39
@@ -92,12 +99,16 @@ A `word` is a content element composed of a sequence of characters. Content Unde
:::image type="content" source="../media/document/selection-marks.png" alt-text="Screenshot of detected selection marks.":::
125
140
126
141
#### Barcodes
127
142
128
-
A `barcode` is a content element that describes both linear (ex. UPC, EAN) and 2D (ex. QR, MaxiCode) barcodes. Content Understanding represents barcodes using its detected type and extracted value. The following barcode formats are currently accepted:
129
-
130
-
*`QR Code`
131
-
*`Code 39`
132
-
*`Code 93`
133
-
*`Code 128`
134
-
*`UPC (UPC-A & UPC-E)`
135
-
*`PDF417`
136
-
*`EAN-8`
137
-
*`EAN-13`
138
-
*`Codabar`
139
-
*`Databar`
140
-
*`Databar (expanded)`
141
-
*`ITF`
142
-
*`Data Matrix`
143
+
A `barcode` is a content element that describes both linear (ex. UPC, EAN) and 2D (ex. QR, MaxiCode) barcodes. Content Understanding represents barcodes using its detected type and extracted value. The following barcode formats are currently supported:
144
+
145
+
| Barcode Type | Description |
146
+
|--------------|-------------|
147
+
|`QRCode`| QR code, as defined in ISO/IEC 18004:2015 |
148
+
|`PDF417`| PDF417, as defined in ISO 15438 |
149
+
|`UPCA`| GS1 12-digit Universal Product Code |
150
+
|`UPCE`| GS1 6-digit Universal Product Code |
151
+
|`Code39`| Code 39 barcode, as defined in ISO/IEC 16388:2007 |
152
+
|`Code128`| Code 128 barcode, as defined in ISO/IEC 15417:2007 |
153
+
|`EAN8`| GS1 8-digit International Article Number (European Article Number) |
154
+
|`EAN13`| GS1 13-digit International Article Number (European Article Number) |
155
+
|`DataBar`| GS1 DataBar barcode |
156
+
|`Code93`| Code 93 barcode, as defined in ANSI/AIM BC5-1995 |
157
+
|`Codabar`| Codabar barcode, as defined in ANSI/AIM BC3-1995 |
An `image` is a content element that represents an embedded image, figure, or chart in the document. Content Understanding extracts any embedded text from the images, and any associated captions and footnotes.
202
+
An `figure` is a content element that represents an embedded image, figure, or chart in the document. Content Understanding extracts any embedded text from the images, and any associated captions and footnotes.
Page numbers are 1-indexed. The bounding polygon describes a sequence of points, clockwise from the left relative to the natural orientation of the element. For quadrilaterals, the points represent the top-left, top-right, bottom-right, and bottom-left corners. Each point represents the **x**, **y** coordinate in the length unit specified by the `unit` property. In general, the unit of measure for images is pixels while PDFs use inches.
392
+
Page numbers are one indexed. The bounding polygon describes a sequence of points, clockwise from the left relative to the natural orientation of the element. For quadrilaterals, the points represent the top-left, top-right, bottom-right, and bottom-left corners. Each point represents the **x**, **y** coordinate in the length unit specified by the `unit` property. In general, the unit of measure for images is pixels while PDFs use inches.
350
393
351
394
:::image type="content" source="../media/document/bounding-regions.png" alt-text="Screenshot of detected bounding regions.":::
352
395
353
396
> [!NOTE]
354
-
> Currently, Content Understanding only returns 4-point quadrilaterals as bounding polygons. Future versions might return different number of points to describe more complex shapes, such as curved lines or nonrectangular images. Currently, source is only returned for elements from rendered files (pdf/image).
397
+
> Currently, Content Understanding only returns a four-point quadrilateral as bounding polygons. Future versions might return different number of points to describe more complex shapes, such as curved lines or nonrectangular images. Currently, source is only returned for elements from rendered files (pdf/image).
355
398
356
399
## Next steps
357
400
358
401
* Try processing your document content using Content Understanding in [Azure AI Foundry](https://aka.ms/cu-landing).
359
-
* Learn to analyze document content [**analyzer templates**](../quickstart/use-ai-foundry.md).
402
+
* Learn to analyze document content with [**analyzer templates**](../quickstart/use-ai-foundry.md).
The following example shows the complete JSON response structure from analyzing a document. This represents the full output from Content Understanding when processing a PDF document with multiple element types:
408
+
The following example shows the complete JSON response structure from analyzing a document. This JSON represents the full output from Content Understanding when processing a PDF document with multiple element types:
366
409
367
410
:::image type="content" source="../media/document/demo-pdf-screenshot.png" alt-text="Screenshot of the demo PDF document showing example content including checkboxes, barcodes, formulas, images, and tables.":::
0 commit comments