Skip to content

Commit b887725

Browse files
Merge pull request #5090 from bojunehsu/paulhsu/CU-faceSample
Content Understanding - Update face sample url
2 parents 740e355 + b2c50fe commit b887725

File tree

4 files changed

+19
-26
lines changed

4 files changed

+19
-26
lines changed

articles/ai-services/content-understanding/document/elements.md

Lines changed: 11 additions & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -51,14 +51,14 @@ Content Understanding generates richly formatted markdown that preserves the ori
5151

5252
#### Words
5353

54-
A `word` is a content element composed of a sequence of characters. Content Understanding uses word boundaries defined by [Unicode Standard Annex #29](https://www.unicode.org/reports/tr29/#Word_Boundaries). For Latin languages, words may be split from punctuation even without intervening spaces. In some language, such as Chinese, supplemental word dictionaries are used to enable word breaking at semantic boundaries. For more information, *see* [Boundary Analysis](https://unicode-org.github.io/icu/userguide/boundaryanalysis/).
54+
A `word` is a content element composed of a sequence of characters. Content Understanding uses word boundaries defined by [Unicode Standard Annex #29](https://www.unicode.org/reports/tr29/#Word_Boundaries). For Latin languages, words might be split from punctuation even without intervening spaces. In some language, such as Chinese, supplemental word dictionaries are used to enable word breaking at semantic boundaries. For more information, *see* [Boundary Analysis](https://unicode-org.github.io/icu/userguide/boundaryanalysis/).
5555

5656

5757
:::image type="content" source="../media/document/word-boundaries.png" alt-text="Screenshot of detected words.":::
5858

5959
#### Selection marks
6060

61-
A `selection mark` is a content element that represents a visual glyph indicating the state of a selection. They may be represented as check boxes, check marks, radio buttons, etc. The state of a selection mark can be selected or unselected, with different visual representation to indicate the state. They're encoded as words in the document analysis result using `` (selected) and `` (unselected).
61+
A `selection mark` is a content element that represents a visual glyph indicating the state of a selection. They might appear in the document as check boxes, check marks, radio buttons, etc. The state of a selection mark can be selected or unselected, with different visual representation to indicate the state. They're encoded as words in the document analysis result using `` (selected) and `` (unselected).
6262

6363
Content Understanding detects check marks inside table cell as selection marks in the selected state. However, it doesn't detect empty table cells as selection marks in the unselected state.
6464

@@ -85,7 +85,7 @@ A `barcode` is a content element that describes both linear (ex. UPC, EAN) and 2
8585

8686
#### Formulas
8787

88-
A `formula` is a content element representing mathematical expressions in the document. It may be an `inline` formula embedded with other text, or an `display` formula that takes up an entire line. Multiline formulas are represented as multiple `display` formula elements grouped into `paragraphs` to preserve mathematical relationships.
88+
A `formula` is a content element representing mathematical expressions in the document. It might be an `inline` formula embedded with other text, or an `display` formula that takes up an entire line. Multiline formulas are represented as multiple `display` formula elements grouped into `paragraphs` to preserve mathematical relationships.
8989

9090
#### Images
9191

@@ -97,22 +97,22 @@ Document layout elements are visual and structural components, such as pages, ta
9797

9898
#### Pages
9999

100-
A `page` is a grouping of content that typically corresponds to one side of a sheet of paper. A rendered page is characterized via `width` and `height` in the specified `unit`. In general, images use pixel while PDFs use inch. The `angle` property describes the overall text angle in degrees for pages that may be rotated.
100+
A `page` is a grouping of content that typically corresponds to one side of a sheet of paper. A rendered page is characterized via `width` and `height` in the specified `unit`. In general, images use pixel while PDFs use inch. The `angle` property describes the overall text angle in degrees for pages that might be rotated.
101101

102102
> [!NOTE]
103103
> For spreadsheets like Excel, each sheet is mapped to a page. For presentations, like PowerPoint, each slide is mapped to a page. For file formats like HTML or Word documents, which lack a native page concept without rendering, the entire main content is treated as a single page.
104104
105105
#### Paragraphs
106106

107-
A `paragraph` is an ordered sequence of lines that form a logical unit. Typically, the lines share common alignment and spacing between lines. Paragraphs are often delimited via indentation, added spacing, or bullets/numbering. Some paragraphs may have special functional `role` in the document. Currently supported roles include page header, page footer, page number, title, section heading, footnote, and formula block.
107+
A `paragraph` is an ordered sequence of lines that form a logical unit. Typically, the lines share common alignment and spacing between lines. Paragraphs are often delimited via indentation, added spacing, or bullets/numbering. Some paragraphs have special functional `role` in the document. Currently supported roles include page header, page footer, page number, title, section heading, footnote, and formula block.
108108

109109
#### Lines
110110

111111
A `line` is an ordered sequence of consecutive content elements, often separated by visual spaces. Content elements in the same horizontal plane (row) but separated by more than a single visual space are most often split into multiple lines. While this feature sometimes splits semantically contiguous content into separate lines, it enables the representation of textual content split into multiple columns or cells. Lines in vertical writing are detected in the vertical direction.
112112

113113
#### Tables
114114

115-
A `table` organizes content into a group of cells in a grid layout. The rows and columns may be visually separated by grid lines, color banding, or greater spacing. The position of a table cell is specified via its row and column indices. A cell can span across multiple rows and columns.
115+
A `table` organizes content into a group of cells in a grid layout. The rows and columns might be visually separated by grid lines, color banding, or greater spacing. The position of a table cell is specified via its row and column indices. A cell can span across multiple rows and columns.
116116

117117
Based on its position and styling, a cell can be classified as general content, row header, column header, stub head, or description:
118118

@@ -128,7 +128,7 @@ Based on its position and styling, a cell can be classified as general content,
128128

129129
A table caption specifies content that explains the table. A table can further have a set of footnotes. Unlike a description cell, a caption typically lies outside the grid layout. Table footnotes annotate content inside the table, often marked with footnote symbols. They're often found below the table grid.
130130

131-
A table may span across consecutive pages of a document. In this situation, table continuations in subsequent pages generally maintain the same column count, width, and styling. They often repeat the column headers. Other than page headers, footers, and page numbers, there's generally no intervening content between the initial table and its continuations.
131+
A table might span across consecutive pages of a document. In this situation, table continuations in subsequent pages generally maintain the same column count, width, and styling. They often repeat the column headers. Other than page headers, footers, and page numbers, there's generally no intervening content between the initial table and its continuations.
132132

133133
> [!NOTE]
134134
> The span for tables covers only the core content and exclude associated caption and footnotes.
@@ -137,7 +137,7 @@ A table may span across consecutive pages of a document. In this situation, tabl
137137

138138
#### Sections
139139

140-
A `section` is a logical grouping of related content elements that form a hierarchical structure within the document. It often starts with a section heading as the first paragraph. A section may contain subsections, creating a nested document structure that preserves semantic relationships.
140+
A `section` is a logical grouping of related content elements that form a hierarchical structure within the document. It often starts with a section heading as the first paragraph. A section might contain subsections, creating a nested document structure that preserves semantic relationships.
141141

142142
### Element properties
143143

@@ -149,26 +149,16 @@ The `span` property specifies the logical position of the element in the documen
149149

150150
#### Source
151151

152-
The `source` property describes the visual position of the element in the file using an encoded string. For documents, the source string may be in one of the following formats:
152+
The `source` property describes the visual position of the element in the file using an encoded string. For documents, the source string can be in one of the following formats:
153153
* Bounding polygon: `D({pageNumber},{x1},{y1},{x2},{y2},{x3},{y3},{x4},{y4})`
154154
* Axis-aligned bounding box: `D({pageNumber},{left},{top},{width},{height})`
155155

156-
Page numbers are `1-indexed`. The bounding polygon describes a sequence of points, clockwise from the left relative to the natural orientation of the element. For quadrilaterals, the points represent the top-left, top-right, bottom-right, and bottom-left corners. Each point represents the **x**, **y** coordinate in the length unit specified by the `unit` property. In general, the unit of measure for images is pixels while PDFs use inches.
156+
Page numbers are 1-indexed. The bounding polygon describes a sequence of points, clockwise from the left relative to the natural orientation of the element. For quadrilaterals, the points represent the top-left, top-right, bottom-right, and bottom-left corners. Each point represents the **x**, **y** coordinate in the length unit specified by the `unit` property. In general, the unit of measure for images is pixels while PDFs use inches.
157157

158158
:::image type="content" source="../media/document/bounding-regions.png" alt-text="Screenshot of detected bounding regions.":::
159159

160160
> [!NOTE]
161-
> Currently, Content Understanding only returns `4-point` quadrilaterals as bounding polygons. Future versions may return different number of points to describe more complex shapes, such as curved lines or nonrectangular images. Currently, source is only returned for elements from rendered files (pdf/image).
162-
163-
## Supported content and layout elements
164-
165-
Different file formats support different subsets of content and layout elements. The following table lists the currently supported elements for each file type.
166-
167-
|Document type|Supported format|
168-
|-----|-----|
169-
|**Portable Document Format**|`.pdf`|
170-
|**Image**|`.jpeg/.jpg`, `.png`, `.bmp`, `.tiff`, `.heif`|
171-
|**Microsoft Office**|`.docx`, `.pptx`, `.xls`|
161+
> Currently, Content Understanding only returns 4-point quadrilaterals as bounding polygons. Future versions might return different number of points to describe more complex shapes, such as curved lines or nonrectangular images. Currently, source is only returned for elements from rendered files (pdf/image).
172162
173163
## Next steps
174164

articles/ai-services/content-understanding/face/overview.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -92,4 +92,4 @@ Azure AI Content Understanding adheres to Microsoft's strict policies on custome
9292
## Next steps
9393

9494
* Learn how to build a [**person directory**](../tutorial/build-person-directory.md).
95-
* Review code sample: [**person directory**](https://github.com/Azure-Samples/azure-ai-content-understanding-python/blob/zhizho/face/notebooks/build_person_directory.ipynb).
95+
* Review code sample: [**person directory**](https://github.com/Azure-Samples/azure-ai-content-understanding-python/blob/main/notebooks/build_person_directory.ipynb).

articles/ai-services/content-understanding/language-region-support.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -27,6 +27,10 @@ To use Azure AI Content Understanding, create your Azure AI Service resource in
2727

2828
† Australia East doesn't support data zone as a processing location.
2929

30+
> [!NOTE]
31+
>
32+
> [Pro mode](concepts/standard-pro-modes.md) currently only supports data zone and global as processing location.
33+
3034
## Language support
3135

3236
Azure AI Content Understanding enables you to process data in multiple languages simultaneously. Our language support capabilities enable users to communicate with your applications in natural ways and empower global outreach.

articles/ai-services/content-understanding/video/overview.md

Lines changed: 3 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -54,7 +54,7 @@ With the prebuilt video analyzer (prebuilt-videoAnalyzer), you can upload a vide
5454

5555
* For example, creating the base `prebuilt-videoAnalyzer` as follows:
5656

57-
```jsonc
57+
```json
5858
{
5959
"config": {},
6060
"BaseAnalyzerId": "prebuilt-videoAnalyzer",
@@ -154,8 +154,7 @@ Shape the output to match your business vocabulary. Use a `fieldSchema` object w
154154

155155
**Example:**
156156

157-
```jsonc
158-
157+
```json
159158
"fieldSchema": {
160159
"description": "Extract brand presence and sentiment per scene",
161160
"fields": {
@@ -207,7 +206,7 @@ Content Understanding offers three ways to slice a video, letting you get the ou
207206
**Example:**
208207
* Break a news broadcast up into stories.
209208

210-
```jsonc
209+
```json
211210
{
212211
"segmentationMode": "custom",
213212
"segmentationDefinition": "news broadcasts divided by individual stories"

0 commit comments

Comments
 (0)