Commit 8ad4e6f

update spacing
1 parent a9df2ad commit 8ad4e6f

1 file changed: +7 -7 lines changed

articles/ai-services/content-understanding/capabilities/overview.md

Lines changed: 7 additions & 7 deletions
@@ -45,8 +45,8 @@ The following section details the content extraction capabilities and optional a
 # [Document](#tab/document)
 
 |Content Extraction|Add-on Capabilities|
-|--------|-------------|
-|&bullet; **Optical Character Recognition (OCR)**: Extract printed and handwritten text from documents in various file formats, converting it into structured data. </br>| &bullet; **Layout**:Extracts layout information such as paragraphs, sections, tables, and more.. </br>&bullet; **Barcode**: Identifies and decodes all barcodes in the documents. </br> &bullet; **Formula**: Recognizes all identified mathematical equations from the documents. </br> |
+|-------------|-------------|
+|&bullet; **Optical Character Recognition (OCR)**: Extract printed and handwritten text from </br> documents in various file formats, converting it into structured data. </br>| &bullet; **Layout**:Extracts layout information such as paragraphs, sections, tables, and more.. </br>&bullet; **Barcode**: Identifies and decodes all barcodes in the documents. </br> &bullet; **Formula**: Recognizes all identified mathematical equations from the documents. </br> |
 
 # [Image](#tab/image)
 > [!NOTE]
@@ -55,14 +55,14 @@ The following section details the content extraction capabilities and optional a
 # [Audio](#tab/audio)
 
 |Content Extraction|Add-on Capabilities|
-|--------|-------------|
-|&bullet; **Transcription**:Converts conversational audio into searchable and analyzable text-based transcripts in WebVTT format. Customizable fields can be generated from transcription data. Sentence-level and word-level timestamps are available upon request. </br> &bullet; **Diarization**: Distinguishes between speakers in a conversation, attributing parts of the transcript to specific speakers. </br> &bullet; **Language detection**: Automatically detects the language spoken in the audio to be processed.</br>| &bullet; **Speaker role detection**: Identifies speaker roles based on diarization results and replaces generic labels like "Speaker 1" with specific role names, such as "Agent" or "Customer." </br>|
+|-------------|-------------|
+|&bullet; **Transcription**:Converts conversational audio into searchable and analyzable text-based transcripts </br> in WebVTT format. Customizable fields can be generated from transcription data. Sentence-level and word-level </br> timestamps are available upon request. </br> &bullet; **Diarization**: Distinguishes between speakers in a conversation, attributing parts of the transcript to specific speakers. </br> &bullet; **Language detection**: Automatically detects the language spoken in the audio to be processed.</br> | &bullet; **Speaker role detection**: Identifies speaker roles based on diarization results and replaces generic </br> labels like "Speaker 1" with specific role names, such as "Agent" or "Customer." </br>|
 
 # [Video](#tab/video)
 
 |Content Extraction|Add-on Capabilities|
-|--------|-------------|
-|&bullet; **Transcription**: Converts speech to structured, searchable text via Azure AI Speech, allowing users to specify recognition languages. </br>&bullet; **Shot Detection**: Identifies segments of the video aligned with shot boundaries where possible, allowing for precise editing and repackaging of content with breaks exactly on shot boundaries. </br> &bullet; **Key Frame Extraction**: Extracts key frames from videos to represent each shot completely, ensuring each shot has enough key frames to enable Field Extraction to work effectively.</br> | **Face Grouping**: Grouped faces appearing in a video to extract one representative face image for each person and provides segments where each one is present. The grouped face data is available as metadata and can be used to generate customized metadata fields.This feature is limited access and involves face identification and grouping; customers need to register for access at Face Recognition. |
+|-------------|-------------|
+|&bullet; **Transcription**: Converts speech to structured, searchable text via Azure AI Speech, allowing users to specify recognition languages. </br>&bullet; **Shot Detection**: Identifies segments of the video aligned with shot boundaries where possible, allowing for precise editing and </br> repackaging of content with breaks exactly on shot boundaries. </br> &bullet; **Key Frame Extraction**: Extracts key frames from videos to represent each shot completely, ensuring each </br> shot has enough key frames to enable Field Extraction to work effectively.</br> | &bullet; **Face Grouping**: Grouped faces appearing in a video to extract one representative face image for each person and provides segments where each one is present. </br> The grouped face data is available as metadata and can be used to generate customized metadata fields.</br> This feature is limited access and involves face identification and grouping; customers need to register for access at Face Recognition. |
 
 ----
 ### Field Extraction
@@ -81,7 +81,7 @@ Each modality supports specific generation approaches optimized for that content
 
 |Supported generation methods|
 |--------------|
-|**Extract**: In document, users can extract field values from input content, such as dates from receipts or item details from invoices. |
+|&bullet; **Extract**: In document, users can extract field values from input content, such as dates from receipts or item details from invoices. |
 
 :::image type="content" source="../media/capabilities/documentextraction.gif" alt-text="Illustration of Document extraction method workflow.":::
 