articles/search/cognitive-search-skill-image-analysis.md
Lines changed: 1 addition & 1 deletion
@@ -52,7 +52,7 @@ Parameters are case-sensitive.
 | Output name | Description |
 |---------------|-------------------------------|
 |`adult`| Output is a single [Adult](../cognitive-services/computer-vision/concept-detecting-adult-content.md) object of a complex type, consisting of boolean fields (`isAdultContent`, `isGoryContent`, `isRacyContent`) and double type scores (`AdultScore`, `GoreScore`, `RacyScore`). |
-|`brands`| Output is an array of of [Brand](../cognitive-services/computer-vision/concept-brand-detection.md) objects, where the object is a complex type consisting of `Name` (string) and a `Confidence` score (double). It also returns a `Rectangle` with four bounding box coordinates (in pixels) indicating placement inside the image. |
+|`brands`| Output is an array of [Brand](../cognitive-services/computer-vision/concept-brand-detection.md) objects, where the object is a complex type consisting of `Name` (string) and a `Confidence` score (double). It also returns a `Rectangle` with four bounding box coordinates (in pixels) indicating placement inside the image. |
 |`categories`| Output is an array of [category](../cognitive-services/computer-vision/concept-categorizing-images.md) objects, where each category object is a complex type consisting of a `Name` (string), `Score` (double), and optional `Detail` that contains celebrity or landmark details. See the [category taxonomy](../cognitive-services/Computer-vision/Category-Taxonomy.md) for the full list of category names. A detail is a nested complex type. A celebrity detail consists of a name, confidence score, and face bounding box. A landmark detail consists of a name and confidence score.|
 |`description`| Output is a single [Description](../cognitive-services/computer-vision/concept-describing-images.md) object of a complex type, consisting of lists of `Tags` and `Caption` (an array consisting of `Text` (string) and `Confidence` (double)). |
 |`faces`| Complex type consisting of `Age`, `Gender`, and `FaceBoundingBox` having four bounding box coordinates (in pixels) indicating placement inside the image.|
articles/search/cognitive-search-skill-ocr.md
Lines changed: 3 additions & 3 deletions
@@ -41,7 +41,7 @@ Parameters are case-sensitive.
 | Parameter name | Description |
 |--------------------|-------------|
 |`detectOrientation`| Detects image orientation. Valid values are `true` or `false`.|
-|`defaultLanguageCode`| Language code of the input text. Supported languages include: <br/> `zh-Hans` (ChineseSimplified) <br/> `zh-Hant` (ChineseTraditional) <br/>`cs` (Czech) <br/>`da` (Danish) <br/>`nl` (Dutch) <br/>`en` (English) <br/>`fi` (Finnish) <br/>`fr` (French) <br/>`de` (German) <br/>`el` (Greek) <br/>`hu` (Hungarian) <br/>`it` (Italian) <br/>`ja` (Japanese) <br/>`ko` (Korean) <br/>`nb` (Norwegian) <br/>`pl` (Polish) <br/>`pt` (Portuguese) <br/>`ru` (Russian) <br/>`es` (Spanish) <br/>`sv` (Swedish) <br/>`tr` (Turkish) <br/>`ar` (Arabic) <br/>`ro` (Romanian) <br/>`sr-Cyrl` (SerbianCyrillic) <br/>`sr-Latn` (SerbianLatin) <br/>`sk` (Slovak) <br/>`unk` (Unknown) <br/><br/> If the language code is unspecified or null, the language is set to English. If the language is explicitly set to "unk", the language is auto-detected. </p> |
+|`defaultLanguageCode`| Language code of the input text. Supported languages include: <br/> `zh-Hans` (ChineseSimplified) <br/> `zh-Hant` (ChineseTraditional) <br/>`cs` (Czech) <br/>`da` (Danish) <br/>`nl` (Dutch) <br/>`en` (English) <br/>`fi` (Finnish) <br/>`fr` (French) <br/>`de` (German) <br/>`el` (Greek) <br/>`hu` (Hungarian) <br/>`it` (Italian) <br/>`ja` (Japanese) <br/>`ko` (Korean) <br/>`nb` (Norwegian) <br/>`pl` (Polish) <br/>`pt` (Portuguese) <br/>`ru` (Russian) <br/>`es` (Spanish) <br/>`sv` (Swedish) <br/>`tr` (Turkish) <br/>`ar` (Arabic) <br/>`ro` (Romanian) <br/>`sr-Cyrl` (SerbianCyrillic) <br/>`sr-Latn` (SerbianLatin) <br/>`sk` (Slovak) <br/>`unk` (Unknown) <br/><br/> If the language code is unspecified or null, the language is set to English. If the language is explicitly set to `unk`, the language is auto-detected. </p> |
 |`lineEnding`| The value to use as a line separator. Possible values: "Space", "CarriageReturn", "LineFeed". The default is "Space". |

 In previous versions, there was a parameter called "textExtractionAlgorithm" to specify extraction of "printed" or "handwritten" text. This parameter is deprecated because the current Read API algorithm extracts both types of text at once. If your skill includes this parameter, you don't need to remove it, but it won't be used during skill execution.
@@ -131,9 +131,9 @@ In previous versions, there was a parameter called "textExtractionAlgorithm" to
 }
 ```

-## Sample: Merging text extracted from embedded images with the content of the document.
+## Sample: Merging text extracted from embedded images with the content of the document

-Document cracking, which is the first action in skillset execution, separates text and image content. A common use case for Text Merger is merging the textual representation of images (text from an OCR skill, or the caption of an image) into the content field of a documentfor scenarios where the source document is a PDF or Word document that includes text with embedded images.
+Document cracking, the first step in skillset execution, separates text and image content. A common use case for Text Merger is merging the textual representation of images (text from an OCR skill, or the caption of an image) into the content field of a document. This is for scenarios where the source document is a PDF or Word document that combines text with embedded images.

 The following example skillset creates a *merged_text* field. This field contains the textual content of your document and the OCRed text from each of the images embedded in that document.
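
The example skillset that the last context line refers to falls outside this hunk and is not reproduced here. Separately, the merge idea in the changed paragraph (placing the OCRed text of each embedded image back into the document's text) can be sketched roughly as below. This is only an illustration and not the Text Merger's implementation: the function name `merge_text`, the `(offset, text)` pair representation, and the `pre_tag`/`post_tag` parameters are all assumptions made for this sketch.

```python
from typing import Iterable, Tuple


def merge_text(
    content: str,
    inserts: Iterable[Tuple[int, str]],
    pre_tag: str = " ",
    post_tag: str = " ",
) -> str:
    """Insert each image's text into `content` at its character offset.

    Hypothetical helper: `inserts` holds (offset, text) pairs, where
    `offset` is a position in the ORIGINAL `content` string.
    """
    pieces = []
    cursor = 0
    # Process inserts in document order so the original offsets stay valid.
    for offset, text in sorted(inserts, key=lambda pair: pair[0]):
        if not 0 <= offset <= len(content):
            raise ValueError(f"offset {offset} is outside the content")
        pieces.append(content[cursor:offset])        # text before the image
        pieces.append(f"{pre_tag}{text}{post_tag}")  # the image's text
        cursor = offset
    pieces.append(content[cursor:])                  # remaining document text
    return "".join(pieces)


if __name__ == "__main__":
    body = "Intro text. Closing text."
    ocr_results = [(11, "[text read from an embedded image]")]
    print(merge_text(body, ocr_results))
```

Sorting by offset and slicing the original string once keeps every offset relative to the unmodified content, so earlier insertions do not shift later ones.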