Commit 30d0e93

Merge pull request #107659 from Careyjmac/customEntityWarning
[Azure Cognitive Search] Add warning detail for custom entity lookup skill
2 parents 7057dc9 + 64afbd0 commit 30d0e93

1 file changed

+18
-12
lines changed

articles/search/cognitive-search-skill-custom-entity-lookup.md

Lines changed: 18 additions & 12 deletions
Original file line number | Diff line number | Diff line change
@@ -11,7 +11,7 @@ ms.topic: conceptual
1111
ms.date: 01/30/2020
1212
---
1313

14-
# Custom Entity Lookup cognitive skill (Preview)
14+
# Custom Entity Lookup cognitive skill (Preview)
1515

1616
> [!IMPORTANT]
1717
> This skill is currently in public preview. Preview functionality is provided without a service level agreement, and is not recommended for production workloads. For more information, see [Supplemental Terms of Use for Microsoft Azure Previews](https://azure.microsoft.com/support/legal/preview-supplemental-terms/). There is currently no portal or .NET SDK support.
@@ -32,25 +32,25 @@ Microsoft.Skills.Text.CustomEntityLookupSkill
3232

3333
Parameters are case-sensitive.
3434

35-
| Parameter name | Description |
35+
| Parameter name | Description |
3636
|--------------------|-------------|
37-
| entitiesDefinitionUri | Path to a JSON or CSV file containing all the target text to match against. This entity definition is read at the beginning of an indexer run; any updates to this file mid-run wont be realized until subsequent runs. This config must be accessible over HTTPS. See [Custom Entity Definition](#custom-entity-definition-format) Format below for expected CSV or JSON schema.|
37+
| entitiesDefinitionUri | Path to a JSON or CSV file containing all the target text to match against. This entity definition is read at the beginning of an indexer run; any updates to this file mid-run won't be realized until subsequent runs. This config must be accessible over HTTPS. See [Custom Entity Definition](#custom-entity-definition-format) Format" below for expected CSV or JSON schema.|
3838
|inlineEntitiesDefinition | Inline JSON entity definitions. This parameter supersedes the entitiesDefinitionUri parameter if present. No more than 10 KB of configuration may be provided inline. See [Custom Entity Definition](#custom-entity-definition-format) below for expected JSON schema. |
39-
|defaultLanguageCode | (Optional) Language code of the input text used to tokenize and delineate input text. The following languages are supported: `da, de, en, es, fi, fr, it, ko, pt`. The default is English (`en`). If you pass a languagecode-countrycode format, only the languagecode part of the format is used. |
39+
|defaultLanguageCode | (Optional) Language code of the input text used to tokenize and delineate input text. The following languages are supported: `da, de, en, es, fi, fr, it, ko, pt`. The default is English (`en`). If you pass a languagecode-countrycode format, only the languagecode part of the format is used. |
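The `defaultLanguageCode` row above says that only the language part of a languagecode-countrycode value is used, and that the default is `en`. A minimal sketch of that rule follows. The helper names `effective_language` and `is_supported` are hypothetical, and this mimics the documented behavior rather than the service's actual implementation. Values are not lowercased here, since the page states that parameters are case-sensitive.

```python
# Languages listed in the defaultLanguageCode row of the parameter table.
SUPPORTED_LANGUAGES = {"da", "de", "en", "es", "fi", "fr", "it", "ko", "pt"}
DEFAULT_LANGUAGE = "en"


def effective_language(code=None):
    """Return the language part of a code such as 'en-US' (hypothetical helper).

    Falls back to the documented default when no code is given.
    """
    if not code:
        return DEFAULT_LANGUAGE
    # Keep only the text before the first hyphen: 'pt-BR' -> 'pt'.
    return code.split("-", 1)[0]


def is_supported(code=None):
    """Check the language part against the documented list."""
    return effective_language(code) in SUPPORTED_LANGUAGES
```

A check like this could run before a skillset is submitted, to catch an unsupported language code early.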
4040

4141

4242
## Skill inputs
4343

44-
| Input name | Description |
44+
| Input name | Description |
4545
|---------------|-------------------------------|
4646
| text | The text to analyze. |
47-
| languageCode | Optional. Default is `"en"`. |
47+
| languageCode | Optional. Default is `"en"`. |
4848

4949

5050
## Skill outputs
5151

5252

53-
| Output name | Description |
53+
| Output name | Description |
5454
|---------------|-------------------------------|
5555
| entities | An array of objects that contain information about the matches that were found, and related metadata. Each of the entities identified may contain the following fields: <ul> <li> *name*: The top-level entity identified. The entity represents the "normalized" form. </li> <li> *id*: A unique identifier for the entity as defined by the user in the "Custom Entity Definition Format".</li> <li> *description*: Entity description as defined by the user in the "Custom Entity Definition Format". </li> <li> *type:* Entity type as defined by the user in the "Custom Entity Definition Format".</li> <li> *subtype:* Entity subtype as defined by the user in the "Custom Entity Definition Format".</li> <li> *matches*: Collection that describes each of the matches for that entity on the source text. Each match will have the following members: </li> <ul> <li> *text*: The raw text match from the source document. </li> <li> *offset*: The location where the match was found in the text. </li> <li> *length*: The length of the matched text. </li> <li> *matchDistance*: The number of characters different this match was from original entity name or alias. </li> </ul> </ul>
5656
|
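As a rough illustration of how the `offset` and `length` fields in the outputs table might be consumed, the sketch below slices each match back out of the source text. The `extract_matches` helper and the sample data are hypothetical. It also assumes that `offset` is a zero-based character index and that `length` is a character count, which the table does not state.

```python
def extract_matches(text, entities):
    """Return (entity name, matched substring) pairs (hypothetical helper).

    Assumes `offset` is a zero-based character index into `text` and
    `length` is a character count; the source does not confirm either.
    """
    pairs = []
    for entity in entities:
        for match in entity.get("matches", []):
            start = match["offset"]
            end = start + match["length"]
            pairs.append((entity["name"], text[start:end]))
    return pairs


# Hypothetical data shaped like the `entities` output described above.
sample_text = "Bill Gates founded Microsoft."
sample_entities = [
    {
        "name": "Microsoft",
        "matches": [
            {"text": "Microsoft", "offset": 19, "length": 9, "matchDistance": 0}
        ],
    }
]
```

Comparing the sliced substring with the match's own `text` field is one way to test the offset assumption against real skill output.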
@@ -155,16 +155,16 @@ The tables below describe in more details the different configuration parameters
155155
| Alias properties | Description |
156156
|------------------|-------------|
157157
| text | The alternative spelling or representation of some target entity name. |
158-
| caseSensitive | (Optional) Acts the same as root entity caseSensitive parameter above, but applies to only this one alias. |
159-
| fuzzyEditDistance | (Optional) Acts the same as root entity fuzzyEditDistance parameter above, but applies to only this one alias. |
158+
| caseSensitive | (Optional) Acts the same as root entity "caseSensitive" parameter above, but applies to only this one alias. |
159+
| fuzzyEditDistance | (Optional) Acts the same as root entity "fuzzyEditDistance" parameter above, but applies to only this one alias. |
160160

161161

162162
### Inline format
163163

164164
In some cases, it may be more convenient to provide the list of custom entities to match inline directly into the skill definition. In that case you can use a similar JSON format to the one described above, but it is inlined in the skill definition.
165165
Only configurations that are less than 10 KB in size (serialized size) can be defined inline.
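The 10 KB limit on inline definitions can be checked locally before a skill definition is sent. The sketch below is one way to do that. The `fits_inline` helper is hypothetical, and it assumes that "10 KB" means 10,240 bytes of UTF-8 encoded JSON; the service may measure serialized size differently.

```python
import json

# Assumption: "10 KB" is read as 10 * 1024 bytes; the source does not specify.
MAX_INLINE_BYTES = 10 * 1024


def fits_inline(entities):
    """Return True if the serialized definition is under the assumed limit.

    Hypothetical helper; the size is measured on compact UTF-8 JSON, which
    may not match how the service computes serialized size.
    """
    serialized = json.dumps(entities, ensure_ascii=False, separators=(",", ":"))
    return len(serialized.encode("utf-8")) < MAX_INLINE_BYTES
```

If the check fails, the `entitiesDefinitionUri` parameter described earlier is the alternative to inlining.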
166166

167-
## Sample definition
167+
## Sample definition
168168

169169
A sample skill definition using an inline format is shown below:
170170

@@ -227,7 +227,7 @@ Alternatively, if you decide to provide a pointer to the entities definition fil
227227

228228
```
229229

230-
## Sample input
230+
## Sample input
231231

232232
```json
233233
{
@@ -244,7 +244,7 @@ Alternatively, if you decide to provide a pointer to the entities definition fil
244244
}
245245
```
246246

247-
## Sample output
247+
## Sample output
248248

249249
```json
250250
{
@@ -292,6 +292,12 @@ Alternatively, if you decide to provide a pointer to the entities definition fil
292292
}
293293
```
294294

295+
## Errors and warnings
296+
297+
### Warning: Reached maximum capacity for matches, skipping all further duplicate matches.
298+
299+
This warning will be emitted if the number of matches detected is greater than the maximum allowed. In this case, we will stop including duplicate matches. If this is unacceptable to you, please file a [support ticket](https://ms.portal.azure.com/#create/Microsoft.Support) so we can assist you with your individual use case.
300+
295301
## See also
296302

297303
+ [Built-in skills](cognitive-search-predefined-skills.md)
