articles/search/cognitive-search-skill-custom-entity-lookup.md
Lines changed: 18 additions & 12 deletions
@@ -11,7 +11,7 @@ ms.topic: conceptual
ms.date: 01/30/2020
---

-#Custom Entity Lookup cognitive skill (Preview)
+#Custom Entity Lookup cognitive skill (Preview)

> [!IMPORTANT]
> This skill is currently in public preview. Preview functionality is provided without a service level agreement, and is not recommended for production workloads. For more information, see [Supplemental Terms of Use for Microsoft Azure Previews](https://azure.microsoft.com/support/legal/preview-supplemental-terms/). There is currently no portal or .NET SDK support.
-| entitiesDefinitionUri| Path to a JSON or CSV file containing all the target text to match against. This entity definition is read at the beginning of an indexer run; any updates to this file mid-run won’t be realized until subsequent runs. This config must be accessible over HTTPS. See [Custom Entity Definition](#custom-entity-definition-format) Format” below for expected CSV or JSON schema.|
+| entitiesDefinitionUri| Path to a JSON or CSV file containing all the target text to match against. This entity definition is read at the beginning of an indexer run; any updates to this file mid-run won't be realized until subsequent runs. This config must be accessible over HTTPS. See [Custom Entity Definition](#custom-entity-definition-format) Format" below for expected CSV or JSON schema.|
|inlineEntitiesDefinition | Inline JSON entity definitions. This parameter supersedes the entitiesDefinitionUri parameter if present. No more than 10 KB of configuration may be provided inline. See [Custom Entity Definition](#custom-entity-definition-format) below for expected JSON schema. |
-|defaultLanguageCode |(Optional) Language code of the input text used to tokenize and delineate input text. The following languages are supported: `da, de, en, es, fi, fr, it, ko, pt`. The default is English (`en`). If you pass a languagecode-countrycode format, only the languagecode part of the format is used. |
+|defaultLanguageCode |(Optional) Language code of the input text used to tokenize and delineate input text. The following languages are supported: `da, de, en, es, fi, fr, it, ko, pt`. The default is English (`en`). If you pass a languagecode-countrycode format, only the languagecode part of the format is used. |

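The `defaultLanguageCode` row says that only the language part of a languagecode-countrycode value is used and that the default is `en`. A minimal Python sketch of that rule follows; the helper name `effective_language_code` is hypothetical, and the service's actual parsing (for example, how it treats unsupported or differently cased codes) is not described in the article.

```python
from typing import Optional

# Hypothetical helper; illustrates the documented rule, not the service's code.
def effective_language_code(code: Optional[str] = None) -> str:
    """Return the language part of a code such as 'pt-BR', defaulting to 'en'."""
    if not code:
        return "en"  # documented default
    # Keep only the part before the first hyphen (languagecode-countrycode).
    return code.split("-", 1)[0]

print(effective_language_code("pt-BR"))
print(effective_language_code())
```
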
## Skill inputs

-| Input name| Description |
+| Input name| Description |
|---------------|-------------------------------|
| text | The text to analyze. |
-| languageCode| Optional. Default is `"en"`. |
+| languageCode| Optional. Default is `"en"`. |

## Skill outputs

-| Output name| Description |
+| Output name| Description |
|---------------|-------------------------------|
| entities | An array of objects that contain information about the matches that were found, and related metadata. Each of the entities identified may contain the following fields: <ul> <li> *name*: The top-level entity identified. The entity represents the "normalized" form. </li> <li> *id*: A unique identifier for the entity as defined by the user in the "Custom Entity Definition Format".</li> <li> *description*: Entity description as defined by the user in the "Custom Entity Definition Format". </li> <li> *type:* Entity type as defined by the user in the "Custom Entity Definition Format".</li> <li> *subtype:* Entity subtype as defined by the user in the "Custom Entity Definition Format".</li> <li> *matches*: Collection that describes each of the matches for that entity on the source text. Each match will have the following members: </li> <ul> <li> *text*: The raw text match from the source document. </li> <li> *offset*: The location where the match was found in the text. </li> <li> *length*: The length of the matched text. </li> <li> *matchDistance*: The number of characters different this match was from original entity name or alias. </li> </ul> </ul>
|
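To illustrate how the `offset` and `length` members of each match relate to the source text, the Python sketch below slices every match back out of the document. The sample text, the sample `entities` payload, and the helper name `extract_match_spans` are hypothetical. It also assumes `offset` is a zero-based character index, which the table above does not state.

```python
# Hypothetical helper; assumes zero-based character offsets.
def extract_match_spans(text, entities):
    """Return (entity name, matched substring) pairs from a skill output."""
    spans = []
    for entity in entities:
        for match in entity.get("matches", []):
            start = match["offset"]
            end = start + match["length"]
            spans.append((entity["name"], text[start:end]))
    return spans

# Made-up sample shaped like the fields described in the outputs table.
sample_text = "The Contoso office"
sample_entities = [
    {
        "name": "Contoso",
        "matches": [
            {"text": "Contoso", "offset": 4, "length": 7, "matchDistance": 0}
        ],
    }
]

print(extract_match_spans(sample_text, sample_entities))
```
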
@@ -155,16 +155,16 @@ The tables below describe in more details the different configuration parameters
| Alias properties | Description |
|------------------|-------------|
| text | The alternative spelling or representation of some target entity name. |
-| caseSensitive | (Optional) Acts the same as root entity “caseSensitive” parameter above, but applies to only this one alias. |
-| fuzzyEditDistance | (Optional) Acts the same as root entity “fuzzyEditDistance” parameter above, but applies to only this one alias. |
+| caseSensitive | (Optional) Acts the same as root entity "caseSensitive" parameter above, but applies to only this one alias. |
+| fuzzyEditDistance | (Optional) Acts the same as root entity "fuzzyEditDistance" parameter above, but applies to only this one alias. |

### Inline format
In some cases, it may be more convenient to provide the list of custom entities to match inline directly into the skill definition. In that case you can use a similar JSON format to the one described above, but it is inlined in the skill definition.
Only configurations that are less than 10 KB in size (serialized size) can be defined inline.
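One way to check ahead of time whether a definition is small enough to inline is to measure its serialized size, as in the Python sketch below. The helper name `fits_inline` is hypothetical. The sketch assumes that "10 KB" means 10 * 1024 bytes of UTF-8 encoded JSON; the article does not say how the service measures serialized size, so treat the result as an estimate.

```python
import json

# Assumption: "10 KB" is taken as 10 * 1024 bytes of UTF-8 JSON.
MAX_INLINE_BYTES = 10 * 1024

# Hypothetical helper; the service may serialize and measure differently.
def fits_inline(entity_definitions):
    """Return True if the serialized definitions are under the inline limit."""
    serialized = json.dumps(entity_definitions, ensure_ascii=False)
    return len(serialized.encode("utf-8")) < MAX_INLINE_BYTES

small = [{"name": "Contoso", "aliases": [{"text": "Contoso Ltd"}]}]
large = [{"name": "x" * 20000}]

print(fits_inline(small))
print(fits_inline(large))
```

If the check fails, the `entitiesDefinitionUri` parameter described earlier is the alternative for larger definitions.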

-##Sample definition
+##Sample definition

A sample skill definition using an inline format is shown below:
@@ -227,7 +227,7 @@ Alternatively, if you decide to provide a pointer to the entities definition fil
```

-##Sample input
+##Sample input

```json
{
@@ -244,7 +244,7 @@ Alternatively, if you decide to provide a pointer to the entities definition fil
}
```

-##Sample output
+##Sample output

```json
{
@@ -292,6 +292,12 @@ Alternatively, if you decide to provide a pointer to the entities definition fil
}
```

+## Errors and warnings
+
+### Warning: Reached maximum capacity for matches, skipping all further duplicate matches.
+
+This warning will be emitted if the number of matches detected is greater than the maximum allowed. In this case, we will stop including duplicate matches. If this is unacceptable to you, please file a [support ticket](https://ms.portal.azure.com/#create/Microsoft.Support) so we can assist you with your individual use case.