You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/language-service/conversational-language-understanding/concepts/data-formats.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,7 +17,7 @@ If you're uploading your data into conversational language understanding, it mus
17
17
18
18
## Import project file format
19
19
20
-
If you're [importing a project](../how-to/create-project.md#import-a-project) into conversational language understanding, the file uploaded must be in the following format:
20
+
If you're [importing a project](../how-to/create-project.md#import-an-existing-azure-ai-project) into conversational language understanding, the file uploaded must be in the following format:
21
21
22
22
```json
23
23
{
@@ -101,7 +101,7 @@ If you're [importing a project](../how-to/create-project.md#import-a-project) in
101
101
|Key |Placeholder |Value | Example |
102
102
|---------|---------|----------|--|
103
103
|`{API-VERSION}`| The [version](../../concepts/model-lifecycle.md#api-versions) of the API you're calling. |`2023-04-01`|
104
-
|`confidenceThreshold`|`{CONFIDENCE-THRESHOLD}`|This is the threshold score below which the intent is predicted as [None intent](none-intent.md). Values are from `0` to `1`.|`0.7`|
104
+
|`confidenceThreshold`|`{CONFIDENCE-THRESHOLD}`|The threshold score for which the intent is predicted as [None intent](none-intent.md). Values are from `0` to `1`.|`0.7`|
105
105
|`projectName`|`{PROJECT-NAME}`| The name of your project. This value is case sensitive. |`EmailApp`|
106
106
|`multilingual`|`true`| A Boolean value that enables you to have utterances in multiple languages in your dataset. When your model is deployed, you can query the model in any supported language (not necessarily included in your training documents. For more information about supported language codes, see [Language support](../language-support.md#multi-lingual-option). |`true`|
107
107
|`sublists`|`[]`|Array that contains sublists. Each sublist is a key and its associated values.|`[]`|
@@ -110,7 +110,7 @@ If you're [importing a project](../how-to/create-project.md#import-a-project) in
110
110
|`language`|`{LANGUAGE-CODE}`| A string specifying the language code for the utterances, synonyms, and regular expressions used in your project. If your project is a multilingual project, choose the [language code](../language-support.md) of most the utterances. |`en-us`|
111
111
|`intents`|`[]`| Array that contains all the intents you have in the project. These intents are classified from your utterances.|`[]`|
112
112
|`entities`|`[]`| Array that contains all the entities in your project. These entities are extracted from your utterances. Every entity can have other optional components defined with them: list, prebuilt, or regex. |`[]`|
113
-
|`dataset`|`{DATASET}`| The test set to which this utterance goes to when it's split before training. To learn more about data splitting, see [Train your conversational language understanding model](../how-to/train-model.md#data-splitting). Possible values for this field are `Train` and `Test`. |`Train`|
113
+
|`dataset`|`{DATASET}`| The test set that this utterance is assigned to when the data is split before training. To learn more about data splitting, see [Train your conversational language understanding model](../how-to/train-model.md#data-splitting). Possible values for this field are `Train` and `Test`. |`Train`|
114
114
|`category`|``| The type of entity associated with the span of text specified. |`Entity1`|
115
115
|`offset`|``| The inclusive character position of the start of the entity. |`5`|
116
116
|`length`|``| The character length of the entity. |`5`|
@@ -166,7 +166,7 @@ Conversational language understanding offers the option to upload your utterance
|`language`|`{LANGUAGE-CODE}`| A string that specifies the language code for the utterances used in your project. If your project is a multilingual project, choose the language code of most of the utterances. For more information about supported language codes, see [Language support](../language-support.md). |`en-us`|
169
-
|`dataset`|`{DATASET}`| The test set to which this utterance goes to when it's split before training. To learn more about data splitting, see [Train your conversational language understanding model](../how-to/train-model.md#data-splitting). Possible values for this field are `Train` and `Test`. |`Train`|
169
+
|`dataset`|`{DATASET}`| The test set that this utterance is assigned to when the data is split before training. To learn more about data splitting, see [Train your conversational language understanding model](../how-to/train-model.md#data-splitting). Possible values for this field are `Train` and `Test`. |`Train`|
|`entity`|`{entity}`|The entity to be extracted.| entity1|
172
172
|`category`|``| The type of entity associated with the span of text specified. |`Entity1`|
@@ -175,5 +175,5 @@ Conversational language understanding offers the option to upload your utterance
175
175
176
176
## Related content
177
177
178
-
* For more information on importing your labeled data into your project directly, see [Import project](../how-to/create-project.md#import-a-project).
178
+
* For more information on importing your labeled data into your project directly, see [Import project](../how-to/create-project.md#import-an-existing-azure-ai-project).
179
179
* For more information about labeling your data, see [Label your utterances in Language Studio](../how-to/tag-utterances.md). After you label your data, you can [train your model](../how-to/train-model.md).
Copy file name to clipboardExpand all lines: articles/ai-services/language-service/conversational-language-understanding/how-to/tag-utterances.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -15,7 +15,7 @@ ms.custom: language-service-clu
15
15
16
16
After you [build a schema](build-schema.md) for your fine-tuning task, you add training utterances to your project. The utterances should be similar to what your users use when they interact with the project. When you add an utterance, you have to assign which intent it belongs to. After the utterance is added, label the words within your utterance that you want to extract as entities.
17
17
18
-
Data labeling is a crucial step in the conversational language understanding (CLU) trained development lifecycle. This data is used in the next step when you train your model so that your model can learn from the labeled data. If you already labeled utterances, you can directly [import them into your project](create-project.md#import-a-project), if your data follows the [accepted data format](../concepts/data-formats.md). To learn more about importing labeled data, see [Create a CLU fine-tuning task](create-project.md#import-a-project). Labeled data informs the model about how to interpret text and is used for training and evaluation.
18
+
Data labeling is a crucial step in the conversational language understanding (CLU) trained development lifecycle. This data is used in the next step when you train your model so that your model can learn from the labeled data. If you already labeled utterances, you can directly [import them into your project](create-project.md#import-an-existing-azure-ai-project), if your data follows the [accepted data format](../concepts/data-formats.md). To learn more about importing labeled data, see [Create a CLU fine-tuning task](create-project.md#import-an-existing-azure-ai-project). Labeled data informs the model about how to interpret text and is used for training and evaluation.
19
19
20
20
> [!TIP]
21
21
> Use the **Quick Deploy** option to implement custom CLU intent routing, which is powered by your own large language model deployment without adding or labeling any training data.
@@ -43,9 +43,9 @@ As you add utterances and label them, keep in mind:
43
43
44
44
* For [multilingual projects](../language-support.md#multi-lingual-option), adding utterances in other languages increases the model's performance in these languages. Avoid duplicating your data across all the languages that you want to support. For example, to improve a calender bot's performance with users, a developer might add examples mostly in English and a few in Spanish or French. They might add utterances such as:
45
45
46
-
*"Set a meeting with **Matt** and **Kevin****tomorrow** at **12 PM**." (English)
47
-
*"Reply as **tentative** to the **weekly update** meeting." (English)
48
-
*"Cancelar mi **próxima** reunión." (Spanish)
46
+
*`Set a meeting with **Matt** and **Kevin** **tomorrow** at **12 PM**.` (English)
47
+
*`Reply as **tentative** to the **weekly update** meeting.` (English)
48
+
*`Cancelar mi **próxima** reunión.` (Spanish)
49
49
50
50
## Label your utterances
51
51
@@ -111,7 +111,7 @@ Before you get started, the suggested utterances feature is available only if yo
111
111
112
112
On the **Data labeling** page:
113
113
114
-
1. Select **Suggest utterances**. A pane opens on the right and prompts you to select your Azure OpenAI resource and deployment.
114
+
1. Select **Suggest utterances**. A pane opens and prompts you to select your Azure OpenAI resource and deployment.
115
115
1. After you select an Azure OpenAI resource, select **Connect** so that your Language resource has direct access to your Azure OpenAI resource. It assigns your Language resource the **Cognitive Services User** role to your Azure OpenAI resource. Now your current Language resource has access to Azure OpenAI. If the connection fails, follow [these steps](#add-required-configurations-to-azure-openai-resource) to manually add the correct role to your Azure OpenAI resource.
116
116
1. After the resource is connected, select the deployment. The model that we recommend for the Azure OpenAI deployment is `gpt-35-turbo-instruct`.
117
117
1. Select the intent for which you want to get suggestions. Make sure the intent that you selected has at least five saved utterances to be enabled for utterance suggestions. The suggestions provided by Azure OpenAI are based on the most recent utterances that you added for that intent.
0 commit comments