
Commit ba4a0b3

Merge pull request #248346 from aahill/clu-best-practices
CLU best practices
2 parents 461a565 + b40a9aa commit ba4a0b3

File tree: 7 files changed (+268, -5 lines changed)
Lines changed: 73 additions & 0 deletions
@@ -0,0 +1,73 @@

---
title: When to choose conversational language understanding or orchestration workflow
titleSuffix: Azure AI services
description: Learn when to choose conversational language understanding or orchestration workflow
services: cognitive-services
author: aahill
manager: nitinme
ms.service: cognitive-services
ms.subservice: language-service
ms.topic: best-practice
ms.date: 08/15/2023
ms.author: aahi
ms.custom: language-service-clu
---

# When to use conversational language understanding or orchestration workflow apps

When you create large applications, you should consider whether your use case would be best served by a single conversational app (flat architecture) or multiple apps that are orchestrated.

## Orchestration overview

Orchestration workflow is a feature that allows you to connect different projects from [LUIS](../../../LUIS/what-is-luis.md), [conversational language understanding](../overview.md), and [custom question answering](../../question-answering/overview.md) in one project. You can then use this project for predictions using one endpoint. The orchestration project makes a prediction on which child project should be called, automatically routes the request, and returns its response.

The key point is that orchestration involves two steps:

1. Predicting which child project to call. <!--The model that performs this classification can be trained either with a standard or an advanced recipe. (Please see footnotes on instructions for training with advanced recipe).-->
2. Routing the utterance to the destination child app, and returning the child app's response.
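
For example, a prediction request sent to an orchestration project uses the same `:analyze-conversations` request format as any other conversation project. The following body is a minimal sketch; the project and deployment names are placeholders:

```json
{
    "kind": "Conversation",
    "analysisInput": {
        "conversationItem": {
            "id": "1",
            "text": "How do I use a jack?",
            "modality": "text",
            "language": "en-us",
            "participantId": "1"
        }
    },
    "parameters": {
        "projectName": "<your-orchestration-project>",
        "deploymentName": "<your-deployment>",
        "stringIndexType": "TextElement_V8"
    }
}
```

The response indicates which child project the utterance was routed to, together with that child project's own prediction.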

### Advantages

* Clear decomposition and faster development:
    * If your overall schema has a substantial number of domains, the orchestration approach can help decompose your application into several child apps (each serving a specific domain). For example, an automotive conversational app might have a *navigation domain*, a *media domain*, and so on.
    * Developing each domain app in parallel is easier. People and teams with specific domain expertise can work on individual apps collaboratively and in parallel.
    * Since each domain app is smaller, the development cycle becomes faster. Smaller sized domain apps take much less time to train than a single large app.
* More flexible [confidence score thresholds](/legal/cognitive-services/clu/clu-characteristics-and-limitations?context=/azure/ai-services/language-service/context/context#understand-confidence-scores):
    * Since there are separate child apps serving each domain, it's easy to set separate thresholds for different child apps.
* AI quality improvements where appropriate:
    * Some applications require that certain entities are domain restricted. Orchestration makes this easy to achieve. Once the orchestration project has predicted which child app should be called, the other child apps won't be called.

For example, if your app contains a `Person.Name` prebuilt entity, consider the utterance *"How do I use a jack?"*, in the context of a vehicle question. In this context, *jack* is an automotive tool, and shouldn’t be recognized as a person's name. Using orchestration, this utterance can be redirected to a child app created to answer such questions, which doesn’t have a `Person.Name` entity.

### Disadvantages

* Redundant entities in child apps:
    * If you need a particular prebuilt entity to be returned in all utterances irrespective of the domain, for example `Quantity.Number` or `Geography.Location`, there is no way to add an entity to the orchestration app (it is an intent-only model). You would need to add it to every individual child app.
* Efficiency:
    * Orchestration apps take two model inferences: one to predict which child app to call, and another for the prediction in the child app. Inference times will typically be slower than single apps with a flat architecture.
* Train/test split for orchestrator:
    * Training an orchestration app does not allow you to granularly split data between the testing and training sets. For example, you cannot train a 90-10 split for child app A, and then an 80-20 split for child app B. This may be a minor point, but it's worth keeping in mind.

## Flat architecture overview

Flat architecture is the other method of developing conversational apps. Instead of using an orchestration app to send utterances to one of multiple child apps, you develop a singular (or flat) app to handle utterances.

### Advantages

* Simplicity:
    * For small apps or domains, the orchestrator approach can be overly complex.
    * Since all intents and entities are at the same app level, it might be easier to make changes to all of them together.
* It's easier to add entities that should always be returned:
    * If you want certain prebuilt or list entities to be returned for all utterances, you only need to add them alongside other entities in a single app. If you use orchestration, as mentioned above, you would need to add them to every child app.

### Disadvantages

* Unwieldy for large apps:
    * For large apps (say, more than 50 intents or entities), it can become difficult to keep track of evolving schemas and datasets. This is particularly evident in cases where the app has to serve several domains. For example, an automotive conversational app might have a *navigation domain*, a *media domain*, and so on.
* Limited control over entity matches:
    * In a flat architecture, there is no way to restrict entities to be returned only in certain cases. You can accomplish this using orchestration by assigning those specific entities to particular child apps.

## Next steps
* [Orchestration workflow overview](../../orchestration-workflow/overview.md)
* [Conversational language understanding overview](../overview.md)

articles/ai-services/language-service/conversational-language-understanding/concepts/best-practices.md

Lines changed: 130 additions & 3 deletions
@@ -8,7 +8,7 @@ manager: nitinme
ms.service: cognitive-services
ms.subservice: language-service
ms.topic: best-practice
ms.date: 08/30/2023
ms.author: aahi
ms.custom: language-service-clu
---
@@ -26,22 +26,149 @@ Schema is the definition of your intents and entities. There are different appro

You can typically think of actions and queries as _intents_, and the information required to fulfill those queries as _entities_.

For example, assume you want your customers to cancel subscriptions for various products that you offer through your chatbot. You can create a _Cancel_ intent with various examples like _"Cancel the Contoso service,"_ or _"stop charging me for the Fabrikam subscription."_ The user's intent here is to _cancel_, and the _Contoso service_ or _Fabrikam subscription_ are the subscriptions they would like to cancel. Therefore, you can create an entity for _subscriptions_. You can then model your entire project to capture actions as intents and use entities to fill in those actions. This allows you to cancel anything you define as an entity, such as other products. You can then have intents for signing up, renewing, upgrading, etc. that all make use of the _subscriptions_ and other entities.

The above schema design makes it easy for you to extend existing capabilities (canceling, upgrading, signing up) to new targets by creating a new entity.

Another approach is to model the _information_ as intents and _actions_ as entities. Let's take the same example, allowing your customers to cancel subscriptions through your chatbot. You can create an intent for each subscription available, such as _Contoso_ with utterances like _"cancel Contoso,"_ _"stop charging me for contoso services,"_ and _"Cancel the Contoso subscription."_ You would then create an entity to capture the action, _cancel_. You can define different entities for each action or consolidate actions as one entity with a list component to differentiate between actions with different keys.

This schema design makes it easy for you to extend new actions to existing targets by adding new action entities or entity components.

Make sure to avoid trying to funnel all the concepts into intents. For example, don't try to create a _Cancel Contoso_ intent that only has the purpose of that one specific action. Intents and entities should work together to capture all the required information from the customer.

You also want to avoid mixing different schema designs. Do not build half of your application with actions as intents and the other half with information as intents. Ensure it is consistent to get the best possible results.
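
To illustrate the first approach, here's a hedged sketch of how a _Cancel_ intent and a _subscriptions_ entity with a list component might appear in an exported project file. The field names are illustrative and may differ depending on your project file version:

```json
{
    "assets": {
        "intents": [
            { "category": "Cancel" },
            { "category": "Upgrade" }
        ],
        "entities": [
            {
                "category": "Subscription",
                "list": {
                    "sublists": [
                        {
                            "listKey": "Contoso",
                            "synonyms": [
                                { "language": "en-us", "values": [ "Contoso service", "Contoso subscription" ] }
                            ]
                        }
                    ]
                }
            }
        ]
    }
}
```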

[!INCLUDE [Balance training data](../includes/balance-training-data.md)]

[!INCLUDE [Label data](../includes/label-data-best-practices.md)]

## Use standard training before advanced training

[Standard training](../how-to/train-model.md#training-modes) is free and faster than advanced training, making it useful for quickly understanding the effect of changing your training set or schema while building the model. Once you are satisfied with the schema, consider using advanced training to get the best AI quality out of your model.
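
For reference, the training mode is selected in the body of the training request. A minimal sketch of the relevant fields (the full request format appears later in this article):

```json
{
    "modelLabel": "<modelLabel>",
    "trainingMode": "standard"
}
```

Switch `trainingMode` to `advanced` (optionally with a specific `trainingConfigVersion`) once your schema has stabilized.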

## Use the evaluation feature

When you build an app, it's often helpful to catch errors early. It's usually a good practice to add a test set, as training and evaluation results are very useful in identifying errors or issues in your schema.

## Machine-learning components and composition

See [Component types](./entity-components.md#component-types).

## Using the "none" score threshold

If you see too many false positives, such as out-of-context utterances being marked as valid intents, see [confidence threshold](./none-intent.md) for information on how it affects inference.

* Non-machine-learned entity components, like lists and regex, are by definition not contextual. If you see list or regex entities in unintended places, try labeling the list synonyms as the machine-learned component.
* For entities, you can use the learned component as the "Required" component, to restrict when a composed entity should fire.

For example, suppose you have an entity called "*ticket quantity*" that attempts to extract the number of tickets you want to reserve for booking flights, for utterances such as "*Book two tickets tomorrow to Cairo.*"

Typically, you would add a prebuilt component for `Quantity.Number` that already extracts all numbers in utterances. However, if your entity was only defined with the prebuilt component, it would also extract other numbers as part of the *ticket quantity* entity, as in "*Book two tickets tomorrow to Cairo at 3 PM.*"

To resolve this, you would label a learned component in your training data for all the numbers that are meant to be a *ticket quantity*. The entity now has two components:
* The prebuilt component that can interpret all numbers, and
* The learned component that predicts where the *ticket quantity* is in a sentence.

If you require the learned component, you make sure that *ticket quantity* is only returned when the learned component predicts it in the right context. If you also require the prebuilt component, you can then guarantee that the returned *ticket quantity* entity is both a number and in the correct position.
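
As a sketch, an entity defined this way might look like the following in an exported project file. Treat the field names (such as `requiredComponents`) as illustrative; they depend on the API version you use:

```json
{
    "category": "ticket quantity",
    "compositionSetting": "combineComponents",
    "prebuilts": [
        { "category": "Quantity.Number" }
    ],
    "requiredComponents": [ "learned" ]
}
```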

## Addressing casing inconsistencies

If you have poor AI quality and determine that the casing used in your training data is dissimilar to the testing data, you can use the `normalizeCasing` project setting. This normalizes the casing of utterances when training and testing the model. If you've migrated from LUIS, you might recognize that LUIS did this by default.

```json
{
    "projectFileVersion": "2022-10-01-preview",
    ...
    "settings": {
        "confidenceThreshold": 0.5,
        "normalizeCasing": true
    }
    ...
}
```

## Addressing model overconfidence

If the model is incorrectly overconfident, customers can use the LoraNorm recipe version. An example of this behavior is shown in the following table, where the model predicts the incorrect intent with 100% confidence. This makes the confidence threshold project setting unusable.

| Text | Predicted intent | Confidence score |
|----|----|----|
| "*Who built the Eiffel Tower?*" | `Sports` | 1.00 |
| "*Do I look good to you today?*" | `QueryWeather` | 1.00 |
| "*I hope you have a good evening.*" | `Alarm` | 1.00 |

To address this, use the `2023-04-15` configuration version that normalizes confidence scores. The confidence threshold project setting can then be adjusted to achieve the desired result.

```console
curl --location 'https://<your-resource>.cognitiveservices.azure.com/language/authoring/analyze-conversations/projects/<your-project>/:train?api-version=2022-10-01-preview' \
--header 'Ocp-Apim-Subscription-Key: <your subscription key>' \
--header 'Content-Type: application/json' \
--data '{
      "modelLabel": "<modelLabel>",
      "trainingMode": "advanced",
      "trainingConfigVersion": "2023-04-15",
      "evaluationOptions": {
            "kind": "percentage",
            "testingSplitPercentage": 0,
            "trainingSplitPercentage": 100
      }
}'
```

Once the request is sent, you can track the progress of the training job in Language Studio as usual.

> [!NOTE]
> You have to retrain your model after updating the `confidenceThreshold` project setting. Afterwards, you'll need to republish the app for the new threshold to take effect.

## Debugging composed entities

Entities are functions that emit spans in your input with an associated type. The function is defined by one or more components. You can mark components as needed, and you can decide whether to enable the *combine components* setting. When you combine components, all spans that overlap are merged into a single span. If the setting isn't used, each individual component span is emitted.

To better understand how individual components are performing, you can disable the setting and set each component to "not required". This lets you inspect the individual spans that are emitted, and experiment with removing components so that only problematic components are generated.

## Evaluate a model using multiple test sets

Data in a conversational language understanding project can have two data sets: a "testing" set and a "training" set. If you want to use multiple test sets to evaluate your model, you can:

* Give your test sets different names (for example, "test1" and "test2").
* Export your project to get a JSON file with its parameters and configuration (the sketch following this list shows the relevant part of the file).
* Use the JSON to import a new project, and rename your second desired test set to "test".
* Train the model to run the evaluation using your second test set.
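
For example, assuming the exported file assigns each utterance to a dataset by name, the utterances that previously belonged to your second test set (such as "test2") are renamed so that their dataset value is the one the service treats as the test set. A hedged sketch; field names are illustrative and depend on your project file version:

```json
{
    "assets": {
        "utterances": [
            {
                "text": "Cancel the Contoso service",
                "intent": "Cancel",
                "language": "en-us",
                "dataset": "Train"
            },
            {
                "text": "Stop charging me for the Fabrikam subscription",
                "intent": "Cancel",
                "language": "en-us",
                "dataset": "Test"
            }
        ]
    }
}
```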

## Custom parameters for target apps and child apps

If you are using [orchestrated apps](./app-architecture.md), you may want to send custom parameter overrides for various child apps. The `targetProjectParameters` field allows users to send a dictionary representing the parameters for each target project. For example, consider an orchestrator app named `Orchestrator` orchestrating between a conversational language understanding app named `CLU1` and a custom question answering app named `CQA1`. If you want to send a parameter named "top" to the question answering app, you can use the `targetProjectParameters` field as shown in the following example.

```console
curl --request POST \
--url 'https://<your-language-resource>.cognitiveservices.azure.com/language/:analyze-conversations?api-version=2022-10-01-preview' \
--header 'ocp-apim-subscription-key: <your subscription key>' \
--data '{
    "kind": "Conversation",
    "analysisInput": {
        "conversationItem": {
            "id": "1",
            "text": "Turn down the volume",
            "modality": "text",
            "language": "en-us",
            "participantId": "1"
        }
    },
    "parameters": {
        "projectName": "Orchestrator",
        "verbose": true,
        "deploymentName": "std",
        "stringIndexType": "TextElement_V8",
        "targetProjectParameters": {
            "CQA1": {
                "targetProjectKind": "QuestionAnswering",
                "callingOptions": {
                    "top": 1
                }
            }
        }
    }
}'
```

articles/ai-services/language-service/conversational-language-understanding/how-to/tag-utterances.md

Lines changed: 3 additions & 1 deletion
@@ -8,7 +8,7 @@ manager: nitinme
ms.service: cognitive-services
ms.subservice: language-service
ms.topic: how-to
ms.date: 08/25/2023
ms.author: aahi
ms.custom: language-service-clu, ignite-fall-2021
---
@@ -41,6 +41,8 @@ As you add utterances and label them, keep in mind:
* **Label consistently**: The same entity should have the same label across all the utterances.
* **Label completely**: Provide varied utterances for every intent. Label all the instances of the entity in all your utterances.

[!INCLUDE [Label data best practices](../includes/label-data-best-practices.md)]

* For [Multilingual projects](../language-support.md#multi-lingual-option), adding utterances in other languages increases the model's performance in these languages, but avoid duplicating your data across all the languages you would like to support. For example, to improve a calendar bot's performance with users, a developer might add examples mostly in English, and a few in Spanish or French as well. They might add utterances such as:

    * "_Set a meeting with **Matt** and **Kevin** **tomorrow** at **12 PM**._" (English)

articles/ai-services/language-service/conversational-language-understanding/how-to/train-model.md

Lines changed: 5 additions & 1 deletion
@@ -8,7 +8,7 @@ manager: nitinme
ms.service: cognitive-services
ms.subservice: language-service
ms.topic: how-to
ms.date: 08/25/2023
ms.author: aahi
ms.custom: language-service-clu, ignite-fall-2021
---
@@ -30,6 +30,10 @@ Model evaluation is triggered automatically after training is completed successf

<!--See the [project development lifecycle](../overview.md#project-development-lifecycle) for more information.-->

[!INCLUDE [Balance training data](../includes/balance-training-data.md)]

## Data splitting

Before you start the training process, labeled utterances in your project are divided into a training set and a testing set. Each one of them serves a different function.
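
If you train through the REST API, this split is controlled by the `evaluationOptions` object in the training request body. A hedged sketch with an 80/20 percentage split is shown below; the service can also evaluate against a manually assigned testing set:

```json
{
    "modelLabel": "<modelLabel>",
    "trainingMode": "standard",
    "evaluationOptions": {
        "kind": "percentage",
        "testingSplitPercentage": 20,
        "trainingSplitPercentage": 80
    }
}
```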
