Merge pull request #178378 from mrbullwinkle/mrb_11_2_2021_updates_004

jborsecnik · web-flow · commit 709f91d4e87f · 2021-11-02T15:53:36.000-07:00
[Cognitive Services] Q&A Updates
diff --git a/articles/cognitive-services/language-service/question-answering/concepts/limits.md b/articles/cognitive-services/language-service/question-answering/concepts/limits.md
@@ -0,0 +1,132 @@
+---
+title: Limits and boundaries - question answering
+description: Question answering has meta-limits for parts of the knowledge base and service. It is important to keep your knowledge base within those limits in order to test and publish.
+ms.service: cognitive-services
+ms.subservice: language-service
+author: mrbullwinkle
+ms.author: mbullwin
+ms.topic: conceptual
+ms.date: 11/02/2021
+---
+
+# Project limits and boundaries
+
+Question answering limits provided below are a combination of the [Azure Cognitive Search pricing tier limits](../../../../search/search-limits-quotas-capacity.md) and question answering limits. Both sets of limits affect how many knowledge bases you can create per resource and how large each knowledge base can grow.
+
+## Knowledge bases
+
+The maximum number of knowledge bases is based on [Azure Cognitive Search tier limits](../../../../search/search-limits-quotas-capacity.md).
+
+|**Azure Cognitive Search tier** | **Free** | **Basic** |**S1** | **S2**| **S3** |**S3 HD**|
+|---|---|---|---|---|---|----|
+|Maximum number of published knowledge bases allowed|2|14|49|199|199|2,999|
+
+ For example, if your tier has 15 allowed indexes, you can publish 14 knowledge bases (one index per published knowledge base). The 15th index, `testkb`, is used for all the knowledge bases for authoring and testing.
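+
+The arithmetic in the example above can be sketched as follows. This is a minimal illustration, not part of any SDK; the function name `max_published_knowledge_bases` is hypothetical, and the only rule it encodes is the one stated above (one index is reserved for `testkb`).
+
+```python
+def max_published_knowledge_bases(allowed_indexes: int) -> int:
+    """Return how many knowledge bases can be published for a given index quota.
+
+    One index (`testkb`) is shared by all knowledge bases for authoring and
+    testing, so it is subtracted from the tier's allowed index count.
+    """
+    if allowed_indexes < 1:
+        raise ValueError("a tier needs at least one index for testkb")
+    return allowed_indexes - 1
+
+
+# The example from the text: 15 allowed indexes leave room for 14 published knowledge bases.
+print(max_published_knowledge_bases(15))
+```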
+
+## Extraction limits
+
+### File naming constraints
+
+File names may not include the following characters:
+
+|Do not use character|
+|--|
+|Single quote `'`|
+|Double quote `"`|
+
+### Maximum file size
+
+|Format|Max file size (MB)|
+|--|--|
+|`.docx`|10|
+|`.pdf`|25|
+|`.tsv`|10|
+|`.txt`|10|
+|`.xlsx`|3|
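+
+A client can check the naming and size constraints above before uploading a file. The following is a minimal sketch under stated assumptions: the helper name `check_source_file` is hypothetical, and 1 MB is assumed to mean 1,024 × 1,024 bytes, which this article does not specify.
+
+```python
+from pathlib import PurePath
+
+# Limits copied from the tables above (size in MB per file extension).
+MAX_FILE_SIZE_MB = {".docx": 10, ".pdf": 25, ".tsv": 10, ".txt": 10, ".xlsx": 3}
+FORBIDDEN_NAME_CHARS = ("'", '"')
+BYTES_PER_MB = 1024 * 1024  # assumption: binary megabytes
+
+
+def check_source_file(file_name: str, size_bytes: int) -> list[str]:
+    """Return a list of human-readable problems; an empty list means no issue was found."""
+    problems = []
+    for char in FORBIDDEN_NAME_CHARS:
+        if char in file_name:
+            problems.append(f"file name contains forbidden character {char}")
+    suffix = PurePath(file_name).suffix.lower()
+    limit_mb = MAX_FILE_SIZE_MB.get(suffix)
+    if limit_mb is None:
+        problems.append(f"unsupported file format: {suffix or '(none)'}")
+    elif size_bytes > limit_mb * BYTES_PER_MB:
+        problems.append(f"{suffix} files are limited to {limit_mb} MB")
+    return problems
+
+
+print(check_source_file("faq.pdf", 2 * BYTES_PER_MB))
+print(check_source_file("it's-large.xlsx", 4 * BYTES_PER_MB))
+```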
+
+### Maximum number of files
+
+> [!NOTE]
+> Question answering currently has no limits on the number of sources that can be added. Throughput is currently capped at 10 transactions per second for both management APIs and prediction APIs.
+
+### Maximum number of deep-links from URL
+
+The maximum number of deep-links that can be crawled for extraction of question answer pairs from a URL page is **20**.
+
+## Metadata limits
+
+Metadata is presented as a text-based `key:value` pair, such as `product:windows 10`. It is stored and compared in lower case. Maximum number of metadata fields is based on your **[Azure Cognitive Search tier limits](../../../../search/search-limits-quotas-capacity.md)**.
+
+If you choose to have projects with multiple languages in a single language resource, there is a dedicated test index per project/knowledge base, so the limit is applied per project/knowledge base in the language service.
+
+|**Azure Cognitive Search tier** | **Free** | **Basic** |**S1** | **S2**| **S3** |**S3 HD**|
+|---|---|---|---|---|---|----|
+|Maximum metadata fields per language service (per knowledge base)|1,000|100*|1,000|1,000|1,000|1,000|
+
+If you don't choose the option to have projects with multiple different languages, then the limits are applied across all knowledge bases in the language service.
+
+|**Azure Cognitive Search tier** | **Free** | **Basic** |**S1** | **S2**| **S3** |**S3 HD**|
+|---|---|---|---|---|---|----|
+|Maximum metadata fields per Language service (across all knowledge bases)|1,000|100*|1,000|1,000|1,000|1,000|
+
+### By name and value
+
+The length and acceptable characters for metadata name and value are listed in the following table.
+
+|Item|Allowed chars|Regex pattern match|Max chars|
+|--|--|--|--|
+|Name (key)|Allows<br>Alphanumeric (letters and digits)<br>`_` (underscore)<br> Must not contain spaces.|`^[a-zA-Z0-9_]+$`|100|
+|Value|Allows everything except<br>`:` (colon)<br>`\|` (vertical pipe)<br>Only one value allowed.|`^[^:\|]+$`|500|
+|||||
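+
+The patterns and length limits in the table can be applied on the client before a metadata pair is sent to the service. This is a minimal sketch using Python's standard `re` module; the helper names `is_valid_metadata` and `normalize_metadata` are hypothetical, and the lowercasing only mirrors the statement above that metadata is stored and compared in lower case.
+
+```python
+import re
+
+# Patterns and limits taken from the table above.
+NAME_PATTERN = re.compile(r"^[a-zA-Z0-9_]+$")
+VALUE_PATTERN = re.compile(r"^[^:|]+$")
+MAX_NAME_CHARS = 100
+MAX_VALUE_CHARS = 500
+
+
+def is_valid_metadata(name: str, value: str) -> bool:
+    """Check one metadata name/value pair against the documented limits."""
+    return (
+        len(name) <= MAX_NAME_CHARS
+        and len(value) <= MAX_VALUE_CHARS
+        and NAME_PATTERN.fullmatch(name) is not None
+        and VALUE_PATTERN.fullmatch(value) is not None
+    )
+
+
+def normalize_metadata(name: str, value: str) -> str:
+    """Render the pair as a lowercase `key:value` string."""
+    if not is_valid_metadata(name, value):
+        raise ValueError(f"invalid metadata pair: {name!r}, {value!r}")
+    return f"{name}:{value}".lower()
+
+
+print(normalize_metadata("Product", "Windows 10"))
+print(is_valid_metadata("product name", "windows 10"))  # name contains a space
+```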
+
+## Knowledge base content limits
+Overall limits on the content in the knowledge base:
+* Length of answer text: 25,000 characters
+* Length of question text: 1,000 characters
+* Length of metadata key text: 100 characters
+* Length of metadata value text: 500 characters
+* Supported characters for metadata name: Letters, digits, and `_`
+* Supported characters for metadata value: All except `:` and `|`
+* Length of file name: 200
+* Supported file formats: ".tsv", ".pdf", ".txt", ".docx", ".xlsx".
+* Maximum number of alternate questions: 300
+* Maximum number of question-answer pairs: Depends on the **[Azure Cognitive Search tier](../../../../search/search-limits-quotas-capacity.md#document-limits)** chosen. A question and answer pair maps to a document on Azure Cognitive Search index.
+* URL/HTML page: 1 million characters
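+
+Some of the limits above can be checked per question-answer pair before it is added to a knowledge base. The sketch below is illustrative only: `QnaPair` and `find_limit_violations` are hypothetical names, not SDK types, and lengths are counted with Python's `len()`, which may differ from how the service counts characters.
+
+```python
+from dataclasses import dataclass, field
+
+# Limits copied from the list above.
+MAX_ANSWER_CHARS = 25_000
+MAX_QUESTION_CHARS = 1_000
+MAX_ALTERNATE_QUESTIONS = 300
+
+
+@dataclass
+class QnaPair:
+    """Hypothetical container for one question-answer pair."""
+    question: str
+    answer: str
+    alternate_questions: list[str] = field(default_factory=list)
+
+
+def find_limit_violations(pair: QnaPair) -> list[str]:
+    """Return a description of every content limit the pair exceeds."""
+    violations = []
+    if len(pair.answer) > MAX_ANSWER_CHARS:
+        violations.append("answer exceeds 25,000 characters")
+    # The primary question and each alternate question share the same length limit.
+    if any(len(q) > MAX_QUESTION_CHARS for q in [pair.question, *pair.alternate_questions]):
+        violations.append("a question exceeds 1,000 characters")
+    if len(pair.alternate_questions) > MAX_ALTERNATE_QUESTIONS:
+        violations.append("more than 300 alternate questions")
+    return violations
+
+
+print(find_limit_violations(QnaPair("How do I reset my password?", "Select **Forgot password**.")))
+```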
+
+## Create project call limits
+
+These represent the limits for each create project/knowledge base action; that is, selecting *Create new project* or calling the REST API to create a project/knowledge base.
+
+* Recommended maximum number of alternate questions per answer: 300
+* Maximum number of URLs: 10
+* Maximum number of files: 10
+* Maximum number of QnAs permitted per call: 1000
+
+## Update knowledge base call limits
+
+These represent the limits for each update action; that is, selecting *Save* or calling the REST API with an update request.
+* Length of each source name: 300
+* Recommended maximum number of alternate questions added or deleted: 300
+* Maximum number of metadata fields added or deleted: 10
+* Maximum number of URLs that can be refreshed: 5
+* Maximum number of QnAs permitted per call: 1000
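+
+When more than 1,000 QnAs need to be created or updated, one approach is to split them into batches that each fit in a single call. The following is a minimal sketch; `batch_qnas` is a hypothetical helper, and sending each batch to the REST API is left out.
+
+```python
+from typing import Iterator, Sequence, TypeVar
+
+T = TypeVar("T")
+MAX_QNAS_PER_CALL = 1000  # limit from the lists above
+
+
+def batch_qnas(qnas: Sequence[T], batch_size: int = MAX_QNAS_PER_CALL) -> Iterator[Sequence[T]]:
+    """Yield consecutive slices of `qnas`, each holding at most `batch_size` items."""
+    if batch_size < 1:
+        raise ValueError("batch_size must be at least 1")
+    for start in range(0, len(qnas), batch_size):
+        yield qnas[start:start + batch_size]
+
+
+# 2,500 placeholder items are split into batches of 1,000, 1,000, and 500.
+print([len(batch) for batch in batch_qnas(list(range(2500)))])
+```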
+
+## Add unstructured file limits
+
+> [!NOTE]
+> * If you need to use larger files than the limit allows, you can break the file into smaller files before sending them to the API. 
+
+These represent the limits when unstructured files are used to create a project, either by selecting *Create new project* or by calling the REST API to create a knowledge base:
+* Length of file: Only the first 32,000 characters are extracted.
+* Maximum three responses per file.
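+
+One way to break a large unstructured file into smaller files, as the note above suggests, is to pack whole paragraphs into parts that stay within the extraction limit. This is a sketch under assumptions: paragraphs are assumed to be separated by blank lines, `split_on_paragraphs` is a hypothetical helper, and a paragraph longer than the limit is cut at the limit.
+
+```python
+MAX_EXTRACTED_CHARS = 32_000  # only this many characters are extracted per file
+
+
+def split_on_paragraphs(text: str, limit: int = MAX_EXTRACTED_CHARS) -> list[str]:
+    """Greedily pack blank-line-separated paragraphs into parts of at most `limit` characters."""
+    parts: list[str] = []
+    current = ""
+    for paragraph in text.split("\n\n"):
+        # A single paragraph longer than the limit is hard-split at the limit.
+        while len(paragraph) > limit:
+            if current:
+                parts.append(current)
+                current = ""
+            parts.append(paragraph[:limit])
+            paragraph = paragraph[limit:]
+        candidate = paragraph if not current else current + "\n\n" + paragraph
+        if len(candidate) <= limit:
+            current = candidate
+        else:
+            # Adding this paragraph would exceed the limit, so start a new part.
+            parts.append(current)
+            current = paragraph
+    if current:
+        parts.append(current)
+    return parts
+
+
+sample = "a" * 20_000 + "\n\n" + "b" * 20_000
+print([len(part) for part in split_on_paragraphs(sample)])
+```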
+
+## Prebuilt question answering limits
+
+> [!NOTE]
+> * If you need to use larger documents than the limit allows, you can break the text into smaller chunks of text before sending them to the API. 
+> * A document is a single string of text characters.  
+
+These represent the limits when the REST API is used to answer a question without having to create a project/knowledge base:
+* Number of documents: 5
+* Maximum size of a single document: 5,120 characters
+* Maximum three responses per document.
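+
+The chunking suggested in the note above can be sketched as two steps: cut the text into documents of at most 5,120 characters, then group the documents so that each request carries at most five. The helper names are hypothetical, and the fixed-width cut is an assumption made for brevity; it can split a sentence in half, so a real client may prefer to cut on sentence or paragraph boundaries.
+
+```python
+MAX_DOCUMENT_CHARS = 5_120
+MAX_DOCUMENTS_PER_REQUEST = 5
+
+
+def to_documents(text: str, size: int = MAX_DOCUMENT_CHARS) -> list[str]:
+    """Cut `text` into consecutive strings of at most `size` characters."""
+    return [text[start:start + size] for start in range(0, len(text), size)]
+
+
+def to_requests(documents: list[str], per_request: int = MAX_DOCUMENTS_PER_REQUEST) -> list[list[str]]:
+    """Group documents so that each group fits in one prebuilt question answering call."""
+    return [documents[start:start + per_request] for start in range(0, len(documents), per_request)]
+
+
+documents = to_documents("x" * 30_000)
+print(len(documents), [len(group) for group in to_requests(documents)])
+```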
diff --git a/articles/cognitive-services/language-service/question-answering/concepts/plan.md b/articles/cognitive-services/language-service/question-answering/concepts/plan.md
@@ -0,0 +1,151 @@
+---
+title: Plan your app - question answering
+description: Learn how to plan your question answering app. Understand how question answering works and interacts with other Azure services and some knowledge base concepts.
+ms.service: cognitive-services
+ms.subservice: language-service
+author: mrbullwinkle
+ms.author: mbullwin
+ms.topic: conceptual
+ms.date: 11/02/2021
+---
+
+# Plan your question answering app
+
+To plan your question answering app, you need to understand how question answering works and interacts with other Azure services. You should also have a solid grasp of knowledge base concepts.
+
+## Azure resources
+
+Each [Azure resource](azure-resources.md#resource-purposes) created with question answering has its own purpose, limits, and [pricing tier](azure-resources.md#pricing-tier-considerations). It's important to understand the function of these resources so that you can apply that knowledge to your planning process.
+
+| Resource | Purpose |
+|--|--|
+| [Language resource](azure-resources.md) | Authoring, query prediction endpoint, and telemetry |
+| [Cognitive Search](azure-resources.md#azure-cognitive-search-resource) resource | Data storage and search |
+
+### Resource planning
+
+Question answering throughput is currently capped at 10 transactions per second for both management APIs and prediction APIs. To target 10 transactions per second for your service, we recommend the S1 (one instance) SKU of Azure Cognitive Search.
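+
+A client that issues many requests can pace itself to stay under the 10 transactions per second cap. The following is a minimal single-threaded sketch; the `Throttle` class is hypothetical and not part of any SDK, it spaces calls evenly rather than allowing bursts, and the service may still reject requests for other reasons.
+
+```python
+import time
+
+
+class Throttle:
+    """Space out calls so that no more than `max_per_second` start in any second."""
+
+    def __init__(self, max_per_second: float = 10.0, clock=time.monotonic, sleep=time.sleep):
+        self._interval = 1.0 / max_per_second
+        self._clock = clock    # injectable for testing
+        self._sleep = sleep    # injectable for testing
+        self._next_allowed = 0.0
+
+    def wait(self) -> None:
+        """Block until the next request is allowed to start."""
+        now = self._clock()
+        if now < self._next_allowed:
+            self._sleep(self._next_allowed - now)
+            now = self._next_allowed
+        self._next_allowed = now + self._interval
+
+
+# Usage: call throttle.wait() immediately before each management or prediction request.
+throttle = Throttle(max_per_second=10.0)
+throttle.wait()
+```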
+
+### Language resource
+
+A single language resource with the custom question answering feature enabled can host more than one project/knowledge base. The number of projects/knowledge bases is determined by the Cognitive Search pricing tier's quantity of supported indexes. Learn more about the [relationship of indexes to knowledge bases](azure-resources.md#index-usage).
+
+### Knowledge base size and throughput
+
+When you build a real app, plan sufficient resources for the size of your knowledge base and for your expected query prediction requests.
+
+A knowledge base size is controlled by the:
+* [Cognitive Search resource](../../../../search/search-limits-quotas-capacity.md) pricing tier limits
+* [Question answering limits](./limits.md)
+
+The knowledge base query prediction request is controlled by the web app plan and web app. Refer to [recommended settings](azure-resources.md#recommended-settings) to plan your pricing tier.
+
+### Understand the impact of resource selection
+
+Proper resource selection means your knowledge base answers query predictions successfully.
+
+If your knowledge base isn't functioning properly, it's typically an issue of improper resource management.
+
+Improper resource selection requires investigation to determine which [resource needs to change](azure-resources.md#pricing-tier-considerations).
+
+## Project
+
+A project/knowledge base is directly tied to its language resource. It holds the question and answer (QnA) pairs that are used to answer query prediction requests.
+
+### Language considerations
+
+You can now have projects in different languages within the same language resource where the custom question answering feature is enabled. When you create the first project, you can choose whether you want to use the resource for projects/knowledge bases in a single language that will apply to all subsequent projects or make a language selection each time a project is created.
+
+### Ingest data sources
+
+Question answering also supports unstructured content. You can upload a file that has unstructured content.
+
+Currently we do not support URLs for unstructured content.
+
+The ingestion process converts supported content types to markdown. All further editing of the *answer* is done with markdown. After you create a knowledge base, you can edit QnA pairs in the Language Studio portal with rich text authoring.
+
+### Data format considerations
+
+Because the final format of a QnA pair is markdown, it's important to understand markdown support.
+
+### Bot personality
+
+Add a bot personality to your project/knowledge base with [chit-chat](../how-to/chit-chat.md). This personality comes through with answers provided in a certain conversational tone, such as *professional* or *friendly*. This chit-chat is provided as a conversational set that you have full control to add to, edit, and remove.
+
+A bot personality is recommended if your bot connects to your knowledge base. You can choose to use chit-chat in your knowledge base even if you also connect to other services, but you should review how the bot service interacts to know if that is the correct architectural design for your use.
+
+### Conversation flow with a project
+
+Conversation flow usually begins with a salutation from a user, such as `Hi` or `Hello`. Your knowledge base can answer with a general answer, such as `Hi, how can I help you`, and it can also provide a selection of follow-up prompts to continue the conversation.
+
+You should design your conversational flow with a loop in mind so that a user knows how to use your bot and isn't abandoned by the bot in the conversation. [Follow-up prompts](../tutorials/guided-conversations.md) provide linking between QnA pairs, which allow for the conversational flow.
+
+### Authoring with collaborators
+
+Collaborators may be other developers who share the full development stack of the knowledge base application or may be limited to just authoring the knowledge base.
+
+Knowledge base authoring supports several role-based access permissions you apply in the Azure portal to limit the scope of a collaborator's abilities.
+
+## Integration with client applications
+
+Integration with client applications is accomplished by sending a query to the prediction runtime endpoint. A query is sent to your specific project/knowledge base with an SDK or REST-based request to your question answering web app endpoint.
+
+To authenticate a client request correctly, the client application must send the correct credentials and knowledge base ID. If you're using an Azure Bot Service, configure these settings as part of the bot configuration in the Azure portal.
+
+### Conversation flow in a client application
+
+Conversation flow in a client application, such as an Azure bot, may require functionality before and after interacting with the knowledge base.
+
+Does your client application support conversation flow, either by providing alternate means to handle follow-up prompts or by including chit-chat? If so, design these early and make sure the client application query is handled correctly, whether by another service or when sent to your knowledge base.
+
+### Active learning from a client application
+
+Question answering uses _active learning_ to improve your knowledge base by suggesting alternate questions to an answer. The client application is responsible for a part of this [active learning](../tutorials/active-learning.md). Through conversational prompts, the client application can determine that the knowledge base returned an answer that's not useful to the user, and it can determine a better answer. The client application needs to send that information back to the knowledge base to improve the prediction quality.
+
+### Providing a default answer
+
+If your knowledge base doesn't find an answer, it returns the _default answer_. This answer is configurable on the **Settings** page.
+
+This default answer is different from the Azure bot default answer. You configure the default answer for your Azure bot in the Azure portal as part of configuration settings. It's returned when the score threshold isn't met.
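+
+The threshold behavior described above can be sketched in a client as follows. Everything here is an assumption for illustration: `choose_answer` is a hypothetical helper, the candidates are plain `(answer, score)` tuples rather than a real API response, and the default text and score scale are placeholders.
+
+```python
+from typing import Iterable, Tuple
+
+DEFAULT_ANSWER = "No answer found."  # placeholder; configure your own default answer
+
+
+def choose_answer(
+    candidates: Iterable[Tuple[str, float]],
+    threshold: float,
+    default: str = DEFAULT_ANSWER,
+) -> str:
+    """Return the best-scoring answer, or `default` when no score meets the threshold."""
+    best = max(candidates, key=lambda candidate: candidate[1], default=None)
+    if best is None or best[1] < threshold:
+        return default
+    return best[0]
+
+
+print(choose_answer([("Select Forgot password.", 0.82), ("Contact support.", 0.41)], threshold=0.5))
+print(choose_answer([("Contact support.", 0.41)], threshold=0.5))
+```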
+
+## Prediction
+
+The prediction is the response from your knowledge base, and it includes more information than just the answer. To get a query prediction response, use the question answering API.
+
+### Prediction score fluctuations
+
+A score can change based on several factors:
+
+* Number of answers you requested in response with the `top` property
+* Variety of available alternate questions
+* Filtering for metadata
+* Query sent to `test` or `production` project/knowledge base.
+
+### Analytics with Azure Monitor
+
+In question answering, telemetry is offered through the [Azure Monitor service](../../../../azure-monitor/index.yml). Use our [top queries](../how-to/analytics.md) to understand your metrics.
+
+## Development lifecycle
+
+The development lifecycle of a knowledge base is ongoing: editing, testing, and publishing your knowledge base.
+
+### Knowledge base development of question answer pairs
+
+Your QnA pairs should be designed and developed based on your client application usage.
+
+Each pair can contain:
+* Metadata - filterable when querying to allow you to tag your QnA pairs with additional information about the source, content, format, and purpose of your data.
+* Follow-up prompts - helps to determine a path through your knowledge base so the user arrives at the correct answer.
+* Alternate questions - important because they let search match your answer from different forms of the question. [Active learning suggestions](../tutorials/active-learning.md) turn into alternate questions.
+
+### DevOps development
+
+Developing a knowledge base to insert into a DevOps pipeline requires that the knowledge base is isolated during batch testing.
+
+A knowledge base shares the Cognitive Search index with all other knowledge bases on the language resource. While the knowledge base is isolated by partition, sharing the index can cause a difference in the score when compared to the published knowledge base.
+
+To have the _same score_ on the `test` and `production` knowledge bases, isolate a language resource to a single knowledge base. In this architecture, the resource only needs to live as long as the isolated batch test.
+
+## Next steps
+
+* [Azure resources](./azure-resources.md)
diff --git a/articles/cognitive-services/language-service/toc.yml b/articles/cognitive-services/language-service/toc.yml
@@ -515,12 +515,16 @@ items:
     items:
     - name: Resource planning
       href: question-answering/concepts/azure-resources.md
+    - name: App planning
+      href: question-answering/concepts/plan.md
     - name: Precise answering
       href: question-answering/concepts/precise-answering.md
     - name: Confidence score
       href: question-answering/concepts/confidence-score.md
     - name: Best practices
       href: question-answering/concepts/best-practices.md
+    - name: Limits
+      href: question-answering/concepts/limits.md
   - name: Tutorials
     items:
     - name: Create a FAQ Bot