Skip to content

Commit afef581

Browse files
authored
Merge pull request #271483 from MicrosoftDocs/main
4/8/2024 AM Publish
2 parents b14f6e8 + fe60f97 commit afef581

File tree

94 files changed

+656
-737
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

94 files changed

+656
-737
lines changed

articles/ai-services/content-safety/quickstart-groundedness.md

Lines changed: 22 additions & 23 deletions
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ Follow this guide to use Azure AI Content Safety Groundedness detection to check
2626

2727
## Check groundedness without reasoning
2828

29-
In the simple case without the _reasoning_ feature, the Groundedness detection API classifies the ungroundedness of the submitted content as `true` or `false` and provides a confidence score.
29+
In the simple case without the _reasoning_ feature, the Groundedness detection API classifies the ungroundedness of the submitted content as `true` or `false`.
3030

3131
#### [cURL](#tab/curl)
3232

@@ -104,19 +104,17 @@ Create a new Python file named _quickstart.py_. Open the new file in your prefer
104104
105105
---
106106
107-
> [!TIP]
108-
> To test a summarization task instead of a question answering (QnA) task, use the following sample JSON body:
109-
>
110-
> ```json
111-
> {
112-
> "Domain": "Medical",
113-
> "Task": "Summarization",
114-
> "Text": "Ms Johnson has been in the hospital after experiencing a stroke.",
115-
> "GroundingSources": ["Our patient, Ms. Johnson, presented with persistent fatigue, unexplained weight loss, and frequent night sweats. After a series of tests, she was diagnosed with Hodgkin’s lymphoma, a type of cancer that affects the lymphatic system. The diagnosis was confirmed through a lymph node biopsy revealing the presence of Reed-Sternberg cells, a characteristic of this disease. She was further staged using PET-CT scans. Her treatment plan includes chemotherapy and possibly radiation therapy, depending on her response to treatment. The medical team remains optimistic about her prognosis given the high cure rate of Hodgkin’s lymphoma."],
116-
> "Reasoning": false
117-
> }
118-
> ```
107+
To test a summarization task instead of a question answering (QnA) task, use the following sample JSON body:
119108
109+
```json
110+
{
111+
"domain": "Medical",
112+
"task": "Summarization",
113+
"text": "Ms Johnson has been in the hospital after experiencing a stroke.",
114+
"groundingSources": ["Our patient, Ms. Johnson, presented with persistent fatigue, unexplained weight loss, and frequent night sweats. After a series of tests, she was diagnosed with Hodgkin’s lymphoma, a type of cancer that affects the lymphatic system. The diagnosis was confirmed through a lymph node biopsy revealing the presence of Reed-Sternberg cells, a characteristic of this disease. She was further staged using PET-CT scans. Her treatment plan includes chemotherapy and possibly radiation therapy, depending on her response to treatment. The medical team remains optimistic about her prognosis given the high cure rate of Hodgkin’s lymphoma."],
115+
"reasoning": false
116+
}
117+
```
120118

121119
The following fields must be included in the URL:
122120

@@ -134,7 +132,7 @@ The parameters in the request body are defined in this table:
134132
| - `query` | (Optional) This represents the question in a QnA task. Character limit: 7,500. | String |
135133
| **text** | (Required) The LLM output text to be checked. Character limit: 7,500. | String |
136134
| **groundingSources** | (Required) Uses an array of grounding sources to validate AI-generated text. Up to 55,000 characters of grounding sources can be analyzed in a single request. | String array |
137-
| **reasoning** | (Optional) Specifies whether to use the reasoning feature. The default value is `false`. If `true`, you need to bring your own Azure OpenAI resources to provide an explanation. Be careful: using reasoning increases the processing time and incurs extra fees.| Boolean |
135+
| **reasoning** | (Optional) Specifies whether to use the reasoning feature. The default value is `false`. If `true`, you need to bring your own Azure OpenAI GPT-4 Turbo resources to provide an explanation. Be careful: using reasoning increases the processing time.| Boolean |
138136

139137
### Interpret the API response
140138

@@ -157,18 +155,20 @@ The JSON objects in the output are defined here:
157155
| Name | Description | Type |
158156
| :------------------ | :----------- | ------- |
159157
| **ungroundedDetected** | Indicates whether the text exhibits ungroundedness. | Boolean |
160-
| **confidenceScore** | The confidence value of the _ungrounded_ designation. The score ranges from 0 to 1. | Float |
161158
| **ungroundedPercentage** | Specifies the proportion of the text identified as ungrounded, expressed as a number between 0 and 1, where 0 indicates no ungrounded content and 1 indicates entirely ungrounded content.| Float |
162159
| **ungroundedDetails** | Provides insights into ungrounded content with specific examples and percentages.| Array |
163-
| -**`Text`** | The specific text that is ungrounded. | String |
160+
| -**`text`** | The specific text that is ungrounded. | String |
164161

165162
## Check groundedness with reasoning
166163

167164
The Groundedness detection API provides the option to include _reasoning_ in the API response. With reasoning enabled, the response includes a `"reasoning"` field that details specific instances and explanations for any detected ungroundedness. Be careful: using reasoning increases the processing time and incurs extra fees.
168165

169166
### Bring your own GPT deployment
170167

171-
In order to use your Azure OpenAI resource to enable the reasoning feature, use Managed Identity to allow your Content Safety resource to access the Azure OpenAI resource:
168+
> [!TIP]
169+
> At the moment, we only support **Azure OpenAI GPT-4 Turbo** resources and do not support other GPT types. Your GPT-4 Turbo resources can be deployed in any region; however, we recommend that they be located in the same region as the content safety resources to minimize potential latency.
170+
171+
In order to use your Azure OpenAI GPT4-Turbo resource to enable the reasoning feature, use Managed Identity to allow your Content Safety resource to access the Azure OpenAI resource:
172172

173173
1. Enable Managed Identity for Azure AI Content Safety.
174174

@@ -188,7 +188,7 @@ In order to use your Azure OpenAI resource to enable the reasoning feature, use
188188

189189
### Make the API request
190190

191-
In your request to the Groundedness detection API, set the `"Reasoning"` body parameter to `true`, and provide the other needed parameters:
191+
In your request to the Groundedness detection API, set the `"reasoning"` body parameter to `true`, and provide the other needed parameters:
192192

193193
```json
194194
{
@@ -295,8 +295,8 @@ The parameters in the request body are defined in this table:
295295
| **text** | (Required) The LLM output text to be checked. Character limit: 7,500. | String |
296296
| **groundingSources** | (Required) Uses an array of grounding sources to validate AI-generated text. Up to 55,000 characters of grounding sources can be analyzed in a single request. | String array |
297297
| **reasoning** | (Optional) Set to `true`, the service uses Azure OpenAI resources to provide an explanation. Be careful: using reasoning increases the processing time and incurs extra fees.| Boolean |
298-
| **llmResource** | (Optional) If you want to use your own Azure OpenAI resources instead of our default GPT resources, add this field and include the subfields for the resources used. If you don't want to use your own resources, remove this field from the input. | String |
299-
| - `resourceType `| Specifies the type of resource being used. Currently it only allows `AzureOpenAI`. | Enum|
298+
| **llmResource** | (Required) If you want to use your own Azure OpenAI GPT4-Turbo resource to enable reasoning, add this field and include the subfields for the resources used. | String |
299+
| - `resourceType `| Specifies the type of resource being used. Currently it only allows `AzureOpenAI`. We only support Azure OpenAI GPT-4 Turbo resources and do not support other GPT types. Your GPT-4 Turbo resources can be deployed in any region; however, we recommend that they be located in the same region as the content safety resources to minimize potential latency. | Enum|
300300
| - `azureOpenAIEndpoint `| Your endpoint URL for Azure OpenAI service. | String |
301301
| - `azureOpenAIDeploymentName` | The name of the specific GPT deployment to use. | String|
302302
@@ -332,10 +332,9 @@ The JSON objects in the output are defined here:
332332
| Name | Description | Type |
333333
| :------------------ | :----------- | ------- |
334334
| **ungroundedDetected** | Indicates whether the text exhibits ungroundedness. | Boolean |
335-
| **confidenceScore** | The confidence value of the _ungrounded_ designation. The score ranges from 0 to 1. | Float |
336335
| **ungroundedPercentage** | Specifies the proportion of the text identified as ungrounded, expressed as a number between 0 and 1, where 0 indicates no ungrounded content and 1 indicates entirely ungrounded content.| Float |
337336
| **ungroundedDetails** | Provides insights into ungrounded content with specific examples and percentages.| Array |
338-
| -**`Text`** | The specific text that is ungrounded. | String |
337+
| -**`text`** | The specific text that is ungrounded. | String |
339338
| -**`offset`** | An object describing the position of the ungrounded text in various encoding. | String |
340339
| - `offset > utf8` | The offset position of the ungrounded text in UTF-8 encoding. | Integer |
341340
| - `offset > utf16` | The offset position of the ungrounded text in UTF-16 encoding. | Integer |
@@ -344,7 +343,7 @@ The JSON objects in the output are defined here:
344343
| - `length > utf8` | The length of the ungrounded text in UTF-8 encoding. | Integer |
345344
| - `length > utf16` | The length of the ungrounded text in UTF-16 encoding. | Integer |
346345
| - `length > codePoint` | The length of the ungrounded text in terms of Unicode code points. |Integer |
347-
| -**`Reason`** | Offers explanations for detected ungroundedness. | String |
346+
| -**`reason`** | Offers explanations for detected ungroundedness. | String |
348347

349348
## Clean up resources
350349

articles/ai-services/openai/concepts/use-your-data.md

Lines changed: 4 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -62,17 +62,15 @@ You need to connect to a data source to upload your data. When you want to use y
6262

6363
The [Integrated Vector Database in Azure Cosmos DB for MongoDB](/azure/cosmos-db/mongodb/vcore/vector-search) natively supports integration with Azure OpenAI On Your Data.
6464

65-
For some data sources such as uploading files from your local machine (preview) or data contained in a blob storage account (preview), Azure AI Search is used.
65+
For some data sources such as uploading files from your local machine (preview) or data contained in a blob storage account (preview), Azure AI Search is used. When you choose the following data sources, your data is ingested into an Azure AI Search index.
6666

6767
>[!TIP]
68-
>If you use [Azure Cosmos DB](/azure/cosmos-db/introduction) (except for its vCore-based API for MongoDB), you may take advantage of the Azure AI Advantage offer, which provides the equivalent of up to $6,000 in Azure Cosmos DB throughput credits.
69-
70-
When you choose the following data sources, your data is ingested into an Azure AI Search index.
68+
>If you use Azure Cosmos DB (except for its vCore-based API for MongoDB), you may be eligible for the [Azure AI Advantage offer](/azure/cosmos-db/ai-advantage), which provides the equivalent of up to $6,000 in Azure Cosmos DB throughput credits.
7169
7270
|Data source | Description |
7371
|---------|---------|
7472
| [Azure AI Search](/azure/search/search-what-is-azure-search) | Use an existing Azure AI Search index with Azure OpenAI On Your Data. |
75-
| [Azure Cosmos DB](/azure/cosmos-db/introduction) | Azure Cosmos DB's API for Postgres and vCore-based API for MongoDB have natively integrated vector indexing and do not require Azure AI Search; however, its other APIs do require Azure AI Search for vector indexing. Azure Cosmos DB will offer a natively integrated vector database by mid-2024. |
73+
| [Azure Cosmos DB](/azure/cosmos-db/introduction) | Azure Cosmos DB's API for Postgres and vCore-based API for MongoDB have natively integrated vector indexing and do not require Azure AI Search; however, its other APIs do require Azure AI Search for vector indexing. Azure Cosmos DB for NoSQL will offer a natively integrated vector database by mid-2024. |
7674
|Upload files (preview) | Upload files from your local machine to be stored in an Azure Blob Storage database, and ingested into Azure AI Search. |
7775
|URL/Web address (preview) | Web content from the URLs is stored in Azure Blob Storage. |
7876
|Azure Blob Storage (preview) | Upload files from Azure Blob Storage to be ingested into an Azure AI Search index. |
@@ -141,7 +139,7 @@ If you want to implement additional value-based criteria for query execution, yo
141139

142140
[!INCLUDE [ai-search-ingestion](../includes/ai-search-ingestion.md)]
143141

144-
# [Azure Cosmos DB for MongoDB vCore](#tab/mongo-db)
142+
# [Vector Database in Azure Cosmos DB for MongoDB vCore](#tab/mongo-db)
145143

146144
### Prerequisites
147145
* [Azure Cosmos DB for MongoDB vCore](/azure/cosmos-db/mongodb/vcore/introduction) account

articles/ai-services/openai/how-to/chatgpt.md

Lines changed: 1 addition & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,6 @@ ms.topic: how-to
1010
ms.date: 04/05/2024
1111
manager: nitinme
1212
keywords: ChatGPT
13-
zone_pivot_groups: openai-chat
1413
---
1514

1615
# Learn how to work with the GPT-3.5-Turbo and GPT-4 models
@@ -19,5 +18,4 @@ The GPT-3.5-Turbo and GPT-4 models are language models that are optimized for co
1918

2019
This article walks you through getting started with the GPT-3.5-Turbo and GPT-4 models. It's important to use the techniques described here to get the best results. If you try to interact with the models the same way you did with the older model series, the models will often be verbose and provide less useful responses.
2120

22-
[!INCLUDE [Chat Completions](../includes/chat-completion.md)]
23-
21+
[!INCLUDE [Chat Completions](../includes/chat-completion.md)]

articles/ai-services/openai/how-to/monitoring.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -56,10 +56,10 @@ The following table summarizes the current subset of metrics available in Azure
5656
|Metric|Category|Aggregation|Description|Dimensions|
5757
|---|---|---|---|---|
5858
|`Azure OpenAI Requests`|HTTP|Count|Total number of calls made to the Azure OpenAI API over a period of time. Applies to PayGo, PTU, and PTU-managed SKUs.| `ApiName`, `ModelDeploymentName`,`ModelName`,`ModelVersion`, `OperationName`, `Region`, `StatusCode`, `StreamType`|
59-
| `Generated Completion Tokens` | Usage | Sum | Number of generated tokens (output) from an OpenAI model. Applies to PayGo, PTU, and PTU-manged SKUs | `ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
60-
| `Processed FineTuned Training Hours` | Usage |Sum| Number of Training Hours Processed on an OpenAI FineTuned Model | `ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
61-
| `Processed Inference Tokens` | Usage | Sum| Number of inference tokens processed by an OpenAI model. Calculated as prompt tokens (input) + generated tokens. Applies to PayGo, PTU, and PTU-manged SKUs.|`ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
62-
| `Processed Prompt Tokens` | Usage | Sum | Total number of prompt tokens (input) processed on an OpenAI model. Applies to PayGo, PTU, and PTU-managed SKUs.|`ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
59+
| `Generated Completion Tokens` | Usage | Sum | Number of generated tokens (output) from an Azure OpenAI model. Applies to PayGo, PTU, and PTU-manged SKUs | `ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
60+
| `Processed FineTuned Training Hours` | Usage |Sum| Number of training hours processed on an Azure OpenAI fine-tuned model. | `ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
61+
| `Processed Inference Tokens` | Usage | Sum| Number of inference tokens processed by an Azure OpenAI model. Calculated as prompt tokens (input) + generated tokens. Applies to PayGo, PTU, and PTU-manged SKUs.|`ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
62+
| `Processed Prompt Tokens` | Usage | Sum | Total number of prompt tokens (input) processed on an Azure OpenAI model. Applies to PayGo, PTU, and PTU-managed SKUs.|`ApiName`, `ModelDeploymentName`,`ModelName`, `Region`|
6363
| `Provision-managed Utilization V2` | Usage | Average | Provision-managed utilization is the utilization percentage for a given provisioned-managed deployment. Calculated as (PTUs consumed/PTUs deployed)*100. When utilization is at or above 100%, calls are throttled and return a 429 error code. | `ModelDeploymentName`,`ModelName`,`ModelVersion`, `Region`, `StreamType`|
6464

6565
## Configure diagnostic settings

articles/ai-services/openai/includes/assistants-ai-studio.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -119,7 +119,7 @@ By default there are three panels: assistant setup, chat session, and Logs. **Sh
119119
120120
## Clean up resources
121121
122-
If you want to clean up and remove an OpenAI resource, you can delete the resource or resource group. Deleting the resource group also deletes any other resources associated with it.
122+
If you want to clean up and remove an Azure OpenAI resource, you can delete the resource or resource group. Deleting the resource group also deletes any other resources associated with it.
123123
124124
- [Portal](../../multi-service-resource.md?pivots=azportal#clean-up-resources)
125125
- [Azure CLI](../../multi-service-resource.md?pivots=azcli#clean-up-resources)

articles/ai-services/openai/includes/assistants-csharp.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -147,7 +147,7 @@ New messages can be created on the thread before re-running, which will see the
147147

148148
## Clean up resources
149149

150-
If you want to clean up and remove an OpenAI resource, you can delete the resource or resource group. Deleting the resource group also deletes any other resources associated with it.
150+
If you want to clean up and remove an Azure OpenAI resource, you can delete the resource or resource group. Deleting the resource group also deletes any other resources associated with it.
151151

152152
- [Portal](../../multi-service-resource.md?pivots=azportal#clean-up-resources)
153153
- [Azure CLI](../../multi-service-resource.md?pivots=azcli#clean-up-resources)

articles/ai-services/openai/includes/assistants-python.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -48,7 +48,7 @@ To successfully make a call against the Azure OpenAI service, you'll need the fo
4848

4949
Go to your resource in the Azure portal. The **Keys and Endpoint** can be found in the **Resource Management** section. Copy your endpoint and access key as you'll need both for authenticating your API calls. You can use either `KEY1` or `KEY2`. Always having two keys allows you to securely rotate and regenerate keys without causing a service disruption.
5050

51-
:::image type="content" source="../media/quickstarts/endpoint.png" alt-text="Screenshot of the overview blade for an OpenAI Resource in the Azure portal with the endpoint & access keys location circled in red." lightbox="../media/quickstarts/endpoint.png":::
51+
:::image type="content" source="../media/quickstarts/endpoint.png" alt-text="Screenshot of the overview blade for an Azure OpenAI resource in the Azure portal with the endpoint & access keys location circled in red." lightbox="../media/quickstarts/endpoint.png":::
5252

5353
Create and assign persistent environment variables for your key and endpoint.
5454

@@ -205,7 +205,7 @@ It is important to remember that while code interpreter gives the model the capa
205205

206206
## Clean up resources
207207

208-
If you want to clean up and remove an OpenAI resource, you can delete the resource or resource group. Deleting the resource group also deletes any other resources associated with it.
208+
If you want to clean up and remove an Azure OpenAI resource, you can delete the resource or resource group. Deleting the resource group also deletes any other resources associated with it.
209209

210210
- [Portal](../../multi-service-resource.md?pivots=azportal#clean-up-resources)
211211
- [Azure CLI](../../multi-service-resource.md?pivots=azcli#clean-up-resources)

0 commit comments

Comments
 (0)