Skip to content

Commit 3bb534c

Browse files
committed
Merge branch 'main' of https://github.com/microsoftdocs/azure-ai-docs-pr into prompt2
2 parents cab0586 + 3ffda5f commit 3bb534c

File tree

315 files changed

+686
-386
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

315 files changed

+686
-386
lines changed

.github/policies/disallow-edits.yml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -12,10 +12,10 @@ configuration:
1212
action: Opened
1313
- or:
1414
- filesMatchPattern:
15-
pattern: articles/index.yml
15+
matchAny: true
16+
pattern: articles/index.*
1617
then:
1718
- addReply:
1819
reply: >-
1920
@${issueAuthor} - You tried to add an index file to this repository; this is not permitted so your pull request will be closed automatically.
2021
- closePullRequest
21-

articles/ai-services/.openpublishing.redirection.ai-services.json

Lines changed: 7 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -484,6 +484,11 @@
484484
"source_path_from_root": "/articles/ai-services/language-service/summarization/custom/quickstart.md",
485485
"redirect_url": "/azure/ai-services//language-service/summarization/overview",
486486
"redirect_document_id": false
487-
}
487+
},
488+
{
489+
"source_path_from_root": "/articles/ai-services/speech-service/video-translation-studio.md",
490+
"redirect_url": "/azure/ai-services/speech-service/video-translation-get-started",
491+
"redirect_document_id": true
492+
}
488493
]
489-
}
494+
}

articles/ai-services/encryption/cognitive-services-encryption-keys-portal.md

Lines changed: 1 addition & 16 deletions
Original file line numberDiff line numberDiff line change
@@ -27,20 +27,6 @@ Azure AI is built on top of multiple Azure services. While the data is stored se
2727
* unwrap key
2828
* get
2929

30-
For example, the managed identity for Azure Cosmos DB would need to have those permissions to the key vault.
31-
32-
## How metadata is stored
33-
34-
The following services are used by Azure AI to store metadata for your Azure AI resource and projects:
35-
36-
|Service|What it's used for|Example|
37-
|-----|-----|-----|
38-
|Azure Cosmos DB|Stores metadata for your Azure AI projects and tools|Flow creation timestamps, deployment tags, evaluation metrics|
39-
|Azure AI Search|Stores indices that are used to help query your AI studio content.|An index based off your model deployment names|
40-
|Azure Storage Account|Stores artifacts created by Azure AI projects and tools|Fine-tuned models|
41-
42-
All of the above services are encrypted using the same key at the time that you create your Azure AI resource for the first time, and are set up in a managed resource group in your subscription once for every Azure AI resource and set of projects associated with it. Your Azure AI resource and projects read and write data using managed identity. Managed identities are granted access to the resources using a role assignment (Azure role-based access control) on the data resources. The encryption key you provide is used to encrypt data that is stored on Microsoft-managed resources. It's also used to create indices for Azure AI Search, which are created at runtime.
43-
4430
## Customer-managed keys
4531

4632
When you don't use a customer-managed key, Microsoft creates and manages these resources in a Microsoft owned Azure subscription and uses a Microsoft-managed key to encrypt the data.
@@ -53,14 +39,13 @@ When you use a customer-managed key, these resources are _in your Azure subscrip
5339
These Microsoft-managed resources are located in a new Azure resource group is created in your subscription. This group is in addition to the resource group for your project. This resource group contains the Microsoft-managed resources that your key is used with. The resource group is named using the formula of `<Azure AI resource group name><GUID>`. It isn't possible to change the naming of the resources in this managed resource group.
5440

5541
> [!TIP]
56-
> * The [Request Units](/azure/cosmos-db/request-units) for the Azure Cosmos DB automatically scale as needed.
5742
> * If your AI resource uses a private endpoint, this resource group will also contain a Microsoft-managed Azure Virtual Network. This VNet is used to secure communications between the managed services and the project. You cannot provide your own VNet for use with the Microsoft-managed resources. You also cannot modify the virtual network. For example, you cannot change the IP address range that it uses.
5843
5944
> [!IMPORTANT]
6045
> If your subscription does not have enough quota for these services, a failure will occur.
6146
6247
> [!WARNING]
63-
> Don't delete the managed resource group that contains this Azure Cosmos DB instance, or any of the resources automatically created in this group. If you need to delete the resource group or Microsoft-managed services in it, you must delete the Azure AI resources that uses it. The resource group resources are deleted when the associated AI resource is deleted.
48+
> Don't delete the managed resource group any of the resources automatically created in this group. If you need to delete the resource group or Microsoft-managed services in it, you must delete the Azure AI resources that uses it. The resource group resources are deleted when the associated AI resource is deleted.
6449
6550
The process to enable Customer-Managed Keys with Azure Key Vault for Azure AI services varies by product. Use these links for service-specific instructions:
6651

articles/ai-services/language-service/personally-identifiable-information/how-to/use-containers.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
---
2-
title: Use personally identifiable information Docker containers on-premises
2+
title: Use personally identifiable information (PII) detection Docker containers on-premises
33
titleSuffix: Azure AI services
4-
description: Use Docker containers for the Personally Identifiable Information API to determine the language of written text, on-premises.
4+
description: Use Docker containers for the Personally Identifiable Information (PII) detection API to determine the language of written text, on-premises.
55
#services: cognitive-services
66
author: jboback
77
manager: nitinme
@@ -16,7 +16,7 @@ keywords: on-premises, Docker, container
1616
# Install and run Personally Identifiable Information (PII) Detection containers
1717

1818
> [!NOTE]
19-
> The data limits for the PII container are 5120 characters and 10 documents max.
19+
> The data limits in a single synchronous API call for the PII container are 5120 characters per document and up to 10 documents per call.
2020
2121
Containers enable you to host the PII detection API on your own infrastructure. If you have security or data governance requirements that can't be fulfilled by calling PII detection remotely, then containers might be a good option.
2222

articles/ai-services/openai/concepts/model-retirements.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -102,7 +102,8 @@ These models are currently available for use in Azure OpenAI Service.
102102
| `gpt-4` | 1106-preview | To be upgraded to `gpt-4` version: `turbo-2024-04-09`, starting no sooner than January 27, 2025 **<sup>1</sup>** | `gpt-4o`|
103103
| `gpt-4` | 0125-preview |To be upgraded to `gpt-4` version: `turbo-2024-04-09`, starting no sooner than January 27, 2025 **<sup>1</sup>** | `gpt-4o` |
104104
| `gpt-4` | vision-preview | To be upgraded to `gpt-4` version: `turbo-2024-04-09`, starting no sooner than January 27, 2025 **<sup>1</sup>** | `gpt-4o`|
105-
| `gpt-4o` | 2024-05-13 | No earlier than March 20, 2025 <br><br>Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `2024-08-06`, starting on December 5, 2024. | |
105+
| `gpt-4o` | 2024-05-13 | No earlier than May 20, 2025 <br><br>Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `2024-08-06`, starting on December 5, 2024. | |
106+
| `gpt-4o-mini` | 2024-07-18 | No earlier than July 18, 2025 | |
106107
| `gpt-3.5-turbo-instruct` | 0914 | No earlier than Sep 14, 2025 | |
107108
| `text-embedding-ada-002` | 2 | No earlier than April 3, 2025 | `text-embedding-3-small` or `text-embedding-3-large` |
108109
| `text-embedding-ada-002` | 1 | No earlier than April 3, 2025 | `text-embedding-3-small` or `text-embedding-3-large` |
@@ -119,8 +120,7 @@ These models are currently available for use in Azure OpenAI Service.
119120
| Model | Current default version | New default version | Default upgrade date |
120121
|---|---|---|---|
121122
| `gpt-35-turbo` | 0301 | 0125 | Deployments of versions `0301`, `0613`, and `1106` set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `0125`, starting on November 13, 2024.|
122-
123-
123+
| `gpt-4o` | 2024-05-13 | 2024-08-06 | Deployments set to [**Auto-update to default**](/azure/ai-services/openai/how-to/working-with-models?tabs=powershell#auto-update-to-default) will be automatically upgraded to version: `2024-08-06`, starting on December 5, 2024. |
124124

125125
## Deprecated models
126126

articles/ai-services/openai/how-to/batch.md

Lines changed: 4 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ manager: nitinme
66
ms.service: azure-ai-openai
77
ms.custom:
88
ms.topic: how-to
9-
ms.date: 10/02/2024
9+
ms.date: 10/11/2024
1010
author: mrbullwinkle
1111
ms.author: mbullwin
1212
recommendations: false
@@ -65,9 +65,9 @@ The following models support global batch:
6565

6666
Refer to the [models page](../concepts/models.md) for the most up-to-date information on regions/models where global batch is currently supported.
6767

68-
### API Versions
68+
### API support
6969

70-
- `2024-07-01-preview`
70+
API support was first added with `2024-07-01-preview`.
7171

7272
### Not supported
7373

@@ -224,8 +224,7 @@ When a job failure occurs, you'll find details about the failure in the `errors`
224224

225225
### Known issues
226226

227-
- Resources deployed with Azure CLI won't work out-of-box with Azure OpenAI global batch. This is due to an issue where resources deployed using this method have endpoint subdomains that don't follow the `https://your-resource-name.openai.azure.com` pattern. A workaround for this issue is to deploy a new Azure OpenAI resource using one of the other common deployment methods which will properly handle the subdomain setup as part of the deployment process.
228-
227+
- Resources deployed with Azure CLI won't work out-of-box with Azure OpenAI global batch. This is due to an issue where resources deployed using this method have endpoint subdomains that don't follow the `https://your-resource-name.openai.azure.com` pattern. A workaround for this issue is to deploy a new Azure OpenAI resource using one of the other common deployment methods which will properly handle the subdomain setup as part of the deployment process.
229228

230229
## See also
231230

articles/ai-services/openai/includes/batch/batch-python.md

Lines changed: 1 addition & 24 deletions
Original file line numberDiff line numberDiff line change
@@ -104,34 +104,11 @@ file_id = file.id
104104
"filename": "test.jsonl",
105105
"object": "file",
106106
"purpose": "batch",
107-
"status": "pending",
107+
"status": null,
108108
"status_details": null
109109
}
110110
```
111111

112-
## Track file upload status
113-
114-
Depending on the size of your upload file it might take some time before it's fully uploaded and processed. To check on your file upload status run:
115-
116-
```python
117-
# Wait until the uploaded file is in processed state
118-
import time
119-
import datetime
120-
121-
status = "pending"
122-
while status != "processed":
123-
time.sleep(15)
124-
file_response = client.files.retrieve(file_id)
125-
status = file_response.status
126-
print(f"{datetime.datetime.now()} File Id: {file_id}, Status: {status}")
127-
```
128-
129-
**Output:**
130-
131-
```output
132-
2024-07-31 21:42:53.663655 File Id: file-9f3a81d899b4442f98b640e4bc3535dd, Status: processed
133-
```
134-
135112
## Create batch job
136113

137114
Once your file has uploaded successfully by reaching a status of `processed` you can submit the file for batch processing.

articles/ai-services/openai/quotas-limits.md

Lines changed: 12 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.custom:
1010
- ignite-2023
1111
- references_regions
1212
ms.topic: conceptual
13-
ms.date: 10/10/2024
13+
ms.date: 10/11/2024
1414
ms.author: mbullwin
1515
---
1616

@@ -62,6 +62,17 @@ The following sections provide you with a quick guide to the default quotas and
6262

6363
## o1-preview & o1-mini rate limits
6464

65+
> [!IMPORTANT]
66+
> The ratio of RPM/TPM for quota with o1-series models works differently than older chat completions models:
67+
>
68+
> - **Older chat models:** 1 unit of capacity = 6 RPM and 1,000 TPM.
69+
> - **o1-preview:** 1 unit of capacity = 1 RPM and 6,000 TPM.
70+
> - **o1-mini:** 1 unit of capacity = 1 RPM per 10,000 TPM.
71+
>
72+
> This is particularly important for programmatic model deployment as this change in RPM/TPM ratio can result in accidental under allocation of quota if one is still assuming the 1:1000 ratio followed by older chat completion models.
73+
>
74+
> There is a known issue with the [quota/usages API](/rest/api/aiservices/accountmanagement/usages/list?view=rest-aiservices-accountmanagement-2024-06-01-preview&tabs=HTTP&preserve-view=true) where it assumes the old ratio applies to the new o1-series models. The API returns the correct base capacity number, but does not apply the correct ratio for the accurate calculation of TPM.
75+
6576
### o1-preview & o1-mini global standard
6677

6778
| Model|Tier| Quota Limit in tokens per minute (TPM) | Requests per minute |

articles/ai-services/speech-service/high-definition-voices.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -39,7 +39,7 @@ Here's a comparison of features between Azure AI Speech HD voices, Azure OpenAI
3939

4040
| Feature | Azure AI Speech HD voices | Azure OpenAI HD voices | Azure AI Speech voices (not HD) |
4141
|---------|---------------|------------------------|------------------------|
42-
| **Region** | North Central US, Sweden Central | North Central US, Sweden Central | Available in dozens of regions. See the [region list](regions.md#speech-service).|
42+
| **Region** | East US, Southeast Asia, West Europe | North Central US, Sweden Central | Available in dozens of regions. See the [region list](regions.md#speech-service).|
4343
| **Number of voices** | 12 | 6 | More than 500 |
4444
| **Multilingual** | No (perform on primary language only) | Yes | Yes (applicable only to multilingual voices) |
4545
| **SSML support** | Support for [a subset of SSML elements](#supported-and-unsupported-ssml-elements-for-azure-ai-speech-hd-voices).| Support for [a subset of SSML elements](openai-voices.md#ssml-elements-supported-by-openai-text-to-speech-voices-in-azure-ai-speech). | Support for the [full set of SSML](speech-synthesis-markup-structure.md) in Azure AI Speech. |

0 commit comments

Comments
 (0)