If you get an `unauthorized: authentication required` error, login with the `az acr login --name <your-container-registry-name>` command.
134
+
If you get an `unauthorized: authentication required` error, sign in with the `az acr login --name <your-container-registry-name>` command.
135
135
136
136
When the process is done, the results should be similar to:
137
137
@@ -148,7 +148,7 @@ To deploy the container to the Azure Kubernetes Service, the container images ne
148
148
149
149
## Get language detection Docker image
150
150
151
-
1. Pull the latest version of the Docker image to the local machine. This may take a few minutes. If there is a newer version of this container, change the value from `1.1.006770001-amd64-preview` to the newer version.
151
+
1. Pull the latest version of the Docker image to the local machine. This can take a few minutes. If there is a newer version of this container, change the value from `1.1.006770001-amd64-preview` to the newer version.
@@ -216,7 +216,7 @@ The following steps are needed to get the required information to connect your c
216
216
az aks create --resource-group cogserv-container-rg --name patty-kube --node-count 2 --service-principal <appId> --client-secret <client-secret> --generate-ssh-keys
217
217
```
218
218
219
-
This step may take a few minutes. The result is:
219
+
This step can take a few minutes. The result is:
220
220
221
221
```output
222
222
{
@@ -310,14 +310,14 @@ This section uses the **kubectl** CLI to talk with the Azure Kubernetes Service.
310
310
|Language-frontend deployment settings|Purpose|
311
311
|--|--|
312
312
|Line 32<br> `image` property|Image location for the frontend image in your Container Registry<br>`<container-registry-name>.azurecr.io/language-frontend:v1`|
313
-
|Line 44<br> `name` property|Container Registry secret for the image, referred to as `<client-secret>` in a previous section.|
313
+
|Line 44<br> `name` property|Container Registry secret for the image. Referred to as `<client-secret>` in a previous section.|
314
314
315
315
1. Change the language deployment lines of `language.yml` based on the following table to add your own container registry image names, client secret, and Language service settings.
316
316
317
317
|Language deployment settings|Purpose|
318
318
|--|--|
319
319
|Line 78<br> `image` property|Image location for the language image in your Container Registry<br>`<container-registry-name>.azurecr.io/language:1.1.006770001-amd64-preview`|
320
-
|Line 95<br> `name` property|Container Registry secret for the image, referred to as `<client-secret>` in a previous section.|
320
+
|Line 95<br> `name` property|Container Registry secret for the image. Referred to as `<client-secret>` in a previous section.|
321
321
|Line 91<br> `apiKey` property|Your Language service resource key|
322
322
|Line 92<br> `billing` property|The billing endpoint for your Language service resource.<br>`https://westus.api.cognitive.microsoft.com/text/analytics/v2.1`|
articles/ai-services/containers/container-faq.yml: 2 additions & 2 deletions
@@ -8,7 +8,7 @@ metadata:
8
8
manager: nitinme
9
9
ms.service: azure-ai-services
10
10
ms.topic: faq
11
-
ms.date: 02/21/2024
11
+
ms.date: 01/31/2025
12
12
ms.author: aahi
13
13
title: FAQ for Azure AI containers
14
14
summary: |
@@ -82,7 +82,7 @@ sections:
82
82
| `BillingEndpointBusy` | The billing endpoint is currently busy. Check the `Retry-After` value and wait for this period to end before you make more requests. |
83
83
| `ContainerUseUnauthorized` | The provided API key isn't authorized for use with this container. You're likely using a gated container, so make sure your Azure subscription ID is approved by submitting an [online request](https://aka.ms/csgate). |
84
84
| `[ERROR] Failed to download: context deadline exceeded` | The model file download request to our servers timed out. Make sure you have a strong internet connection to download all required files within one hour. Model downloads apply to Text Translation and some Speech service containers. |
85
-
| `The provided license path was not found. Please ensure a volume is mounted and a directory exists at the location specified by Mounts:License` | **Disconnected containers only.** There is no license file stored locally at the location specified in your docker run command. You might have mounted the license volume incorrectly. Be sure to check your local file system and provide a valid filepath to the desired local storage location for the license file using the -v argument in your docker run command. |
85
+
| `The provided license path was not found. Please ensure a volume is mounted and a directory exists at the location specified by Mounts:License` | **Disconnected containers only.** There is no license file stored locally at the location specified in your docker run command. You might have mounted the license volume incorrectly. Be sure to check your local file system and provide a valid filepath to the desired local storage location for the license file using the `-v` argument in your docker run command. |
86
86
| `Unknown` | The server is currently unable to process billing requests. |
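The `BillingEndpointBusy` row above says to check the `Retry-After` value and wait before sending more requests. A minimal sketch of that wait, assuming the value arrives as a plain number of seconds (the standard HTTP `Retry-After` header also allows a date form, which this sketch doesn't handle). The function name `retry_after_seconds` and the 60-second default are hypothetical, not part of the container documentation.

```bash
# Sketch only: turn a Retry-After value into a number of seconds to wait.
# Assumes the delta-seconds form (for example "30"). Anything empty or
# non-numeric, including the HTTP-date form, falls back to a default.
# retry_after_seconds and the default of 60 are hypothetical choices.
retry_after_seconds() {
  value="$1"
  default="${2:-60}"
  case "$value" in
    ''|*[!0-9]*) printf '%s\n' "$default" ;;  # empty or not all digits
    *)           printf '%s\n' "$value" ;;    # plain seconds
  esac
}
```

A caller could then pause with `sleep "$(retry_after_seconds "$header_value")"` before retrying the request.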
articles/ai-services/containers/container-reuse-recipe.md: 3 additions & 3 deletions
@@ -7,7 +7,7 @@ author: aahill
7
7
manager: nitinme
8
8
ms.service: azure-ai-services
9
9
ms.topic: how-to
10
-
ms.date: 02/22/2024
10
+
ms.date: 01/31/2025
11
11
ms.author: aahi
12
12
#Customer intent: As a potential customer, I want to know how to configure containers so I can reuse them.
13
13
---
@@ -23,7 +23,7 @@ Once you have this new layer of container (with settings), and you have tested i
23
23
Any `docker run` examples in this document assume a Windows console with a `^` line continuation character. Consider the following for your own use:
24
24
25
25
* Do not change the order of the arguments unless you are very familiar with docker containers.
26
-
* If you are using an operating system other than Windows, or a console other than Windows console, use the correct console/terminal, folder syntax for mounts, and line continuation character for your console and system. Because the Azure AI services container is a Linux operating system, the target mount uses a Linux-style folder syntax.
26
+
* If you are using an operating system other than Windows, or a console other than Windows console, use the correct console/terminal, folder syntax for mounts, and line continuation character for your console and system. Because the Azure AI services container is a Linux operating system, the target mount uses a Linux-style folder syntax.
27
27
* `docker run` examples use the directory off the `c:` drive to avoid any permission conflicts on Windows. If you need to use a specific directory as the input directory, you might need to grant the docker service permission.
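The considerations above can be sketched for a Linux or macOS shell as follows. This is an illustration under assumptions, not one of the article's own examples: `build_mount_arg`, the host path, and `<container-image>` are hypothetical placeholders, and the command shows only the mount and the line continuation, not the full argument list a real container needs.

```bash
# Sketch only: a bind mount written for a Linux or macOS shell.
# build_mount_arg, the paths, and the image name are hypothetical placeholders.
build_mount_arg() {
  # $1 = host folder (host OS path syntax)
  # $2 = folder inside the container (Linux-style path)
  printf 'type=bind,source=%s,target=%s\n' "$1" "$2"
}

MOUNT_ARG="$(build_mount_arg "$HOME/input" "/input")"

# `\` continues the line here, where the Windows console examples use `^`.
# echo prints the command instead of running it, so nothing is started.
echo docker run --rm -it \
  --mount "$MOUNT_ARG" \
  "<container-image>"
```

Only the source side of the mount changes with the host operating system. The target stays a Linux-style path because the container itself runs Linux.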
28
28
29
29
## Store no configuration settings in image
@@ -123,7 +123,7 @@ Follow these steps to use the Dockerfile and place the new image in your private
123
123
az acr login --name <my-registry>
124
124
```
125
125
126
-
You can also sign in with docker login if you are assigned a service principal.
126
+
You can also sign in with `docker login` if you are assigned a service principal.
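A minimal sketch of the service principal sign-in mentioned above, assuming the registry uses the usual `<my-registry>.azurecr.io` login server and that you already have the service principal's application ID and password. The function name `registry_login` and its argument order are hypothetical.

```bash
# Sketch only: sign in to a container registry with service principal credentials.
# registry_login and its arguments are hypothetical; supply your own values.
registry_login() {
  registry="$1"   # login server, for example <my-registry>.azurecr.io
  app_id="$2"     # service principal application (client) ID
  password="$3"   # service principal password (client secret)
  # --password-stdin keeps the secret off the docker command line.
  printf '%s' "$password" | docker login "$registry" --username "$app_id" --password-stdin
}
```

It could be called as `registry_login "<my-registry>.azurecr.io" "<appId>" "<client-secret>"`, with the placeholders replaced by your own values.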
@@ -18,7 +18,7 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
18
18
19
19
| Models | Description |
20
20
|--|--|
21
-
|[o1 & o1-mini](#o1-and-o1-mini-models)|[Reasoning models](../how-to/reasoning.md) with advanced problem-solving and increased focus and capability. |
21
+
|[o-series models](#o-series-models)|[Reasoning models](../how-to/reasoning.md) with advanced problem-solving and increased focus and capability. |
22
22
|[GPT-4o & GPT-4o mini & GPT-4 Turbo](#gpt-4o-and-gpt-4-turbo)| The latest most capable Azure OpenAI models with multimodal versions, which can accept both text and images as input. |
23
23
|[GPT-4o audio](#gpt-4o-audio)| GPT-4o audio models that support either low-latency, "speech in, speech out" conversational interactions or audio generation. |
24
24
|[GPT-4](#gpt-4)| A set of models that improve on GPT-3.5 and can understand and generate natural language and code. |
@@ -28,30 +28,32 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
28
28
|[Whisper](#whisper-models)| A series of models in preview that can transcribe and translate speech to text. |
29
29
|[Text to speech](#text-to-speech-models-preview) (Preview) | A series of models in preview that can synthesize text to speech. |
30
30
31
-
## o1 and o1-mini models
31
+
## o-series models
32
32
33
-
The Azure OpenAI `o1` and `o1-mini` models are specifically designed to tackle reasoning and problem-solving tasks with increased focus and capability. These models spend more time processing and understanding the user's request, making them exceptionally strong in areas like science, coding, and math compared to previous iterations.
33
+
The Azure OpenAI o<sup>*</sup> series models are specifically designed to tackle reasoning and problem-solving tasks with increased focus and capability. These models spend more time processing and understanding the user's request, making them exceptionally strong in areas like science, coding, and math compared to previous iterations.
34
34
35
35
| Model ID | Description | Max Request (tokens) | Training Data (up to) |
|`o1` (2024-12-17) | The most capable model in the o1 series, offering [enhanced reasoning abilities](../how-to/reasoning.md). <br> - Structured outputs<br> - Text, image processing <br> - Functions/Tools <br> <br> **Request access: [limited access model application](https://aka.ms/OAI/o1access)**| Input: 200,000 <br> Output: 100,000 | Oct 2023 |
38
39
|`o1-preview` (2024-09-12) | Older preview version | Input: 128,000 <br> Output: 32,768 | Oct 2023 |
39
40
|`o1-mini` (2024-09-12) | A faster and more cost-efficient option in the o1 series, ideal for coding tasks requiring speed and lower resource consumption.| Input: 128,000 <br> Output: 65,536 | Oct 2023 |
40
41
41
42
### Availability
42
43
43
-
The `o1` and `o1-mini` models are now available for API access and model deployment. **For access to `o1` registration is required, and access will be granted based on Microsoft's eligibility criteria**. Customers who previously applied and received access to `o1-preview`, don't need to reapply as they are automatically on the wait-list for the latest model.
44
+
**For access to `o3-mini` and `o1`, registration is required, and access will be granted based on Microsoft's eligibility criteria**. Customers who previously applied and received access to `o1-preview` or `o1` don't need to reapply, as they are automatically on the wait-list for the latest models in the o-series.
44
45
45
46
Request access: [limited access model application](https://aka.ms/OAI/o1access)
46
47
47
-
Once access has been granted, you will need to create a deployment for each model. If you have an existing `o1-preview` deployment in place upgrade is currently not supported, you will need to create a new deployment.
48
+
Once access has been granted, you will need to create a deployment for each model.
48
49
49
-
To learn more about the advanced `o1` series models see, [getting started with o1 series reasoning models](../how-to/reasoning.md).
50
+
To learn more about the advanced `o-series` models, see [getting started with reasoning models](../how-to/reasoning.md).
50
51
51
52
### Region availability
52
53
53
54
| Model | Region |
54
55
|---|---|
56
+
|`o3-mini`| East US2 (Global Standard) <br> Sweden Central (Global Standard) |
55
57
|`o1`| East US2 (Global Standard) <br> Sweden Central (Global Standard) |
56
58
|`o1-preview`| See the [models table](#global-standard-model-availability). |
57
59
|`o1-mini`| See the [models table](#global-provisioned-managed-model-availability). |