Commit 8c4b925

Merge branch 'main' into release-arc-data
2 parents ee23d5e + b82ca66 commit 8c4b925

141 files changed, +1806 -1335 lines changed

.openpublishing.redirection.machine-configuration.json

Lines changed: 144 additions & 79 deletions
Large diffs are not rendered by default.

articles/ai-services/openai/concepts/gpt-with-vision.md

Lines changed: 25 additions & 5 deletions
@@ -62,13 +62,33 @@ Base Pricing for GPT-4 Turbo with Vision is:
 
 See the [Tokens section of the overview](/azure/ai-services/openai/overview#tokens) for information on how text and images translate to tokens.
 
-Additionally, if you use video prompt integration with the Video Retrieval add-on, it accrues other costs:
-- Ingestion: $0.05 per minute of video
-- Transactions: $0.25 per 1000 queries of the Video Retrieval index
+If you turn on Enhancements, additional usage applies for using GPT-4 Turbo with Vision with Azure AI Vision functionality.
 
-Processing videos involves the use of extra tokens to identify key frames for analysis. The number of these additional tokens will be roughly equivalent to the sum of the tokens in the text input, plus 700 tokens.
+| Model | Price |
+|-----------------|-----------------|
+| + Enhanced add-on features for OCR | $1.5 per 1000 transactions |
+| + Enhanced add-on features for Object Detection | $1.5 per 1000 transactions |
+| + Enhanced add-on feature for “Add your Image” Image Embeddings | $1.5 per 1000 transactions |
+| + Enhanced add-on feature for “Video Retrieval” integration **<sup>1</sup>** | Ingestion: $0.05 per minute of video <br>Transactions: $0.25 per 1000 queries of the Video Retrieval index |
 
-### Example price calculation
+**<sup>1</sup>** Processing videos involves the use of extra tokens to identify key frames for analysis. The number of these additional tokens will be roughly equivalent to the sum of the tokens in the text input, plus 700 tokens.
+
+### Example image price calculation
+> [!IMPORTANT]
+> The following content is an example only, and prices are subject to change in the future.
+
+For a typical use case, take an image with both visible objects and text and a 100-token prompt input. When the service processes the prompt, it generates 100 tokens of output. In the image, both text and objects can be detected. The price of this transaction would be:
+
+| Item | Detail | Total Cost |
+|-----------------|-----------------|--------------|
+| GPT-4 Turbo with Vision input tokens | 100 text tokens | $0.001 |
+| Enhanced add-on features for OCR | $1.50 / 1000 transactions | $0.0015 |
+| Enhanced add-on features for Object Grounding | $1.50 / 1000 transactions | $0.0015 |
+| Output Tokens | 100 tokens (assumed) | $0.003 |
+| **Total Cost** | | $0.007 |
+
+
+### Example video price calculation
 
 > [!IMPORTANT]
 > The following content is an example only, and prices are subject to change in the future.
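
As a quick check on the example image price calculation added in this hunk, the table's arithmetic can be reproduced in a few lines of Python. This is a minimal sketch and not part of the commit: the per-1,000-token rates are inferred from the table rows ($0.001 for 100 input tokens, $0.003 for 100 output tokens), and the names `image_request_cost` and `video_extra_tokens` are hypothetical. The video helper reads footnote 1 literally (text input tokens plus 700), which the source describes only as a rough estimate.

```python
from decimal import Decimal

# Unit prices implied by the example table in the diff above.
# These are assumptions derived from that table, not an official price list.
INPUT_PRICE_PER_1K_TOKENS = Decimal("0.01")      # $0.001 for 100 input tokens
OUTPUT_PRICE_PER_1K_TOKENS = Decimal("0.03")     # $0.003 for 100 output tokens
ENHANCEMENT_PRICE_PER_1K_TXN = Decimal("1.50")   # OCR / object grounding add-ons


def image_request_cost(input_tokens, output_tokens, enhancement_calls):
    """Return the dollar cost of one request (hypothetical helper)."""
    token_cost = (
        INPUT_PRICE_PER_1K_TOKENS * input_tokens
        + OUTPUT_PRICE_PER_1K_TOKENS * output_tokens
    ) / 1000
    enhancement_cost = ENHANCEMENT_PRICE_PER_1K_TXN * enhancement_calls / 1000
    return token_cost + enhancement_cost


def video_extra_tokens(text_input_tokens):
    """Rough extra-token estimate for video, per footnote 1 (hypothetical helper)."""
    return text_input_tokens + 700


if __name__ == "__main__":
    # 100 input tokens, 100 output tokens, one OCR and one object grounding call.
    print(image_request_cost(100, 100, 2))
    print(video_extra_tokens(100))
```

With 100 input tokens, 100 output tokens, and two enhancement transactions (OCR and object grounding), the result agrees with the table's $0.007 total. `Decimal` is used instead of `float` so that the sub-cent amounts add up exactly.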

articles/ai-services/openai/includes/gpt-v-python.md

Lines changed: 1 addition & 1 deletion
@@ -22,7 +22,7 @@ Use this article to get started using the Azure OpenAI Python SDK to deploy and
 Currently, access to this service is granted only by application. You can apply for access to Azure OpenAI by completing the form at https://aka.ms/oai/access. Open an issue on this repo to contact us if you have an issue.
 - <a href="https://www.python.org/" target="_blank">Python 3.7.1 or later version</a>.
 - The following Python libraries: `os`
-- An Azure OpenAI Service resource with a GPT-4 Turbo with Vision model deployed. The resource must be in the `SwitzerlandNorth`, `SwedenCentral`, `WestUS`, or `AustraliaEast` Azure region. For more information about model deployment, see [the resource deployment guide](/azure/ai-services/openai/how-to/create-resource).
+- An Azure OpenAI Service resource with a GPT-4 Turbo with Vision model deployed. See [GPT-4 and GPT-4 Turbo Preview model availability](../concepts/models.md#gpt-4-and-gpt-4-turbo-preview-model-availability) for available regions. For more information about resource creation, see the [resource deployment guide](/azure/ai-services/openai/how-to/create-resource).
 - For Vision enhancement (optional): An Azure Computer Vision resource in the same region as your Azure OpenAI resource, in the paid (S0) tier.
 
 ## Set up

articles/ai-services/openai/includes/gpt-v-rest.md

Lines changed: 1 addition & 1 deletion
@@ -18,7 +18,7 @@ Use this article to get started using the Azure OpenAI REST APIs to deploy and u
 - Access granted to Azure OpenAI in the desired Azure subscription. Currently, access to this service is granted only by application. You can apply for access to Azure OpenAI by completing the form at https://aka.ms/oai/access. Open an issue on this repo to contact us if you have an issue.
 - <a href="https://www.python.org/" target="_blank">Python 3.7.1 or later version</a>.
 - The following Python libraries: `requests`, `json`.
-- An Azure OpenAI Service resource with a GPT-4 Turbo with Vision model deployed. The resource must be in the `SwitzerlandNorth`, `SwedenCentral`, `WestUS`, or `AustraliaEast` Azure region. For more information about model deployment, see [the resource deployment guide](/azure/ai-services/openai/how-to/create-resource).
+- An Azure OpenAI Service resource with a GPT-4 Turbo with Vision model deployed. See [GPT-4 and GPT-4 Turbo Preview model availability](../concepts/models.md#gpt-4-and-gpt-4-turbo-preview-model-availability) for available regions. For more information about resource creation, see the [resource deployment guide](/azure/ai-services/openai/how-to/create-resource).
 - For Vision enhancement (optional): An Azure Computer Vision resource in the same region as your Azure OpenAI resource, in the paid (S0) tier.
 
 > [!NOTE]

articles/ai-services/openai/includes/gpt-v-studio.md

Lines changed: 1 addition & 1 deletion
@@ -16,7 +16,7 @@ Start exploring GPT-4 Turbo with Vision capabilities with a no-code approach thr
 
 - An Azure subscription. <a href="https://azure.microsoft.com/free/ai-services" target="_blank">Create one for free</a>.
 - Access granted to Azure OpenAI in the desired Azure subscription. Currently, access to this service is granted only by application. You can apply for access to Azure OpenAI by completing the form at https://aka.ms/oai/access. Open an issue on this repo to contact us if you have an issue.
-- An Azure OpenAI Service resource. The resource must be in the `SwitzerlandNorth`, `SwedenCentral`, `WestUS`, or `AustraliaEast` Azure region. For more information about resource creation, see the [resource deployment guide](/azure/ai-services/openai/how-to/create-resource).
+- An Azure OpenAI Service resource with a GPT-4 Turbo with Vision model deployed. See [GPT-4 and GPT-4 Turbo Preview model availability](../concepts/models.md#gpt-4-and-gpt-4-turbo-preview-model-availability) for available regions. For more information about resource creation, see the [resource deployment guide](/azure/ai-services/openai/how-to/create-resource).
 - For Vision enhancement (optional): An Azure Computer Vision resource in the same region as your Azure OpenAI resource, in the paid (S0) tier.
 
 > [!NOTE]

articles/ai-services/speech-service/custom-neural-voice.md

Lines changed: 5 additions & 0 deletions
@@ -48,6 +48,11 @@ Here's an overview of the steps to create a custom neural voice in Speech Studio
 
 You can tune, adjust, and use your custom voice, similarly as you would use a prebuilt neural voice. Convert text into speech in real-time, or generate audio content offline with text input. You use the [REST API](./rest-text-to-speech.md), the [Speech SDK](./get-started-text-to-speech.md), or the [Speech Studio](https://speech.microsoft.com/audiocontentcreation).
 
+> [!TIP]
+> You can also use the Speech SDK and custom voice REST API to train a custom neural voice.
+>
+> Check out the code samples in the [Speech SDK repository on GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/custom-voice/README.md) to see how to use personal voice in your application.
+
 The style and the characteristics of the trained voice model depend on the style and the quality of the recordings from the voice talent used for training. However, you can make several adjustments by using [SSML (Speech Synthesis Markup Language)](./speech-synthesis-markup.md?tabs=csharp) when you make the API calls to your voice model to generate synthetic speech. SSML is the markup language used to communicate with the text to speech service to convert text into audio. The adjustments you can make include change of pitch, rate, intonation, and pronunciation correction. If the voice model is built with multiple styles, you can also use SSML to switch the styles.
 
 ## Components sequence

articles/ai-services/speech-service/openai-voices.md

Lines changed: 3 additions & 0 deletions
@@ -22,6 +22,9 @@ OpenAI text to speech voices are available via two model variants: `Neural` and
 - `Neural`: Optimized for real-time use cases with the lowest latency, but lower quality than `NeuralHD`.
 - `NeuralHD`: Optimized for quality.
 
+For a demonstration of OpenAI voices in Azure OpenAI Studio and Speech Studio, view this [introductory video](https://youtu.be/Ic505XeV3gs).
+> [!VIDEO https://www.youtube.com/embed/Ic505XeV3gs]
+
 ## Available text to speech voices in Azure AI services
 
 You might ask: If I want to use an OpenAI text to speech voice, should I use it via the Azure OpenAI Service or via Azure AI Speech? What are the scenarios that guide me to use one or the other?

articles/ai-services/speech-service/personal-voice-overview.md

Lines changed: 3 additions & 0 deletions
@@ -63,6 +63,9 @@ To get started, here's a summary of the steps to create a personal voice:
 
 Once you have a personal voice, you can [use it](./personal-voice-how-to-use.md) to synthesize speech in any of the 91 languages supported across 100+ locales. A locale tag isn't required. Personal voice uses automatic language detection at the sentence level. For more information, see [use personal voice in your application](./personal-voice-how-to-use.md).
 
+> [!TIP]
+> Check out the code samples in the [Speech SDK repository on GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/blob/master/samples/custom-voice/README.md) to see how to use personal voice in your application.
+
 ## Reference documentation
 
 > [!div class="nextstepaction"]
