Skip to content

Commit 80db3a5

Browse files
authored
Merge pull request #251531 from eric-urban/eur/whisper-stt32-preview
Whisper and STT 3.2 preview docs
2 parents 09dbc46 + b4ad905 commit 80db3a5

20 files changed

+768
-40
lines changed

articles/ai-services/openai/concepts/content-filter.md

Lines changed: 8 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -7,23 +7,26 @@ ms.author: mbullwin
77
ms.service: cognitive-services
88
ms.subservice: openai
99
ms.topic: conceptual
10-
ms.date: 06/08/2023
10+
ms.date: 09/15/2023
1111
ms.custom: template-concept
1212
manager: nitinme
1313
keywords:
1414
---
1515

1616
# Content filtering
1717

18-
Azure OpenAI Service includes a content filtering system that works alongside core models. This system works by running both the prompt and completion through an ensemble of classification models aimed at detecting and preventing the output of harmful content. The content filtering system detects and takes action on specific categories of potentially harmful content in both input prompts and output completions. Variations in API configurations and application design may affect completions and thus filtering behavior. The content filtering system supports the following languages: English, German, Japanese, Spanish, French, Italian, Portuguese, and Chinese. It might not be able to detect inappropriate content in languages that it has not been trained or tested to process.
18+
> [!IMPORTANT]
19+
> The content filtering system isn't applied to prompts and completions processed by the Whisper model in Azure OpenAI Service. Learn more about the [Whisper model in Azure OpenAI](models.md#whisper-preview).
20+
21+
Azure OpenAI Service includes a content filtering system that works alongside core models. This system works by running both the prompt and completion through an ensemble of classification models aimed at detecting and preventing the output of harmful content. The content filtering system detects and takes action on specific categories of potentially harmful content in both input prompts and output completions. Variations in API configurations and application design may affect completions and thus filtering behavior. The content filtering system supports the following languages: Chinese, English, French, German, Italian, Japanese, Portuguese, and Spanish. It might not be able to detect inappropriate content in languages that it hasn't been trained or tested to process.
1922

2023
In addition to the content filtering system, the Azure OpenAI Service performs monitoring to detect content and/or behaviors that suggest use of the service in a manner that may violate applicable product terms. For more information about understanding and mitigating risks associated with your application, see the [Transparency Note for Azure OpenAI](/legal/cognitive-services/openai/transparency-note?tabs=text). For more information about how data is processed in connection with content filtering and abuse monitoring, see [Data, privacy, and security for Azure OpenAI Service](/legal/cognitive-services/openai/data-privacy?context=/azure/ai-services/openai/context/context#preventing-abuse-and-harmful-content-generation).
2124

2225
The following sections provide information about the content filtering categories, the filtering severity levels and their configurability, and API scenarios to be considered in application design and implementation.
2326

2427
## Content filtering categories
2528

26-
The content filtering system integrated in the Azure OpenAI Service contains neural multi-class classification models aimed at detecting and filtering harmful content; the models cover four categories (hate, sexual, violence, and self-harm) across four severity levels (safe, low, medium, and high). Content detected at the 'safe' severity level is labeled in annotations but is not subject to filtering and is not configurable.
29+
The content filtering system integrated in the Azure OpenAI Service contains neural multi-class classification models aimed at detecting and filtering harmful content; the models cover four categories (hate, sexual, violence, and self-harm) across four severity levels (safe, low, medium, and high). Content detected at the 'safe' severity level is labeled in annotations but isn't subject to filtering and isn't configurable.
2730

2831
### Categories
2932

@@ -45,13 +48,13 @@ The content filtering system integrated in the Azure OpenAI Service contains neu
4548

4649
## Configurability (preview)
4750

48-
The default content filtering configuration is set to filter at the medium severity threshold for all four content harm categories for both prompts and completions. That means that content that is detected at severity level medium or high is filtered, while content detected at severity level low is not filtered by the content filters. The configurability feature is available in preview and allows customers to adjust the settings, separately for prompts and completions, to filter content for each content category at different severity levels as described in the table below:
51+
The default content filtering configuration is set to filter at the medium severity threshold for all four content harm categories for both prompts and completions. That means that content that is detected at severity level medium or high is filtered, while content detected at severity level low isn't filtered by the content filters. The configurability feature is available in preview and allows customers to adjust the settings, separately for prompts and completions, to filter content for each content category at different severity levels as described in the table below:
4952

5053
| Severity filtered | Configurable for prompts | Configurable for completions | Descriptions |
5154
|-------------------|--------------------------|------------------------------|--------------|
5255
| Low, medium, high | Yes | Yes | Strictest filtering configuration. Content detected at severity levels low, medium and high is filtered.|
5356
| Medium, high | Yes | Yes | Default setting. Content detected at severity level low is not filtered, content at medium and high is filtered.|
54-
| High | If approved<sup>\*</sup>| If approved<sup>\*</sup> | Content detected at severity levels low and medium is not filtered. Only content at severity level high is filtered. Requires approval<sup>\*</sup>.|
57+
| High | If approved<sup>\*</sup>| If approved<sup>\*</sup> | Content detected at severity levels low and medium isn't filtered. Only content at severity level high is filtered. Requires approval<sup>\*</sup>.|
5558
| No filters | If approved<sup>\*</sup>| If approved<sup>\*</sup>| No content is filtered regardless of severity level detected. Requires approval<sup>\*</sup>.|
5659

5760
<sup>\*</sup> Only customers who have been approved for modified content filtering have full content filtering control, including configuring content filters at severity level high only or turning content filters off. Apply for modified content filters via this form: [Azure OpenAI Limited Access Review: Modified Content Filters and Abuse Monitoring (microsoft.com)](https://customervoice.microsoft.com/Pages/ResponsePage.aspx?id=v4j5cvGGr0GRqy180BHbR7en2Ais5pxKtso_Pz4b1_xURE01NDY1OUhBRzQ3MkQxMUhZSE1ZUlJKTiQlQCN0PWcu)

articles/ai-services/openai/concepts/models.md

Lines changed: 14 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ description: Learn about the different model capabilities that are available wit
55
ms.service: cognitive-services
66
ms.subservice: openai
77
ms.topic: conceptual
8-
ms.date: 09/12/2023
8+
ms.date: 09/15/2023
99
ms.custom: event-tier1-build-2022, references_regions, build-2023, build-2023-dataai
1010
manager: nitinme
1111
author: mrbullwinkle #ChrisHMSFT
@@ -24,6 +24,7 @@ Azure OpenAI Service is powered by a diverse set of models with different capabi
2424
| [GPT-3.5](#gpt-35) | A set of models that improve on GPT-3 and can understand and generate natural language and code. |
2525
| [Embeddings](#embeddings-models) | A set of models that can convert text into numerical vector form to facilitate text similarity. |
2626
| [DALL-E](#dall-e-models-preview) (Preview) | A series of models in preview that can generate original images from natural language. |
27+
| [Whisper](#whisper-models-preview) (Preview) | A series of models in preview that can transcribe and translate speech to text. |
2728

2829
## GPT-4
2930

@@ -60,6 +61,12 @@ The previous embeddings models have been consolidated into the following new rep
6061

6162
The DALL-E models, currently in preview, generate images from text prompts that the user provides.
6263

64+
## Whisper (Preview)
65+
66+
The Whisper models, currently in preview, can be used for speech to text.
67+
68+
You can also use the Whisper model via Azure AI Speech [batch transcription](../../speech-service/batch-transcription-create.md) API. Check out [What is the Whisper model?](../../speech-service/whisper-overview.md) to learn more about when to use Azure AI Speech vs. Azure OpenAI Service.
69+
6370
## Model summary table and region availability
6471

6572
> [!IMPORTANT]
@@ -112,6 +119,12 @@ These models can only be used with Embedding API requests.
112119
| --- | --- | --- | --- | --- |
113120
| dalle2 | East US | N/A | 1000 | N/A |
114121

122+
### Whisper models (Preview)
123+
124+
| Model ID | Base model Regions | Fine-Tuning Regions | Max Request (audio file size) | Training Data (up to) |
125+
| --- | --- | --- | --- | --- |
126+
| whisper | North Central US, West Europe | N/A | 25 MB | N/A |
127+
115128
## Working with models
116129

117130
### Finding what models are available
Lines changed: 33 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,33 @@
1+
---
2+
services: ai-services
3+
manager: nitinme
4+
author: eric-urban
5+
ms.author: eur
6+
ms.service: openai
7+
ms.topic: include
8+
ms.date: 09/15/2023
9+
---
10+
11+
## REST API
12+
13+
In a bash shell, run the following command. You need to replace `MyDeploymentName` with the deployment name you chose when you deployed the Whisper model. Entering the model name results in an error unless you chose a deployment name that is identical to the underlying model name.
14+
15+
```bash
16+
curl $AZURE_OPENAI_ENDPOINT/openai/deployments/MyDeploymentName/audio/transcriptions?api-version=2023-09-01-preview \
17+
-H "api-key: $AZURE_OPENAI_KEY" \
18+
-H "Content-Type: multipart/form-data" \
19+
-F file="@./wikipediaOcelot.wav"
20+
```
21+
22+
The format of your first line of the command with an example endpoint would appear as follows `curl https://aoai-docs.openai.azure.com/openai/deployments/{YOUR-DEPLOYMENT_NAME_HERE}/audio/transcriptions?api-version=2023-09-01-preview \`.
23+
24+
You can get sample audio files from the [Azure AI Speech SDK repository at GitHub](https://github.com/Azure-Samples/cognitive-services-speech-sdk/tree/master/sampledata/audiofiles).
25+
26+
> [!IMPORTANT]
27+
> For production, use a secure way of storing and accessing your credentials like [Azure Key Vault](../../../key-vault/general/overview.md). For more information about credential security, see the Azure AI services [security](../../security-features.md) article.
28+
29+
## Output
30+
31+
```bash
32+
{"text":"The ocelot, Lepardus paradalis, is a small wild cat native to the southwestern United States, Mexico, and Central and South America. This medium-sized cat is characterized by solid black spots and streaks on its coat, round ears, and white neck and undersides. It weighs between 8 and 15.5 kilograms, 18 and 34 pounds, and reaches 40 to 50 centimeters 16 to 20 inches at the shoulders. It was first described by Carl Linnaeus in 1758. Two subspecies are recognized, L. p. paradalis and L. p. mitis. Typically active during twilight and at night, the ocelot tends to be solitary and territorial. It is efficient at climbing, leaping, and swimming. It preys on small terrestrial mammals such as armadillo, opossum, and lagomorphs."}
33+
```

articles/ai-services/openai/overview.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ ms.author: mbullwin
88
ms.service: cognitive-services
99
ms.subservice: openai
1010
ms.topic: overview
11-
ms.date: 07/06/2023
11+
ms.date: 09/15/2023
1212
ms.custom: event-tier1-build-2022, build-2023, build-2023-dataai
1313
recommendations: false
1414
keywords:
@@ -49,7 +49,7 @@ Apply here for access:
4949

5050
## Comparing Azure OpenAI and OpenAI
5151

52-
Azure OpenAI Service gives customers advanced language AI with OpenAI GPT-4, GPT-3, Codex, and DALL-E models with the security and enterprise promise of Azure. Azure OpenAI co-develops the APIs with OpenAI, ensuring compatibility and a smooth transition from one to the other.
52+
Azure OpenAI Service gives customers advanced language AI with OpenAI GPT-4, GPT-3, Codex, DALL-E, and Whisper models with the security and enterprise promise of Azure. Azure OpenAI co-develops the APIs with OpenAI, ensuring compatibility and a smooth transition from one to the other.
5353

5454
With Azure OpenAI, customers get the security capabilities of Microsoft Azure while running the same models as OpenAI. Azure OpenAI offers private networking, regional availability, and responsible AI content filtering.
5555

@@ -104,6 +104,8 @@ GPT-4 models are the latest available models. Due to high demand access to this
104104

105105
The DALL-E models, currently in preview, generate images from text prompts that the user provides.
106106

107+
The Whisper models, currently in preview, can be used to transcribe and translate speech to text.
108+
107109
Learn more about each model on our [models concept page](./concepts/models.md).
108110

109111
## Next steps

0 commit comments

Comments
 (0)