Skip to content

Commit d6450fe

Browse files
committed
update
1 parent 27474ea commit d6450fe

File tree

2 files changed

+30
-3
lines changed

2 files changed

+30
-3
lines changed

articles/ai-foundry/openai/concepts/models.md

Lines changed: 24 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -37,14 +37,35 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
3737

3838
## gpt-oss
3939

40+
### Region availability
41+
42+
| Model | Region |
43+
|---|---|
44+
| `gpt-oss-120b` | All Azure OpenAI regions |
45+
4046
### Capabilities
4147

4248
| Model ID | Description | Context Window | Max Output Tokens | Training Data (up to) |
4349
| --- | :--- |:--- |:---|:---: |
44-
| `gpt-oss-120b` (Preview) | - Text in/text out only <br> - Responses API <br> - Streaming <br> - Function calling <br> - Structured outputs <br> - Reasoning <br> - Available for direct deployment<sup>1</sup> and via [managed compute](../../how-to/deploy-models-managed.md) | 131,072 | 131,072 | May 31, 2024 |
45-
| `gpt-oss-20b` (Preview) | - Text in/text out only <br> - Responses API <br> - Streaming <br> - Function calling <br> - Structured outputs <br> - Reasoning <br> - Available via [managed compute only](../../how-to/deploy-models-managed.md) | 131,072 | 131,072 | May 31, 2024 |
50+
| `gpt-oss-120b` (Preview) | - Text in/text out only <br> - Chat Completions API <br> - Streaming <br> - Function calling <br> - Structured outputs <br> - Reasoning <br> - Available for deployment<sup>1</sup> and via [managed compute](../../how-to/deploy-models-managed.md) | 131,072 | 131,072 | May 31, 2024 |
51+
| `gpt-oss-20b` (Preview) | - Text in/text out only <br> - Chat Completions API <br> - Streaming <br> - Function calling <br> - Structured outputs <br> - Reasoning <br> - Available via [managed compute only](../../how-to/deploy-models-managed.md) | 131,072 | 131,072 | May 31, 2024 |
52+
53+
<sup>1</sup> Unlike other Azure OpenAI models `gpt-oss-120b` requires an [Azure AI Foundry project](/azure/ai-foundry/quickstarts/get-started-code?tabs=azure-ai-foundry&pivots=fdp-project) to deploy the model.
54+
55+
### Deploy with code
56+
57+
```cli
58+
az cognitiveservices account deployment create \
59+
--name "Foundry-project-resource" \
60+
--resource-group "test-rg" \
61+
--deployment-name "gpt-oss-120b" \
62+
--model-name "gpt-oss-120b" \
63+
--model-version "1" \
64+
--model-format "OpenAI-OSS" \
65+
--sku-capacity 10 \
66+
--sku-name "GlobalStandard"
67+
```
4668

47-
<sup>1</sup> Unlike other Azure OpenAI models `gpt-oss-120b` requires an [Azure AI Foundry project](/azure/ai-foundry/quickstarts/get-started-code?tabs=azure-ai-foundry&pivots=fdp-project).
4869

4970
## GPT-4.1 series
5071

articles/ai-foundry/openai/quotas-limits.md

Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -73,6 +73,12 @@ The following section provides you with a quick guide to the default quotas and
7373
7474
[!INCLUDE [Quota](./includes/global-batch-limits.md)]
7575

76+
## gpt-oss
77+
78+
| Model | Tokens per minute (TPM) | Requests per minute (RPM) |
79+
|----------------|-------------------|---------------------------------|
80+
| `gpt-oss-120b` | 5 M | 5 K |
81+
7682
## GPT-4 rate limits
7783

7884
### GPT-4.5 preview Global Standard

0 commit comments

Comments
 (0)