update

mrbullwinkle · mrbullwinkle · commit d6450fedf553 · 2025-08-05T23:50:38.000-04:00
diff --git a/articles/ai-foundry/openai/concepts/models.md b/articles/ai-foundry/openai/concepts/models.md
@@ -37,14 +37,35 @@ Azure OpenAI is powered by a diverse set of models with different capabilities a
 
 ## gpt-oss
 
+### Region availability
+
+| Model | Region |
+|---|---|
+| `gpt-oss-120b`  | All Azure OpenAI regions |
+
 ### Capabilities
 
 |  Model ID  | Description | Context Window | Max Output Tokens | Training Data (up to)  |
 |  --- |  :--- |:--- |:---|:---: |
-| `gpt-oss-120b` (Preview)   | - Text in/text out only <br> - Responses API <br> - Streaming <br> - Function calling <br> - Structured outputs <br> - Reasoning <br> - Available for direct deployment<sup>1</sup> and via [managed compute](../../how-to/deploy-models-managed.md)  | 131,072 | 131,072 | May 31, 2024 |
-| `gpt-oss-20b` (Preview) | - Text in/text out only <br> - Responses API <br> - Streaming <br> - Function calling <br> - Structured outputs <br> - Reasoning <br> - Available via [managed compute only](../../how-to/deploy-models-managed.md) | 131,072 | 131,072 | May 31, 2024 |
+| `gpt-oss-120b` (Preview)   | - Text in/text out only <br> - Chat Completions API <br> - Streaming <br> - Function calling <br> - Structured outputs <br> - Reasoning <br> - Available for deployment<sup>1</sup> and via [managed compute](../../how-to/deploy-models-managed.md)  | 131,072 | 131,072 | May 31, 2024 |
+| `gpt-oss-20b` (Preview) | - Text in/text out only <br> - Chat Completions API <br> - Streaming <br> - Function calling <br> - Structured outputs <br> - Reasoning <br> - Available via [managed compute only](../../how-to/deploy-models-managed.md) | 131,072 | 131,072 | May 31, 2024 |
+
+<sup>1</sup> Unlike other Azure OpenAI models `gpt-oss-120b` requires an [Azure AI Foundry project](/azure/ai-foundry/quickstarts/get-started-code?tabs=azure-ai-foundry&pivots=fdp-project) to deploy the model.
+
+### Deploy with code
+
+```cli
+az cognitiveservices account deployment create \
+  --name "Foundry-project-resource" \
+  --resource-group "test-rg" \
+  --deployment-name "gpt-oss-120b" \
+  --model-name "gpt-oss-120b" \
+  --model-version "1" \
+  --model-format "OpenAI-OSS" \
+  --sku-capacity 10 \
+  --sku-name "GlobalStandard"
+```
 
-<sup>1</sup> Unlike other Azure OpenAI models `gpt-oss-120b` requires an [Azure AI Foundry project](/azure/ai-foundry/quickstarts/get-started-code?tabs=azure-ai-foundry&pivots=fdp-project).
 
 ## GPT-4.1 series
 
diff --git a/articles/ai-foundry/openai/quotas-limits.md b/articles/ai-foundry/openai/quotas-limits.md
@@ -73,6 +73,12 @@ The following section provides you with a quick guide to the default quotas and
 
 [!INCLUDE [Quota](./includes/global-batch-limits.md)]
 
+## gpt-oss
+
+| Model          | Tokens per minute (TPM) | Requests per minute (RPM) |
+|----------------|-------------------|---------------------------------|
+| `gpt-oss-120b` | 5 M               | 5 K                             |
+
 ## GPT-4 rate limits
 
 ### GPT-4.5 preview Global Standard