You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-foundry/model-inference/concepts/models.md
+10-9Lines changed: 10 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -97,6 +97,16 @@ Meta Llama models and tools are a collection of pretrained and fine-tuned genera
97
97
98
98
See [this model collection in Azure AI Foundry portal](https://ai.azure.com/explore/models?&selectedCollection=meta).
99
99
100
+
### xAI
101
+
102
+
xAI's Grok 3 and Grok 3 Mini models are designed to excel in various enterprise domains. Grok 3, a non-reasoning model pre-trained by the Colossus datacenter, is tailored for business use cases such as data extraction, coding, and text summarization, with exceptional instruction-following capabilities. It supports a 131,072 token context window, allowing it to handle extensive inputs while maintaining coherence and depth, and is particularly adept at drawing connections across domains and languages. On the other hand, Grok 3 Mini is a lightweight reasoning model trained to tackle agentic, coding, mathematical, and deep science problems with test-time compute. It also supports a 131,072 token context window for understanding codebases and enterprise documents, and excels at using tools to solve complex logical problems in novel environments, offering raw reasoning traces for user inspection with adjustable thinking budgets.
103
+
104
+
| Model | Type | Tier | Capabilities |
105
+
| ------ | ---- | --- | ------------ |
106
+
|[grok-3](https://ai.azure.com/explore/models/grok-3/version/1/registry/azureml-xai)| chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Languages:**`en` <br /> - **Tool calling:** yes <br /> - **Response formats:** text |
107
+
|[grok-3-mini](https://ai.azure.com/explore/models/grok-3-mini/version/1/registry/azureml-xai)| chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Languages:**`en` <br /> - **Tool calling:** yes <br /> - **Response formats:** text |
108
+
109
+
100
110
## Models from Partners and Community
101
111
102
112
Models from Partners and Community available for deployment with pay-as-you-go billing (for example, Cohere models) are offered by the model provider but hosted in Microsoft-managed Azure infrastructure and accessed via API in the Azure AI Foundry. Model providers define the license terms and set the price for use of their models, while Azure AI Foundry manages the hosting infrastructure.
@@ -210,15 +220,6 @@ See [this model collection in Azure AI Foundry portal](https://ai.azure.com/expl
210
220
| ------ | ---- | --- | ------------ |
211
221
|[tsuzumi-7b](https://ai.azure.com/explore/models/Tsuzumi-7b/version/1/registry/azureml-nttdata)| chat-completion | Global standard | - **Input:** text (8,192 tokens) <br /> - **Output:** text (8,192 tokens) <br /> - **Languages:**`en` and `jp` <br /> - **Tool calling:** No <br /> - **Response formats:** Text |
212
222
213
-
### xAI
214
-
215
-
xAI's Grok 3 and Grok 3 Mini models are designed to excel in various enterprise domains. Grok 3, a non-reasoning model pre-trained by the Colossus datacenter, is tailored for business use cases such as data extraction, coding, and text summarization, with exceptional instruction-following capabilities. It supports a 131,072 token context window, allowing it to handle extensive inputs while maintaining coherence and depth, and is particularly adept at drawing connections across domains and languages. On the other hand, Grok 3 Mini is a lightweight reasoning model trained to tackle agentic, coding, mathematical, and deep science problems with test-time compute. It also supports a 131,072 token context window for understanding codebases and enterprise documents, and excels at using tools to solve complex logical problems in novel environments, offering raw reasoning traces for user inspection with adjustable thinking budgets.
216
-
217
-
| Model | Type | Tier | Capabilities |
218
-
| ------ | ---- | --- | ------------ |
219
-
|[grok-3](https://ai.azure.com/explore/models/grok-3/version/1/registry/azureml-xai)| chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Languages:**`en` <br /> - **Tool calling:** yes <br /> - **Response formats:** text |
220
-
|[grok-3-mini](https://ai.azure.com/explore/models/grok-3-mini/version/1/registry/azureml-xai)| chat-completion | Global standard | - **Input:** text (131,072 tokens) <br /> - **Output:** text (131,072 tokens) <br /> - **Languages:**`en` <br /> - **Tool calling:** yes <br /> - **Response formats:** text |
221
-
222
223
## Open and protected models
223
224
224
225
The [Azure AI model catalog](../../../ai-studio/how-to/model-catalog-overview.md) offers a larger selection of models, from a bigger range of providers. As opposite to Azure AI Foundry Models where models are provided as APIs, these models might require you to host them on your infrastructure, including the creation of an AI hub and project, and providing the underlying compute quota to host the models.
0 commit comments