Skip to content

Commit 0154b59

Browse files
committed
Phi 4 mini updates only
1 parent b18c187 commit 0154b59

File tree

3 files changed

+17
-65
lines changed

3 files changed

+17
-65
lines changed

articles/ai-studio/how-to/deploy-models-phi-4.md

Lines changed: 15 additions & 63 deletions
Original file line numberDiff line numberDiff line change
@@ -29,18 +29,6 @@ The Phi-4 family of small language models (SLMs) is a collection of instruction-
2929

3030
The Phi-4 family chat models include the following models:
3131

32-
# [Phi-4-multimodal-instruct](#tab/phi-4-multimodal-instruct)
33-
34-
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
35-
36-
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
37-
38-
39-
The following models are available:
40-
41-
* [Phi-4-multimodal-instruct](https://aka.ms/azureai/landing/Phi-4-multimodal-instruct)
42-
43-
4432
# [Phi-4-mini-instruct](#tab/phi-4-mini-instruct)
4533

4634
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
@@ -167,7 +155,7 @@ print("Model provider name:", model_info.model_provider_name)
167155
```
168156

169157
```console
170-
Model name: Phi-4-multimodal-instruct
158+
Model name: Phi-4-mini-instruct
171159
Model type: chat-completions
172160
Model provider name: Microsoft
173161
```
@@ -188,7 +176,7 @@ response = client.complete(
188176
```
189177

190178
> [!NOTE]
191-
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
179+
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
192180
193181
The response is as follows, where you can see the model's usage statistics:
194182

@@ -204,7 +192,7 @@ print("\tCompletion tokens:", response.usage.completion_tokens)
204192

205193
```console
206194
Response: As of now, it's estimated that there are about 7,000 languages spoken around the world. However, this number can vary as some languages become extinct and new ones develop. It's also important to note that the number of speakers can greatly vary between languages, with some having millions of speakers and others only a few hundred.
207-
Model: Phi-4-multimodal-instruct
195+
Model: Phi-4-mini-instruct
208196
Usage:
209197
Prompt tokens: 19
210198
Total tokens: 91
@@ -356,18 +344,6 @@ except HttpResponseError as ex:
356344

357345
The Phi-4 family chat models include the following models:
358346

359-
# [Phi-4-multimodal-instruct](#tab/phi-4-multimodal-instruct)
360-
361-
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
362-
363-
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
364-
365-
366-
The following models are available:
367-
368-
* [Phi-4-multimodal-instruct](https://aka.ms/azureai/landing/Phi-4-multimodal-instruct)
369-
370-
371347
# [Phi-4-mini-instruct](#tab/phi-4-mini-instruct)
372348

373349
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
@@ -492,7 +468,7 @@ console.log("Model provider name: ", model_info.body.model_provider_name)
492468
```
493469

494470
```console
495-
Model name: Phi-4-multimodal-instruct
471+
Model name: Phi-4-mini-instruct
496472
Model type: chat-completions
497473
Model provider name: Microsoft
498474
```
@@ -515,7 +491,7 @@ var response = await client.path("/chat/completions").post({
515491
```
516492

517493
> [!NOTE]
518-
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
494+
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
519495
520496
The response is as follows, where you can see the model's usage statistics:
521497

@@ -535,7 +511,7 @@ console.log("\tCompletion tokens:", response.body.usage.completion_tokens);
535511

536512
```console
537513
Response: As of now, it's estimated that there are about 7,000 languages spoken around the world. However, this number can vary as some languages become extinct and new ones develop. It's also important to note that the number of speakers can greatly vary between languages, with some having millions of speakers and others only a few hundred.
538-
Model: Phi-4-multimodal-instruct
514+
Model: Phi-4-mini-instruct
539515
Usage:
540516
Prompt tokens: 19
541517
Total tokens: 91
@@ -706,18 +682,6 @@ catch (error) {
706682
707683
The Phi-4 family chat models include the following models:
708684
709-
# [Phi-4-multimodal-instruct](#tab/phi-4-multimodal-instruct)
710-
711-
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
712-
713-
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
714-
715-
716-
The following models are available:
717-
718-
* [Phi-4-multimodal-instruct](https://aka.ms/azureai/landing/Phi-4-multimodal-instruct)
719-
720-
721685
# [Phi-4-mini-instruct](#tab/phi-4-mini-instruct)
722686
723687
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
@@ -857,7 +821,7 @@ Console.WriteLine($"Model provider name: {modelInfo.Value.ModelProviderName}");
857821
```
858822
859823
```console
860-
Model name: Phi-4-multimodal-instruct
824+
Model name: Phi-4-mini-instruct
861825
Model type: chat-completions
862826
Model provider name: Microsoft
863827
```
@@ -879,7 +843,7 @@ Response<ChatCompletions> response = client.Complete(requestOptions);
879843
```
880844
881845
> [!NOTE]
882-
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
846+
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
883847
884848
The response is as follows, where you can see the model's usage statistics:
885849

@@ -895,7 +859,7 @@ Console.WriteLine($"\tCompletion tokens: {response.Value.Usage.CompletionTokens}
895859

896860
```console
897861
Response: As of now, it's estimated that there are about 7,000 languages spoken around the world. However, this number can vary as some languages become extinct and new ones develop. It's also important to note that the number of speakers can greatly vary between languages, with some having millions of speakers and others only a few hundred.
898-
Model: Phi-4-multimodal-instruct
862+
Model: Phi-4-mini-instruct
899863
Usage:
900864
Prompt tokens: 19
901865
Total tokens: 91
@@ -1068,18 +1032,6 @@ catch (RequestFailedException ex)
10681032

10691033
The Phi-4 family chat models include the following models:
10701034

1071-
# [Phi-4-multimodal-instruct](#tab/phi-4-multimodal-instruct)
1072-
1073-
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
1074-
1075-
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
1076-
1077-
1078-
The following models are available:
1079-
1080-
* [Phi-4-multimodal-instruct](https://aka.ms/azureai/landing/Phi-4-multimodal-instruct)
1081-
1082-
10831035
# [Phi-4-mini-instruct](#tab/phi-4-mini-instruct)
10841036

10851037
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
@@ -1170,7 +1122,7 @@ The response is as follows:
11701122

11711123
```json
11721124
{
1173-
"model_name": "Phi-4-multimodal-instruct",
1125+
"model_name": "Phi-4-mini-instruct",
11741126
"model_type": "chat-completions",
11751127
"model_provider_name": "Microsoft"
11761128
}
@@ -1196,7 +1148,7 @@ The following example shows how you can create a basic chat completions request
11961148
```
11971149

11981150
> [!NOTE]
1199-
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
1151+
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
12001152

12011153
The response is as follows, where you can see the model's usage statistics:
12021154
@@ -1206,7 +1158,7 @@ The response is as follows, where you can see the model's usage statistics:
12061158
"id": "0a1234b5de6789f01gh2i345j6789klm",
12071159
"object": "chat.completion",
12081160
"created": 1718726686,
1209-
"model": "Phi-4-multimodal-instruct",
1161+
"model": "Phi-4-mini-instruct",
12101162
"choices": [
12111163
{
12121164
"index": 0,
@@ -1263,7 +1215,7 @@ You can visualize how streaming generates content:
12631215
"id": "23b54589eba14564ad8a2e6978775a39",
12641216
"object": "chat.completion.chunk",
12651217
"created": 1718726371,
1266-
"model": "Phi-4-multimodal-instruct",
1218+
"model": "Phi-4-mini-instruct",
12671219
"choices": [
12681220
{
12691221
"index": 0,
@@ -1286,7 +1238,7 @@ The last message in the stream has `finish_reason` set, indicating the reason fo
12861238
"id": "23b54589eba14564ad8a2e6978775a39",
12871239
"object": "chat.completion.chunk",
12881240
"created": 1718726371,
1289-
"model": "Phi-4-multimodal-instruct",
1241+
"model": "Phi-4-mini-instruct",
12901242
"choices": [
12911243
{
12921244
"index": 0,
@@ -1337,7 +1289,7 @@ Explore other parameters that you can specify in the inference client. For a ful
13371289
"id": "0a1234b5de6789f01gh2i345j6789klm",
13381290
"object": "chat.completion",
13391291
"created": 1718726686,
1340-
"model": "Phi-4-multimodal-instruct",
1292+
"model": "Phi-4-mini-instruct",
13411293
"choices": [
13421294
{
13431295
"index": 0,

articles/ai-studio/how-to/model-catalog-overview.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -83,7 +83,7 @@ Gretel | Not available | Gretel-Navigator
8383
Healthcare AI family Models | MedImageParse<BR> MedImageInsight<BR> CxrReportGen<BR> Virchow<BR> Virchow2<BR> Prism<BR> BiomedCLIP-PubMedBERT<BR> microsoft-llava-med-v1.5<BR> m42-health-llama3-med4<BR> biomistral-biomistral-7b<BR> microsoft-biogpt-large-pub<BR> microsoft-biomednlp-pub<BR> stanford-crfm-biomedlm<BR> medicalai-clinicalbert<BR> microsoft-biogpt<BR> microsoft-biogpt-large<BR> microsoft-biomednlp-pub<BR> | Not Available
8484
JAIS | Not available | jais-30b-chat
8585
Meta Llama family models | Llama-3.3-70B-Instruct<BR> Llama-3.2-3B-Instruct<BR> Llama-3.2-1B-Instruct<BR> Llama-3.2-1B<BR> Llama-3.2-90B-Vision-Instruct<BR> Llama-3.2-11B-Vision-Instruct<BR> Llama-3.1-8B-Instruct<BR> Llama-3.1-8B<BR> Llama-3.1-70B-Instruct<BR> Llama-3.1-70B<BR> Llama-3-8B-Instruct<BR> Llama-3-70B<BR> Llama-3-8B<BR> Llama-Guard-3-1B<BR> Llama-Guard-3-8B<BR> Llama-Guard-3-11B-Vision<BR> Llama-2-7b<BR> Llama-2-70b<BR> Llama-2-7b-chat<BR> Llama-2-13b-chat<BR> CodeLlama-7b-hf<BR> CodeLlama-7b-Instruct-hf<BR> CodeLlama-34b-hf<BR> CodeLlama-34b-Python-hf<BR> CodeLlama-34b-Instruct-hf<BR> CodeLlama-13b-Instruct-hf<BR> CodeLlama-13b-Python-hf<BR> Prompt-Guard-86M<BR> CodeLlama-70b-hf<BR> | Llama-3.3-70B-Instruct<BR> Llama-3.2-90B-Vision-Instruct<br> Llama-3.2-11B-Vision-Instruct<br> Llama-3.1-8B-Instruct<br> Llama-3.1-70B-Instruct<br> Llama-3.1-405B-Instruct<br> Llama-3-8B-Instruct<br> Llama-3-70B-Instruct<br> Llama-2-7b<br> Llama-2-7b-chat<br> Llama-2-70b<br> Llama-2-70b-chat<br> Llama-2-13b<br> Llama-2-13b-chat<br>
86-
Microsoft Phi family models | Phi-3-mini-4k-Instruct <br> Phi-3-mini-128k-Instruct <br> Phi-3-small-8k-Instruct <br> Phi-3-small-128k-Instruct <br> Phi-3-medium-4k-instruct <br> Phi-3-medium-128k-instruct <br> Phi-3-vision-128k-Instruct <br> Phi-3.5-mini-Instruct <br> Phi-3.5-vision-Instruct <br> Phi-3.5-MoE-Instruct <br> Phi-4 <br> Phi-4-mini-instruct <br> Phi-4-multimodal-instruct | Phi-3-mini-4k-Instruct <br> Phi-3-mini-128k-Instruct <br> Phi-3-small-8k-Instruct <br> Phi-3-small-128k-Instruct <br> Phi-3-medium-4k-instruct <br> Phi-3-medium-128k-instruct <br> <br> Phi-3.5-mini-Instruct <br> Phi-3.5-vision-Instruct <br> Phi-3.5-MoE-Instruct <br> Phi-4 <br> Phi-4-mini-instruct <br> Phi-4-multimodal-instruct
86+
Microsoft Phi family models | Phi-3-mini-4k-Instruct <br> Phi-3-mini-128k-Instruct <br> Phi-3-small-8k-Instruct <br> Phi-3-small-128k-Instruct <br> Phi-3-medium-4k-instruct <br> Phi-3-medium-128k-instruct <br> Phi-3-vision-128k-Instruct <br> Phi-3.5-mini-Instruct <br> Phi-3.5-vision-Instruct <br> Phi-3.5-MoE-Instruct <br> Phi-4 <br> Phi-4-mini-instruct | Phi-3-mini-4k-Instruct <br> Phi-3-mini-128k-Instruct <br> Phi-3-small-8k-Instruct <br> Phi-3-small-128k-Instruct <br> Phi-3-medium-4k-instruct <br> Phi-3-medium-128k-instruct <br> <br> Phi-3.5-mini-Instruct <br> Phi-3.5-vision-Instruct <br> Phi-3.5-MoE-Instruct <br> Phi-4 <br> Phi-4-mini-instruct
8787
Mistral family models | mistralai-Mixtral-8x22B-v0-1 <br> mistralai-Mixtral-8x22B-Instruct-v0-1 <br> mistral-community-Mixtral-8x22B-v0-1 <br> mistralai-Mixtral-8x7B-v01 <br> mistralai-Mistral-7B-Instruct-v0-2 <br> mistralai-Mistral-7B-v01 <br> mistralai-Mixtral-8x7B-Instruct-v01 <br> mistralai-Mistral-7B-Instruct-v01 | Mistral-large (2402) <br> Mistral-large (2407) <br> Mistral-small <br> Ministral-3B <br> Mistral-NeMo
8888
Nixtla | Not available | TimeGEN-1
8989

articles/ai-studio/includes/region-availability-maas.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -54,7 +54,7 @@ Llama 3.1 405B Instruct | [Microsoft Managed countries/regions](/partner-center
5454

5555
| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine tuning |
5656
|---------|---------|---------|---------|
57-
Phi-4 <br> Phi-4-mini-instruct <br> Phi-4-multimodal-instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
57+
Phi-4 <br> Phi-4-mini-instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
5858
Phi-3.5-vision-Instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
5959
Phi-3.5-MoE-Instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | East US 2 |
6060
Phi-3.5-Mini-Instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | East US 2 | East US 2 |

0 commit comments

Comments
 (0)