You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
35
+
36
+
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
191
+
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
180
192
181
193
The response is as follows, where you can see the model's usage statistics:
Response: As of now, it's estimated that there are about 7,000 languages spoken around the world. However, this number can vary as some languages become extinct and new ones develop. It's also important to note that the number of speakers can greatly vary between languages, with some having millions of speakers and others only a few hundred.
195
-
Model: Phi-4-mini-instruct
207
+
Model: Phi-4-multimodal-instruct
196
208
Usage:
197
209
Prompt tokens: 19
198
210
Total tokens: 91
@@ -341,6 +353,18 @@ except HttpResponseError as ex:
341
353
342
354
The Phi-4 family chat models include the following models:
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
359
+
360
+
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
@@ -488,7 +512,7 @@ var response = await client.path("/chat/completions").post({
488
512
```
489
513
490
514
> [!NOTE]
491
-
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
515
+
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
492
516
493
517
The response is as follows, where you can see the model's usage statistics:
Response: As of now, it's estimated that there are about 7,000 languages spoken around the world. However, this number can vary as some languages become extinct and new ones develop. It's also important to note that the number of speakers can greatly vary between languages, with some having millions of speakers and others only a few hundred.
511
-
Model: Phi-4-mini-instruct
535
+
Model: Phi-4-multimodal-instruct
512
536
Usage:
513
537
Prompt tokens: 19
514
538
Total tokens: 91
@@ -676,6 +700,18 @@ catch (error) {
676
700
677
701
The Phi-4 family chat models include the following models:
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
706
+
707
+
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
876
+
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
841
877
842
878
The response is as follows, where you can see the model's usage statistics:
Response: As of now, it's estimated that there are about 7,000 languages spoken around the world. However, this number can vary as some languages become extinct and new ones develop. It's also important to note that the number of speakers can greatly vary between languages, with some having millions of speakers and others only a few hundred.
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
1065
+
1066
+
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites -with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
@@ -1113,7 +1161,7 @@ The response is as follows:
1113
1161
1114
1162
```json
1115
1163
{
1116
-
"model_name": "Phi-4-mini-instruct",
1164
+
"model_name": "Phi-4-multimodal-instruct",
1117
1165
"model_type": "chat-completions",
1118
1166
"model_provider_name": "Microsoft"
1119
1167
}
@@ -1139,7 +1187,7 @@ The following example shows how you can create a basic chat completions request
1139
1187
```
1140
1188
1141
1189
> [!NOTE]
1142
-
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
1190
+
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
1143
1191
1144
1192
The response is as follows, where you can see the model's usage statistics:
1145
1193
@@ -1149,7 +1197,7 @@ The response is as follows, where you can see the model's usage statistics:
1149
1197
"id": "0a1234b5de6789f01gh2i345j6789klm",
1150
1198
"object": "chat.completion",
1151
1199
"created": 1718726686,
1152
-
"model": "Phi-4-mini-instruct",
1200
+
"model": "Phi-4-multimodal-instruct",
1153
1201
"choices": [
1154
1202
{
1155
1203
"index": 0,
@@ -1206,7 +1254,7 @@ You can visualize how streaming generates content:
1206
1254
"id": "23b54589eba14564ad8a2e6978775a39",
1207
1255
"object": "chat.completion.chunk",
1208
1256
"created": 1718726371,
1209
-
"model": "Phi-4-mini-instruct",
1257
+
"model": "Phi-4-multimodal-instruct",
1210
1258
"choices": [
1211
1259
{
1212
1260
"index": 0,
@@ -1229,7 +1277,7 @@ The last message in the stream has `finish_reason` set, indicating the reason fo
1229
1277
"id": "23b54589eba14564ad8a2e6978775a39",
1230
1278
"object": "chat.completion.chunk",
1231
1279
"created": 1718726371,
1232
-
"model": "Phi-4-mini-instruct",
1280
+
"model": "Phi-4-multimodal-instruct",
1233
1281
"choices": [
1234
1282
{
1235
1283
"index": 0,
@@ -1280,7 +1328,7 @@ Explore other parameters that you can specify in the inference client. For a ful
| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine tuning |
63
63
|---------|---------|---------|---------|
64
-
Phi-4 <br> Phi-4-mini-instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
64
+
Phi-4 <br> Phi-4-mini-instruct <br> Phi-4-multimodal-instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
65
65
Phi-3.5-vision-Instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
66
66
Phi-3.5-MoE-Instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | East US 2 |
67
67
Phi-3.5-Mini-Instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | East US 2 | East US 2 |
0 commit comments