You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
35
+
36
+
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
191
+
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
180
192
181
193
The response is as follows, where you can see the model's usage statistics:
Response: As of now, it's estimated that there are about 7,000 languages spoken around the world. However, this number can vary as some languages become extinct and new ones develop. It's also important to note that the number of speakers can greatly vary between languages, with some having millions of speakers and others only a few hundred.
195
-
Model: Phi-4-mini-instruct
207
+
Model: Phi-4-multimodal-instruct
196
208
Usage:
197
209
Prompt tokens: 19
198
210
Total tokens: 91
@@ -344,6 +356,18 @@ except HttpResponseError as ex:
344
356
345
357
The Phi-4 family chat models include the following models:
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
362
+
363
+
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
@@ -491,7 +515,7 @@ var response = await client.path("/chat/completions").post({
491
515
```
492
516
493
517
> [!NOTE]
494
-
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
518
+
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
495
519
496
520
The response is as follows, where you can see the model's usage statistics:
Response: As of now, it's estimated that there are about 7,000 languages spoken around the world. However, this number can vary as some languages become extinct and new ones develop. It's also important to note that the number of speakers can greatly vary between languages, with some having millions of speakers and others only a few hundred.
514
-
Model: Phi-4-mini-instruct
538
+
Model: Phi-4-multimodal-instruct
515
539
Usage:
516
540
Prompt tokens: 19
517
541
Total tokens: 91
@@ -682,6 +706,18 @@ catch (error) {
682
706
683
707
The Phi-4 family chat models include the following models:
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
712
+
713
+
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
882
+
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
847
883
848
884
The response is as follows, where you can see the model's usage statistics:
Response: As of now, it's estimated that there are about 7,000 languages spoken around the world. However, this number can vary as some languages become extinct and new ones develop. It's also important to note that the number of speakers can greatly vary between languages, with some having millions of speakers and others only a few hundred.
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
1074
+
1075
+
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites -with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
@@ -1122,7 +1170,7 @@ The response is as follows:
1122
1170
1123
1171
```json
1124
1172
{
1125
-
"model_name": "Phi-4-mini-instruct",
1173
+
"model_name": "Phi-4-multimodal-instruct",
1126
1174
"model_type": "chat-completions",
1127
1175
"model_provider_name": "Microsoft"
1128
1176
}
@@ -1148,7 +1196,7 @@ The following example shows how you can create a basic chat completions request
1148
1196
```
1149
1197
1150
1198
> [!NOTE]
1151
-
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
1199
+
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
1152
1200
1153
1201
The response is as follows, where you can see the model's usage statistics:
1154
1202
@@ -1158,7 +1206,7 @@ The response is as follows, where you can see the model's usage statistics:
1158
1206
"id": "0a1234b5de6789f01gh2i345j6789klm",
1159
1207
"object": "chat.completion",
1160
1208
"created": 1718726686,
1161
-
"model": "Phi-4-mini-instruct",
1209
+
"model": "Phi-4-multimodal-instruct",
1162
1210
"choices": [
1163
1211
{
1164
1212
"index": 0,
@@ -1215,7 +1263,7 @@ You can visualize how streaming generates content:
1215
1263
"id": "23b54589eba14564ad8a2e6978775a39",
1216
1264
"object": "chat.completion.chunk",
1217
1265
"created": 1718726371,
1218
-
"model": "Phi-4-mini-instruct",
1266
+
"model": "Phi-4-multimodal-instruct",
1219
1267
"choices": [
1220
1268
{
1221
1269
"index": 0,
@@ -1238,7 +1286,7 @@ The last message in the stream has `finish_reason` set, indicating the reason fo
1238
1286
"id": "23b54589eba14564ad8a2e6978775a39",
1239
1287
"object": "chat.completion.chunk",
1240
1288
"created": 1718726371,
1241
-
"model": "Phi-4-mini-instruct",
1289
+
"model": "Phi-4-multimodal-instruct",
1242
1290
"choices": [
1243
1291
{
1244
1292
"index": 0,
@@ -1289,7 +1337,7 @@ Explore other parameters that you can specify in the inference client. For a ful
| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine tuning |
56
56
|---------|---------|---------|---------|
57
-
Phi-4 <br> Phi-4-mini-instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
57
+
Phi-4 <br> Phi-4-mini-instruct <br> Phi-4-multimodal-instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
58
58
Phi-3.5-vision-Instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
59
59
Phi-3.5-MoE-Instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | East US 2 |
60
60
Phi-3.5-Mini-Instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | East US 2 | East US 2 |
0 commit comments