You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
35
-
36
-
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
179
+
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
192
180
193
181
The response is as follows, where you can see the model's usage statistics:
Response: As of now, it's estimated that there are about 7,000 languages spoken around the world. However, this number can vary as some languages become extinct and new ones develop. It's also important to note that the number of speakers can greatly vary between languages, with some having millions of speakers and others only a few hundred.
207
-
Model: Phi-4-multimodal-instruct
195
+
Model: Phi-4-mini-instruct
208
196
Usage:
209
197
Prompt tokens: 19
210
198
Total tokens: 91
@@ -356,18 +344,6 @@ except HttpResponseError as ex:
356
344
357
345
The Phi-4 family chat models include the following models:
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
362
-
363
-
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
@@ -515,7 +491,7 @@ var response = await client.path("/chat/completions").post({
515
491
```
516
492
517
493
> [!NOTE]
518
-
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
494
+
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
519
495
520
496
The response is as follows, where you can see the model's usage statistics:
Response: As of now, it's estimated that there are about 7,000 languages spoken around the world. However, this number can vary as some languages become extinct and new ones develop. It's also important to note that the number of speakers can greatly vary between languages, with some having millions of speakers and others only a few hundred.
538
-
Model: Phi-4-multimodal-instruct
514
+
Model: Phi-4-mini-instruct
539
515
Usage:
540
516
Prompt tokens: 19
541
517
Total tokens: 91
@@ -706,18 +682,6 @@ catch (error) {
706
682
707
683
The Phi-4 family chat models include the following models:
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0 models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
712
-
713
-
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites - with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
846
+
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
883
847
884
848
The response is as follows, where you can see the model's usage statistics:
Response: As of now, it's estimated that there are about 7,000 languages spoken around the world. However, this number can vary as some languages become extinct and new ones develop. It's also important to note that the number of speakers can greatly vary between languages, with some having millions of speakers and others only a few hundred.
Phi-4-multimodal-instruct is a lightweight open multimodal foundation model that leverages the language, vision, and speech research and datasets used for Phi-3.5 and 4.0models. The model processes text, image, and audio inputs, and generates text outputs. The model underwent an enhancement process, incorporating both supervised fine-tuning, and direct preference optimization to support precise instruction adherence and safety measures.
1074
-
1075
-
The Phi-4-multimodal-instruct model comes in the following variant with a 128K token length.
Phi-4-mini-instruct is a lightweight open model built upon synthetic data and filtered publicly available websites -with a focus on high-quality, reasoning dense data. The model belongs to the Phi-4 model family and supports 128K token context length. The model underwent an enhancement process, incorporating both supervised fine-tuning and direct preference optimization to support precise instruction adherence and robust safety measures.
@@ -1170,7 +1122,7 @@ The response is as follows:
1170
1122
1171
1123
```json
1172
1124
{
1173
-
"model_name": "Phi-4-multimodal-instruct",
1125
+
"model_name": "Phi-4-mini-instruct",
1174
1126
"model_type": "chat-completions",
1175
1127
"model_provider_name": "Microsoft"
1176
1128
}
@@ -1196,7 +1148,7 @@ The following example shows how you can create a basic chat completions request
1196
1148
```
1197
1149
1198
1150
> [!NOTE]
1199
-
> Phi-4-multimodal-instruct, Phi-4-mini-instruct, and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
1151
+
> Phi-4-mini-instruct and Phi-4 don't support system messages (`role="system"`). When you use the Azure AI model inference API, system messages are translated to user messages, which is the closest capability available. This translation is offered for convenience, but it's important for you to verify that the model is following the instructions in the system message with the right level of confidence.
1200
1152
1201
1153
The response is as follows, where you can see the model's usage statistics:
1202
1154
@@ -1206,7 +1158,7 @@ The response is as follows, where you can see the model's usage statistics:
1206
1158
"id": "0a1234b5de6789f01gh2i345j6789klm",
1207
1159
"object": "chat.completion",
1208
1160
"created": 1718726686,
1209
-
"model": "Phi-4-multimodal-instruct",
1161
+
"model": "Phi-4-mini-instruct",
1210
1162
"choices": [
1211
1163
{
1212
1164
"index": 0,
@@ -1263,7 +1215,7 @@ You can visualize how streaming generates content:
1263
1215
"id": "23b54589eba14564ad8a2e6978775a39",
1264
1216
"object": "chat.completion.chunk",
1265
1217
"created": 1718726371,
1266
-
"model": "Phi-4-multimodal-instruct",
1218
+
"model": "Phi-4-mini-instruct",
1267
1219
"choices": [
1268
1220
{
1269
1221
"index": 0,
@@ -1286,7 +1238,7 @@ The last message in the stream has `finish_reason` set, indicating the reason fo
1286
1238
"id": "23b54589eba14564ad8a2e6978775a39",
1287
1239
"object": "chat.completion.chunk",
1288
1240
"created": 1718726371,
1289
-
"model": "Phi-4-multimodal-instruct",
1241
+
"model": "Phi-4-mini-instruct",
1290
1242
"choices": [
1291
1243
{
1292
1244
"index": 0,
@@ -1337,7 +1289,7 @@ Explore other parameters that you can specify in the inference client. For a ful
| Model | Offer Availability Region | Hub/Project Region for Deployment | Hub/Project Region for Fine tuning |
56
56
|---------|---------|---------|---------|
57
-
Phi-4 <br> Phi-4-mini-instruct <br> Phi-4-multimodal-instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
57
+
Phi-4 <br> Phi-4-mini-instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
58
58
Phi-3.5-vision-Instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | Not available |
59
59
Phi-3.5-MoE-Instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | East US 2 |
60
60
Phi-3.5-Mini-Instruct | Not applicable | East US <br> East US 2 <br> North Central US <br> South Central US <br> Sweden Central <br> West US <br> West US 3 | East US 2 | East US 2 |
0 commit comments