You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-foundry/how-to/fine-tune-serverless.md
+20-20Lines changed: 20 additions & 20 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -22,11 +22,11 @@ Azure AI Foundry enables you to customize large language models to your specific
22
22
23
23
**Cost Efficiency**: Azure AI Foundry's fine-tuning can be more cost-effective, especially for large-scale deployments, thanks to pay-as-you-go pricing.
24
24
25
-
**Model Variety**: Azure AI Foundry's Serverless API finetuning offers support for both proprietary and open-source models, providing users with the flexibility to select the models that best suit their needs without being restricted to a single type.
25
+
**Model Variety**: Azure AI Foundry's Serverless API fine-tuning offers support for both proprietary and open-source models, providing users with the flexibility to select the models that best suit their needs without being restricted to a single type.
26
26
27
27
**Customization and Control**: Azure AI Foundry provides greater customization and control over the fine-tuning process, enabling users to tailor models more precisely to their specific requirements.
28
28
29
-
In this article, you will discover how to fine-tune models that are deployed using serverless API's in [Azure AI Foundry](https://ai.azure.com).
29
+
In this article, you will discover how to fine-tune models that are deployed using serverless APIs in [Azure AI Foundry](https://ai.azure.com).
30
30
31
31
32
32
## Prerequisites
@@ -61,7 +61,7 @@ You can also go to the Azure AI Foundry portal to view all models that contain f
61
61
1. Navigate to the model catalog.
62
62
1. Select the **Fine-tuning tasks** filter.
63
63
64
-
:::image type="content" source="../media/how-to/fine-tune/fine-tune-serverless/fine-tune-filters.png" alt-text="Screenshot of model catalog finetuning filter options." lightbox="../media/how-to/fine-tune/fine-tune-serverless/fine-tune-filters.png":::
64
+
:::image type="content" source="../media/how-to/fine-tune/fine-tune-serverless/fine-tune-filters.png" alt-text="Screenshot of model catalog fine-tuning filter options." lightbox="../media/how-to/fine-tune/fine-tune-serverless/fine-tune-filters.png":::
65
65
66
66
1. Select **All** or select a specific task.
67
67
@@ -209,9 +209,9 @@ Here are some of the tasks you can do on the **Models** tab:
209
209
:::image type="content" source="../media/how-to/fine-tune/fine-tune-serverless/fine-tune-details.png" alt-text="Screenshot of the fine-tuning details dashboard." lightbox="../media/how-to/fine-tune/fine-tune-serverless/fine-tune-details.png":::
210
210
211
211
212
-
### Supported enterprise scenarios for finetuning
212
+
### Supported enterprise scenarios for fine-tuning
213
213
214
-
Several enterprise scenarios are supported for MaaS finetuning. The table below outlines the supported configurations for user storage networking and authentication to ensure smooth operation within enterprise scenarios:
214
+
Several enterprise scenarios are supported for MaaS fine-tuning. The table below outlines the supported configurations for user storage networking and authentication to ensure smooth operation within enterprise scenarios:
215
215
216
216
>[!Note]
217
217
>- Data connections auth can be changed via AI Foundry by clicking on the datastore connection which your dataset is stored in, and navigating to the **Access details** > **Authentication Method** setting.
@@ -222,18 +222,18 @@ Several enterprise scenarios are supported for MaaS finetuning. The table below
| Public Network Access = Enabled | Account key enabled | SAS/Account Key | Yes, UX and SDK |
224
224
| Public Network Access = Enabled | Account key disabled | Entra-Based Auth (Credentialless) | Yes, UX and SDK <br><br> *Note:* for UX, you may need to add Storage Blob Data Reader or Storage Blob Data Contributor for your user ID on the storage account, or change the connection's authentication to use Account key/SAS token ||||
225
-
| Enabled from selected virtual networks and IP addresses | Account key enabled | Account key | Yes, UX and SDK <br><br> *Note:*: for UX, the IP of the compute running the browser must be in the selected list |
226
-
| Enabled from selected virtual networks and IP addresses | Account key enabled | SAS | Yes, UX and SDK <br><br> *Note:*: for UX, the IP of the compute running the browser must be in the selected list |
225
+
| Enabled from selected virtual networks and IP addresses | Account key enabled | Account key | Yes, UX and SDK <br><br> *Note:* for UX, the IP of the compute running the browser must be in the selected list |
226
+
| Enabled from selected virtual networks and IP addresses | Account key enabled | SAS | Yes, UX and SDK <br><br> *Note:* for UX, the IP of the compute running the browser must be in the selected list |
227
227
| Enabled from selected virtual networks and IP addresses | Account key disabled | Entra-Based Auth (Credentialless) | Yes, UX and SDK. <br><br>*Note:* for UX, you may need to add Storage Blob Data Reader or Storage Blob Data Contributor for your user ID on the storage account, or change the connection's authentication to use Account key/SAS token. Also ensure the IP of the compute running the browser must be in the selected list ||||
228
228
| Public Network Access = Disabled | Account key enabled | SAS/Account Key | Yes, UX and SDK. <br><br> *Note:* for UX data upload and submission to work, the workspace _needs to be accessed from within the Vnet_ that has appropriate access to the storage |
229
229
| Public Network Access = Disabled | Account key disabled | Entra-Based Auth (Credentialless) | Yes, UX and SDK. <br><br> *Note:* for UX data upload and submission to work, the workspace _needs to be accessed from within the Vnet_ that has appropriate access to the storage |
230
230
231
231
232
232
The scenarios above should work in a Managed Vnet workspace as well. See setup of Managed Vnet AI Foundry hub here: [How to configure a managed network for Azure AI Foundry hubs](./configure-managed-network.md)
233
233
234
-
Customer-Managed Keys (CMKs) is **not** a supported enterprise scenario with MaaS finetuning.
234
+
Customer-Managed Keys (CMKs) is **not** a supported enterprise scenario with MaaS fine-tuning.
235
235
236
-
Issues finetuning with unique network setups on the workspace and storage usually points to a networking setup issue.
236
+
Issues fine-tuning with unique network setups on the workspace and storage usually points to a networking setup issue.
237
237
238
238
---
239
239
@@ -284,7 +284,7 @@ You can delete a fine-tuned model from the fine-tuning model list in [Azure AI F
284
284
285
285
### Create a client to consume the model
286
286
287
-
The following sections walk you through how to fine-tune a model in python. To find a notebook example of this code, please see [FineTuning LLM with Model-As-Service](https://github.com/Azure/azureml-examples/blob/main/sdk/python/jobs/finetuning/standalone/model-as-a-service/chat-completion/chat_completion_with_model_as_service.ipynb)
287
+
The following sections walk you through how to fine-tune a model in python. To find a notebook example of this code, please see [Fine-tuning LLM with Model-As-Service](https://github.com/Azure/azureml-examples/blob/main/sdk/python/jobs/finetuning/standalone/model-as-a-service/chat-completion/chat_completion_with_model_as_service.ipynb)
288
288
289
289
1. Ensure to install dependencies to start fine-tuning your model.
290
290
@@ -334,7 +334,7 @@ workspace.id
334
334
335
335
The AI Foundry model catalog offers fine-tuning support for multiple types of models, including chat completions and text generation. For a list of models that support fine-tuning and the Azure regions of support for fine-tuning, see [region availability for models in serverless API endpoints.](deploy-models-serverless-availability.md) Fine-tuning tasks are available only to users whose Azure subscription belongs to a billing account in a country/region where the model provider has made the offer available. If the offer is available in the relevant region, the user then must have a project resource in the Azure region where the model is available for deployment or fine-tuning, as applicable.
336
336
337
-
For this example, we use a Phi-4-mini-instruct model. In this code snippet, the model id property of the model will be passed as input to the fine tuning job. This is also available as the Asset ID field in model details page in Azure AI Foundry Model Catalog.
337
+
For this example, we use a Phi-4-mini-instruct model. In this code snippet, the model ID property of the model will be passed as input to the fine tuning job. This is also available as the Asset ID field in model details page in Azure AI Foundry Model Catalog.
338
338
339
339
```python
340
340
model_name ="Phi-4-mini-instruct"
@@ -423,7 +423,7 @@ except:
423
423
424
424
### Create validation data
425
425
426
-
The next step provides options to configure the model to use validation data in the training process. If you don't want to use validation data, you can choose to skip this step to continue to the the next section. Otherwise, if you have a validation dataset, you can either choose existing prepared validation data or upload new prepared validation data to use when customizing your model.
426
+
The next step provides options to configure the model to use validation data in the training process. If you don't want to use validation data, you can choose to skip this step to continue to the next section. Otherwise, if you have a validation dataset, you can either choose existing prepared validation data or upload new prepared validation data to use when customizing your model.
427
427
428
428
```python
429
429
from azure.ai.ml.entities import Data
@@ -491,13 +491,13 @@ There are following set of parameters that are required to fine-tune your model.
491
491
1. model: Base model to fine-tune.
492
492
1. training_data: Training data for fine-tuning the base model.
493
493
1. validation_data: Validation data for fine-tuning the base model.
494
-
1. task: FineTuning task to perform. eg. CHAT_COMPLETION for chat-completion fine-tuning jobs.
494
+
1. task: Fine-tuning task to perform. eg. CHAT_COMPLETION for chat-completion fine-tuning jobs.
495
495
1. outputs: Output registered model name.
496
496
497
497
The following parameters are optional:
498
498
499
499
1. hyperparameters: Parameters that control the fine-tuning behavior at run-time.
500
-
1. name: FineTuning job name
500
+
1. name: Fine-tuning job name
501
501
1. experiment_name: Experiment name for fin-tuning job.
### Supported enterprise scenarios for fine-tuning
580
580
581
-
Several enterprise scenarios are supported for MaaS finetuning. The table below outlines the supported configurations for user storage networking and authentication to ensure smooth operation within enterprise scenarios:
581
+
Several enterprise scenarios are supported for MaaS fine-tuning. The table below outlines the supported configurations for user storage networking and authentication to ensure smooth operation within enterprise scenarios:
582
582
583
583
>[!Note]
584
584
>- Data connections auth can be changed via AI Foundry by clicking on the datastore connection which your dataset is stored in, and navigating to the **Access details**>**Authentication Method** setting.
@@ -589,18 +589,18 @@ Several enterprise scenarios are supported for MaaS finetuning. The table below
| Public Network Access = Enabled | Account key disabled | Entra-Based Auth (Credentialless) | Yes, UXandSDK<br><br>*Note:*forUX, you may need to add Storage Blob Data Reader or Storage Blob Data Contributor for your user ID on the storage account, or change the connection's authentication to use Account key/SAS token | | | |
592
-
| Enabled from selected virtual networks andIP addresses | Account key enabled | Account key | Yes, UXandSDK<br><br>*Note:*:forUX, the IP of the compute running the browser must be in the selected list|
593
-
| Enabled from selected virtual networks andIP addresses | Account key enabled |SAS| Yes, UXandSDK<br><br>*Note:*:forUX, the IP of the compute running the browser must be in the selected list|
592
+
| Enabled from selected virtual networks andIP addresses | Account key enabled | Account key | Yes, UXandSDK<br><br>*Note:*forUX, the IP of the compute running the browser must be in the selected list|
593
+
| Enabled from selected virtual networks andIP addresses | Account key enabled |SAS| Yes, UXandSDK<br><br>*Note:*forUX, the IP of the compute running the browser must be in the selected list|
594
594
| Enabled from selected virtual networks andIP addresses | Account key disabled | Entra-Based Auth (Credentialless) | Yes, UXandSDK. <br><br>*Note:*forUX, you may need to add Storage Blob Data Reader or Storage Blob Data Contributor for your user ID on the storage account, or change the connection's authentication to use Account key/SAS token. Also ensure the IP of the compute running the browser must be in the selected list | | | |
595
595
| Public Network Access = Disabled | Account key enabled |SAS/Account Key | Yes, UXandSDK. <br><br>*Note:*forUX data upload and submission to work, the workspace _needs to be accessed from within the Vnet_ that has appropriate access to the storage |
596
596
| Public Network Access = Disabled | Account key disabled | Entra-Based Auth (Credentialless) | Yes, UXandSDK. <br><br>*Note:*forUX data upload and submission to work, the workspace _needs to be accessed from within the Vnet_ that has appropriate access to the storage |
597
597
598
598
599
599
The scenarios above should work in a Managed Vnet workspace as well. See setup of Managed Vnet AI Foundry hub here: [How to configure a managed network for Azure AI Foundry hubs](./configure-managed-network.md)
600
600
601
-
Customer-Managed Keys (CMKs) is**not** a supported enterprise scenario with MaaS finetuning.
601
+
Customer-Managed Keys (CMKs) is**not** a supported enterprise scenario with MaaS fine-tuning.
602
602
603
-
Issues finetuningwith unique network setups on the workspace and storage usually points to a networking setup issue.
603
+
Issues fine-tuningwith unique network setups on the workspace and storage usually points to a networking setup issue.
0 commit comments