You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/ai-services/openai/how-to/fine-tuning.md
+80-1Lines changed: 80 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -44,19 +44,98 @@ We use LoRA, or low rank approximation, to fine-tune models in a way that reduce
44
44
45
45
::: zone-end
46
46
47
-
## Global Standard (preview)
47
+
## Other Deployment Types
48
48
49
49
Azure OpenAI fine-tuning supports [global standard deployments](./deployment-types.md#global-standard) in East US2, North Central US, and Sweden Central for:
50
50
51
51
-`gpt-4o-mini-2024-07-18`
52
52
-`gpt-4o-2024-08-06`
53
53
54
+
And supports [regional provisioned managed](./deployment-types.md#provisioned) in North Central US and Switzerland West for:
55
+
56
+
-`gpt-4o-mini-2024-07-18`
57
+
-`gpt-4o-2024-08-06`
58
+
59
+
### Global Standard (preview)
60
+
54
61
Global standard fine-tuned deployments offer [cost savings](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/), but custom model weights may temporarily be stored outside the geography of your Azure OpenAI resource.
55
62
56
63
:::image type="content" source="../media/fine-tuning/global-standard.png" alt-text="Screenshot of the global standard deployment user experience with a fine-tuned model." lightbox="../media/fine-tuning/global-standard.png":::
57
64
58
65
Global Standard fine-tuning deployments currently do not support vision and structured outputs.
59
66
67
+
### Provisioned Managed (preview)
68
+
69
+
Provisioned managed fine-tuned deployments offer [predictable performance](../concepts/provisioned-throughput.md#what-do-the-provisioned-deployment-types-provide) for fine-tuned deployments. As part of public preview, provisioned managed deployments may be created regionally via the data-plane [REST API](../reference.md#data-plane-inference) version `2024-10-01` or newer. See below for examples.
70
+
71
+
Provisioned Managed fine-tuning deployments currently do not support vision and structured outputs.
72
+
73
+
#### Creating a Provisioned Managed deployment
74
+
75
+
To create a new deployment, make an HTTP PUT call via the [Deployments - Create or Update REST API](/rest/api/aiservices/accountmanagement/deployments/create-or-update?view=rest-aiservices-accountmanagement-2024-10-01&tabs=HTTP&preserve-view=true). The approach is similar to performing [cross region deployment](#cross-region-deployment) with the following exceptions:
76
+
77
+
- You must provide a sku name of `ProvisionedStandard`.
78
+
- The capacity must be declared in PTUs.
79
+
- The `api-version` must be `2024-10-01` or newer.
80
+
- The HTTP method should be `PUT`.
81
+
82
+
For example, to deploy a gpt-4o-mini model:
83
+
84
+
```bash
85
+
curl -X PUT "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.CognitiveServices/accounts/<RESOURCE_NAME>/deployments/<MODEL_DEPLOYMENT_NAME>api-version=2024-10-01" \
#### Scaling a fine-tuned model on Provisioned Managed
102
+
103
+
To scale a fine-tuned provision managed deployment to increase or decrease PTU capacity, perform the same `PUT` REST API call as you did when [creating the deployment](#creating-a-provisioned-managed-deployment) and provide an updated `capacity` value for the `sku`. Keep in mind, provisioned deployments must scale in [minimum increments](../concepts/provisioned-throughput.md#how-much-throughput-per-ptu-you-get-for-each-model).
104
+
105
+
For example, to scale the model deployed in the previous section from 25 to 40 PTU, make another `PUT` call and increase the capacity:
106
+
107
+
```bash
108
+
curl -X PUT "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.CognitiveServices/accounts/<RESOURCE_NAME>/deployments/<MODEL_DEPLOYMENT_NAME>api-version=2024-10-01" \
To delete a deployment, use the [Deployments - Delete REST API(/rest/api/aiservices/accountmanagement/deployments/delete?view=rest-aiservices-accountmanagement-2024-10-01&tabs=HTTP&preserve-view=true) and send an HTTP DELETE to the deployment resource. Like with creating deployments, you must include the following parameters:
127
+
128
+
- Azure subscription id
129
+
- Azure resource group name
130
+
- Azure OpenAI resource name
131
+
- Name of the PTU deployment to delete
132
+
133
+
For example, to delete the model created and modified in the previous sections:
0 commit comments