Skip to content

Commit 70530b4

Browse files
Merge pull request #2750 from voutilad/ft-ptum-pup
Announce public preview of PTU-M for fine-tuned models.
2 parents 2cf499b + 0b82013 commit 70530b4

File tree

1 file changed

+80
-1
lines changed

1 file changed

+80
-1
lines changed

articles/ai-services/openai/how-to/fine-tuning.md

Lines changed: 80 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -44,19 +44,98 @@ We use LoRA, or low rank approximation, to fine-tune models in a way that reduce
4444

4545
::: zone-end
4646

47-
## Global Standard (preview)
47+
## Other Deployment Types
4848

4949
Azure OpenAI fine-tuning supports [global standard deployments](./deployment-types.md#global-standard) in East US2, North Central US, and Sweden Central for:
5050

5151
- `gpt-4o-mini-2024-07-18`
5252
- `gpt-4o-2024-08-06`
5353

54+
And supports [regional provisioned managed](./deployment-types.md#provisioned) in North Central US and Switzerland West for:
55+
56+
- `gpt-4o-mini-2024-07-18`
57+
- `gpt-4o-2024-08-06`
58+
59+
### Global Standard (preview)
60+
5461
Global standard fine-tuned deployments offer [cost savings](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/), but custom model weights may temporarily be stored outside the geography of your Azure OpenAI resource.
5562

5663
:::image type="content" source="../media/fine-tuning/global-standard.png" alt-text="Screenshot of the global standard deployment user experience with a fine-tuned model." lightbox="../media/fine-tuning/global-standard.png":::
5764

5865
Global Standard fine-tuning deployments currently do not support vision and structured outputs.
5966

67+
### Provisioned Managed (preview)
68+
69+
Provisioned managed fine-tuned deployments offer [predictable performance](../concepts/provisioned-throughput.md#what-do-the-provisioned-deployment-types-provide) for fine-tuned deployments. As part of public preview, provisioned managed deployments may be created regionally via the data-plane [REST API](../reference.md#data-plane-inference) version `2024-10-01` or newer. See below for examples.
70+
71+
Provisioned Managed fine-tuning deployments currently do not support vision and structured outputs.
72+
73+
#### Creating a Provisioned Managed deployment
74+
75+
To create a new deployment, make an HTTP PUT call via the [Deployments - Create or Update REST API](/rest/api/aiservices/accountmanagement/deployments/create-or-update?view=rest-aiservices-accountmanagement-2024-10-01&tabs=HTTP&preserve-view=true). The approach is similar to performing [cross region deployment](#cross-region-deployment) with the following exceptions:
76+
77+
- You must provide a sku name of `ProvisionedStandard`.
78+
- The capacity must be declared in PTUs.
79+
- The `api-version` must be `2024-10-01` or newer.
80+
- The HTTP method should be `PUT`.
81+
82+
For example, to deploy a gpt-4o-mini model:
83+
84+
```bash
85+
curl -X PUT "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.CognitiveServices/accounts/<RESOURCE_NAME>/deployments/<MODEL_DEPLOYMENT_NAME>api-version=2024-10-01" \
86+
-H "Authorization: Bearer <TOKEN>" \
87+
-H "Content-Type: application/json" \
88+
-d '{
89+
"sku": {"name": "ProvisionedStandard", "capacity": 25},
90+
"properties": {
91+
"model": {
92+
"format": "OpenAI",
93+
"name": "gpt-4omini-ft-model-name",
94+
"version": "1",
95+
"source": "/subscriptions/00000000-0000-0000-0000-000000000000/resourceGroups/{SourceResourceGroupName}/providers/Microsoft.CognitiveServices/accounts/{SourceAOAIAccountName}"
96+
}
97+
}
98+
}'
99+
```
100+
101+
#### Scaling a fine-tuned model on Provisioned Managed
102+
103+
To scale a fine-tuned provision managed deployment to increase or decrease PTU capacity, perform the same `PUT` REST API call as you did when [creating the deployment](#creating-a-provisioned-managed-deployment) and provide an updated `capacity` value for the `sku`. Keep in mind, provisioned deployments must scale in [minimum increments](../concepts/provisioned-throughput.md#how-much-throughput-per-ptu-you-get-for-each-model).
104+
105+
For example, to scale the model deployed in the previous section from 25 to 40 PTU, make another `PUT` call and increase the capacity:
106+
107+
```bash
108+
curl -X PUT "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.CognitiveServices/accounts/<RESOURCE_NAME>/deployments/<MODEL_DEPLOYMENT_NAME>api-version=2024-10-01" \
109+
-H "Authorization: Bearer <TOKEN>" \
110+
-H "Content-Type: application/json" \
111+
-d '{
112+
"sku": {"name": "ProvisionedStandard", "capacity": 40},
113+
"properties": {
114+
"model": {
115+
"format": "OpenAI",
116+
"name": "gpt-4omini-ft-model-name",
117+
"version": "1",
118+
"source": "/subscriptions/00000000-0000-0000-0000-000000000000/resourceGroups/{SourceResourceGroupName}/providers/Microsoft.CognitiveServices/accounts/{SourceAOAIAccountName}"
119+
}
120+
}
121+
}'
122+
```
123+
124+
#### Deleting a Provisioned Managed deployment
125+
126+
To delete a deployment, use the [Deployments - Delete REST API(/rest/api/aiservices/accountmanagement/deployments/delete?view=rest-aiservices-accountmanagement-2024-10-01&tabs=HTTP&preserve-view=true) and send an HTTP DELETE to the deployment resource. Like with creating deployments, you must include the following parameters:
127+
128+
- Azure subscription id
129+
- Azure resource group name
130+
- Azure OpenAI resource name
131+
- Name of the PTU deployment to delete
132+
133+
For example, to delete the model created and modified in the previous sections:
134+
135+
```bash
136+
curl -X DELETE "https://management.azure.com/subscriptions/<SUBSCRIPTION>/resourceGroups/<RESOURCE_GROUP>/providers/Microsoft.CognitiveServices/accounts/<RESOURCE_NAME>/deployments/<MODEL_DEPLOYMENT_NAME>api-version=2024-10-01" \
137+
-H "Authorization: Bearer <TOKEN>"
138+
```
60139

61140
## Vision fine-tuning
62141

0 commit comments

Comments
 (0)