Skip to content

Commit d53ad79

Browse files
author
Harmanpreet Kaur
committed
included gpt-4.1
1 parent 35b470a commit d53ad79

File tree

10 files changed

+19
-17
lines changed

10 files changed

+19
-17
lines changed

.github/workflows/deploy.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -120,7 +120,7 @@ jobs:
120120
environmentName="${{ env.SOLUTION_PREFIX }}" \
121121
secondaryLocation="northcentralus" \
122122
deploymentType="GlobalStandard" \
123-
gptModelName="gpt-4o" \
123+
gptModelName="gpt-4.1" \
124124
azureOpenaiAPIVersion="2024-05-01-preview" \
125125
gptDeploymentCapacity=${{ env.GPT_MIN_CAPACITY }} \
126126
embeddingModel="text-embedding-ada-002" \

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -99,7 +99,7 @@ _Note: This is not meant to outline all costs as selected SKUs, scaled use, cust
9999
| [Azure AI Search](https://learn.microsoft.com/en-us/azure/search/) | Standard tier, S1. Pricing is based on the number of documents and operations. Information retrieval at scale for vector and text content in traditional or generative search scenarios. | [Pricing](https://azure.microsoft.com/pricing/details/search/) |
100100
| [Azure Storage Account](https://learn.microsoft.com/en-us/azure/storage/blobs/) | Standard tier, LRS. Pricing is based on storage and operations. Blob storage in the clopud, optimized for storing massive amounts of unstructured data. | [Pricing](https://azure.microsoft.com/pricing/details/storage/blobs/) |
101101
| [Azure Key Vault](https://learn.microsoft.com/en-us/azure/key-vault/) | Standard tier. Pricing is based on the number of operations. Maintain keys that access and encrypt your cloud resources, apps, and solutions. | [Pricing](https://azure.microsoft.com/pricing/details/key-vault/) |
102-
| [Azure AI Services](https://learn.microsoft.com/en-us/azure/ai-services/) | S0 tier, defaults to gpt-4o and text-embedding-ada-002 models. Pricing is based on token count. | [Pricing](https://azure.microsoft.com/pricing/details/cognitive-services/) |
102+
| [Azure AI Services](https://learn.microsoft.com/en-us/azure/ai-services/) | S0 tier, defaults to gpt-4.1 and text-embedding-ada-002 models. Pricing is based on token count. | [Pricing](https://azure.microsoft.com/pricing/details/cognitive-services/) |
103103
| [Azure Container App](https://learn.microsoft.com/en-us/azure/container-apps/) | Consumption tier with 0.5 CPU, 1GiB memory/storage. Pricing is based on resource allocation, and each month allows for a certain amount of free usage. Allows you to run containerized applications without worrying about orchestration or infrastructure. | [Pricing](https://azure.microsoft.com/pricing/details/container-apps/) |
104104
| [Azure Container Registry](https://learn.microsoft.com/en-us/azure/container-registry/) | Basic tier. Build, store, and manage container images and artifacts in a private registry for all types of container deployments | [Pricing](https://azure.microsoft.com/pricing/details/container-registry/) |
105105
| [Log analytics](https://learn.microsoft.com/en-us/azure/azure-monitor/) | Pay-as-you-go tier. Costs based on data ingested. Collect and analyze on telemetry data generated by Azure. | [Pricing](https://azure.microsoft.com/pricing/details/monitor/) |

docs/AzureGPTQuotaSettings.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,6 +5,6 @@
55
3. **Go to** the `Management Center` from the bottom-left navigation menu.
66
4. Select `Quota`
77
- Click on the `GlobalStandard` dropdown.
8-
- Select the required **GPT model** (`GPT-4, GPT-4o`) or **Embeddings model** (`text-embedding-ada-002`).
8+
- Select the required **GPT model** (`GPT-4, GPT-4o,GPT-4.1`) or **Embeddings model** (`text-embedding-ada-002`).
99
- Choose the **region** where the deployment is hosted.
1010
5. Request More Quota or delete any unused model deployments as needed.

docs/CustomizingAzdParameters.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -18,10 +18,10 @@ Change the Model Deployment Type (allowed values: Standard, GlobalStandard)
1818
azd env set AZURE_ENV_MODEL_DEPLOYMENT_TYPE Standard
1919
```
2020

21-
Set the Model Name (allowed values: gpt-4, gpt-4o)
21+
Set the Model Name (allowed values: gpt-4, gpt-4o,gpt-4.1)
2222

2323
```shell
24-
azd env set AZURE_ENV_MODEL_NAME gpt-4o
24+
azd env set AZURE_ENV_MODEL_NAME gpt-4.1
2525
```
2626

2727
Change the Model Capacity (choose a number based on available GPT model capacity in your subscription)

docs/DeploymentGuide.md

Lines changed: 4 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -103,7 +103,7 @@ When you start the deployment, most parameters will have **default values**, but
103103
| **Environment Name** | A **3-20 character alphanumeric value** used to generate a unique ID to prefix the resources. | byctemplate |
104104
| **Secondary Location** | A **less busy** region for **CosmosDB**, useful in case of availability constraints. | eastus2 |
105105
| **Deployment Type** | Select from a drop-down list. | Global Standard |
106-
| **GPT Model** | Choose from **gpt-4, gpt-4o** | gpt-4o |
106+
| **GPT Model** | Choose from **gpt-4, gpt-4o , gpt-4.1** | gpt-4.1 |
107107
| **GPT Model Deployment Capacity** | Configure capacity for **GPT models**. | 30k |
108108
| **Embedding Model** | Default: **text-embedding-ada-002**. | text-embedding-ada-002 |
109109
| **Embedding Model Capacity** | Set the capacity for **embedding models**. | 80k |
@@ -114,8 +114,9 @@ When you start the deployment, most parameters will have **default values**, but
114114
<details>
115115
<summary><b>[Optional] Quota Recommendations</b></summary>
116116

117-
By default, the _Gpt-4o model capacity_ in deployment is set to _30k tokens_, so we recommend:
118-
- **For Global Standard | GPT-4o** - the capacity to at least 150k tokens post-deployment for optimal performance.
117+
By default, the _Gpt-4.1 model capacity_ in deployment is set to _30k tokens_, so we recommend:
118+
- **For Global Standard | GPT-4.1** - the capacity to at least 150k tokens post-deployment for optimal performance.
119+
- **For Global Standard | GPT-4.0** - the capacity to at least 150k tokens post-deployment for optimal performance.
119120

120121
- **For Standard | GPT-4** - ensure a minimum of 30k–40k tokens for best results.
121122

docs/QuotaCheck.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,8 @@
11
## Check Quota Availability Before Deployment
22

33
Before deploying the accelerator, **ensure sufficient quota availability** for the required model.
4-
> **For Global Standard | GPT-4o - the capacity to at least 150k tokens post-deployment for optimal performance.**
4+
5+
> **For Global Standard | GPT-4o |GPT-4.1- the capacity to at least 150k tokens post-deployment for optimal performance.**
56
67
> **For Standard | GPT-4 - ensure a minimum of 30k–40k tokens for best results.**
78
@@ -13,7 +14,7 @@ azd auth login
1314

1415
### 📌 Default Models & Capacities:
1516
```
16-
gpt-4o:30, text-embedding-ada-002:80, gpt-4:30
17+
gpt-4.1:30, text-embedding-ada-002:80, gpt-4:30, gpt-4o:30
1718
```
1819
### 📌 Default Regions:
1920
```

infra/main.bicep

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -26,10 +26,10 @@ param secondaryLocation string
2626
param deploymentType string = 'GlobalStandard'
2727

2828
@description('Name of the GPT model to deploy:')
29-
param gptModelName string = 'gpt-4o'
29+
param gptModelName string = 'gpt-4.1'
3030

3131
@description('Version of the GPT model to deploy:')
32-
param gptModelVersion string = '2024-05-13'
32+
param gptModelVersion string = '2025-04-14'
3333

3434
param azureOpenaiAPIVersion string = '2024-05-01-preview'
3535

infra/main.json

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@
55
"_generator": {
66
"name": "bicep",
77
"version": "0.36.1.42791",
8-
"templateHash": "11172828768806624864"
8+
"templateHash": "5449809042324258772"
99
}
1010
},
1111
"parameters": {
@@ -41,14 +41,14 @@
4141
},
4242
"gptModelName": {
4343
"type": "string",
44-
"defaultValue": "gpt-4o",
44+
"defaultValue": "gpt-4.1",
4545
"metadata": {
4646
"description": "Name of the GPT model to deploy:"
4747
}
4848
},
4949
"gptModelVersion": {
5050
"type": "string",
51-
"defaultValue": "2024-05-13",
51+
"defaultValue": "2025-04-14",
5252
"metadata": {
5353
"description": "Version of the GPT model to deploy:"
5454
}

scripts/checkquota.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -32,7 +32,7 @@ echo "✅ Azure subscription set successfully."
3232

3333
# Define models and their minimum required capacities
3434
declare -A MIN_CAPACITY=(
35-
["OpenAI.Standard.gpt-4o"]=$GPT_MIN_CAPACITY
35+
["OpenAI.Standard.gpt-4.1"]=$GPT_MIN_CAPACITY
3636
["OpenAI.Standard.text-embedding-ada-002"]=$TEXT_EMBEDDING_MIN_CAPACITY
3737
)
3838

scripts/quota_check_params.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -47,7 +47,7 @@ log_verbose() {
4747
}
4848

4949
# Default Models and Capacities (Comma-separated in "model:capacity" format)
50-
DEFAULT_MODEL_CAPACITY="gpt-4o:30,text-embedding-ada-002:80,gpt-4:30"
50+
DEFAULT_MODEL_CAPACITY="gpt-4.1:30,text-embedding-ada-002:80,gpt-4:30,gpt-4o:30"
5151

5252
# Convert the comma-separated string into an array
5353
IFS=',' read -r -a MODEL_CAPACITY_PAIRS <<< "$DEFAULT_MODEL_CAPACITY"

0 commit comments

Comments
 (0)