Skip to content

Commit 3e8721c

Browse files
committed
gpt-4o-mini
1 parent 007bc69 commit 3e8721c

File tree

6 files changed

+10
-11
lines changed

6 files changed

+10
-11
lines changed

docs/docs/architecture.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@ In this workshop, you will create the Contoso Sales Agent: a conversational agen
88

99
This agent is built on Microsoft Azure services.
1010

11-
- **Generative AI model**: The underlying LLM powering this app is the [Azure OpenAI gpt-4o](https://learn.microsoft.com/azure/ai-services/openai/concepts/models?tabs=global-standard%2Cstandard-chat-completions#gpt-4o-and-gpt-4-turbo){:target="_blank"} LLM.
11+
- **Generative AI model**: The underlying LLM powering this app is the [Azure OpenAI gpt-4o-mini](https://learn.microsoft.com/azure/ai-services/openai/concepts/models?tabs=global-standard%2Cstandard-chat-completions#gpt-4o-mini-and-gpt-4-turbo){:target="_blank"} LLM.
1212

1313
- **Vector Store**: We will provide the agent with product information as a PDF file to support its queries. The agent will use the "basic agent setup" of the [Foundry Agent Service file search tool](https://learn.microsoft.com/azure/ai-services/agents/how-to/tools/file-search?tabs=python&pivots=overview){:target="_blank"} to find relevant portions of the document with vector search and provide them to the agent as context.
1414

docs/docs/includes/introduction-event.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -180,7 +180,7 @@ Next, we log in to Azure AI Foundry to retrieve the project endpoint, which the
180180
3. Add the **Model deployment name** to the user secrets.
181181
182182
```powershell
183-
dotnet user-secrets set "Azure:ModelName" "gpt-4o"
183+
dotnet user-secrets set "Azure:ModelName" "gpt-4o-mini"
184184
```
185185
186186
4. Add the **Bing connection ID** to the user secrets for grounding with Bing search.

docs/docs/includes/introduction-self-guided.md

Lines changed: 5 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -84,10 +84,9 @@ The following resources will be created in the **rg-agent-workshop-****** resour
8484
8585
- An **Azure AI Foundry hub** named **foundry-******
8686
- An **Azure AI Foundry project** named **project-******
87-
- A **Serverless (pay-as-you-go) GPT-4o model deployment** named **gpt-4o**. See pricing details [here](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/){:target="_blank"}.
88-
- A **Grounding with Bing Search** resource. See the [documentation](https://learn.microsoft.com/azure/ai-services/agents/how-to/tools/bing-grounding) and [pricing](https://www.microsoft.com/en-us/bing/apis/grounding-pricing){:target="_blank"} for details.
87+
- A **Serverless (pay-as-you-go) GPT-4o model deployment** named **gpt-4o-mini**. See pricing details [here](https://azure.microsoft.com/pricing/details/cognitive-services/openai-service/){:target="_blank"}.
8988
90-
!!! warning "You will need 140K TPM quota availability for the gpt-4o Global Standard SKU, not because the agent uses lots of tokens, but due to the frequency of calls made by the agent to the model. Review your quota availability in the [AI Foundry Management Center](https://ai.azure.com/managementCenter/quota){:target="_blank"}."
89+
!!! warning "You will need 140K TPM quota availability for the gpt-4o-mini Global Standard SKU, not because the agent uses lots of tokens, but due to the frequency of calls made by the agent to the model. Review your quota availability in the [AI Foundry Management Center](https://ai.azure.com/managementCenter/quota){:target="_blank"}."
9190
9291
We have provided a bash script to automate the deployment of the resources required for the workshop. Alternatively, you may deploy resources manually using Azure AI Foundry studio. Select the desired tab.
9392
@@ -146,11 +145,11 @@ We have provided a bash script to automate the deployment of the resources requi
146145
3. From **My assets**, select **Models + endpoints**.
147146
4. Select **Deploy Model / Deploy Base Model**.
148147

149-
- Select **gpt-4o** from the model list, then select **Confirm**.
148+
- Select **gpt-4o-mini** from the model list, then select **Confirm**.
150149
- Name the deployment
151150

152151
```text
153-
gpt-4o
152+
gpt-4o-mini
154153
```
155154

156155
- Deployment type: Select **Global Standard**.
@@ -197,7 +196,7 @@ We have provided a bash script to automate the deployment of the resources requi
197196
4. Run the following command to set the [ASP.NET Core safe secret](https://learn.microsoft.com/aspnet/core/security/app-secrets){:target="_blank"} for the model deployment name:
198197
199198
```bash
200-
dotnet user-secrets set "Azure:ModelName" "gpt-4o" --project "$CSHARP_PROJECT_PATH"
199+
dotnet user-secrets set "Azure:ModelName" "gpt-4o-mini" --project "$CSHARP_PROJECT_PATH"
201200
```
202201
203202
5. Add the **Bing connection ID** to the user secrets for grounding with Bing search.

infra/deploy.ps1

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@ Write-Host "Deploying the Azure resources..."
22

33
# Define resource group parameters
44
$RG_LOCATION = "eastus"
5-
$MODEL_NAME = "gpt-4o"
5+
$MODEL_NAME = "gpt-4o-mini"
66
$MODEL_VERSION = "2024-11-20"
77
$AI_PROJECT_FRIENDLY_NAME = "Agent Service Workshop"
88
$MODEL_CAPACITY = 140

infra/deploy.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ echo "Deploying the Azure resources..."
44

55
# Define resource group parameters
66
RG_LOCATION="eastus"
7-
MODEL_NAME="gpt-4o"
7+
MODEL_NAME="gpt-4o-mini"
88
MODEL_VERSION="2024-11-20"
99
AI_PROJECT_FRIENDLY_NAME="Agent Service Workshop"
1010
MODEL_CAPACITY=140

infra/main.bicep

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@ param aiProjectDescription string = 'A standard project resource required for th
1717
param tags object = {}
1818

1919
@description('Model name for deployment')
20-
param modelName string = 'gpt-4o'
20+
param modelName string = 'gpt-4o-mini'
2121

2222
@description('Model format for deployment')
2323
param modelFormat string = 'OpenAI'

0 commit comments

Comments
 (0)