Skip to content

Commit af483e7

Browse files
committed
Updated search-sku-manage-costs.md
1 parent ec87fc6 commit af483e7

File tree

1 file changed

+57
-25
lines changed

1 file changed

+57
-25
lines changed

articles/search/search-sku-manage-costs.md

Lines changed: 57 additions & 25 deletions
Original file line numberDiff line numberDiff line change
@@ -8,17 +8,13 @@ ms.author: haileytapia
88
ms.service: azure-ai-search
99
ms.custom:
1010
- ignite-2023
11-
ms.topic: conceptual
12-
ms.date: 06/06/2025
11+
ms.topic: how-to
12+
ms.date: 06/09/2025
1313
---
1414

1515
# Plan and manage costs of an Azure AI Search service
1616

17-
This article explains how Azure AI Search is billed and provides tips for planning, estimating, monitoring, and minimizing costs.
18-
19-
As a first step, use the [Azure pricing calculator](https://azure.microsoft.com/pricing/calculator/) to estimate your baseline costs. You can also find estimated costs and tier comparisons on the [Select a pricing tier](search-create-service-portal.md#choose-a-tier) page during service creation.
20-
21-
Through built-in cost management that cuts across service boundaries, Azure provides inclusive cost monitoring and the ability to set budgets and define alerts. The costs of running a search service vary depending on capacity and the features used. After you create a search service, optimize capacity to pay for only what you need.
17+
This article explains how Azure AI Search is billed and provides tips for estimating, planning, monitoring, and minimizing costs. The cost of running a search service varies based on capacity and the features used. After you create a search service, optimize its capacity so that you only pay for what you need.
2218

2319
> [!NOTE]
2420
> Higher-capacity partitions are available at the same billing rate on newer services created after April and May 2024. For more information about partition size upgrades, see [Service limits](search-limits-quotas-capacity.md#service-limits).
@@ -27,7 +23,7 @@ Through built-in cost management that cuts across service boundaries, Azure prov
2723

2824
## Understand the billing model
2925

30-
Azure AI Search runs on Azure infrastructure that accrues costs when you deploy new resources. Note that other infrastructure costs might also accrue.
26+
Azure AI Search runs on Azure infrastructure that accrues costs when you deploy new resources. Other infrastructure costs might also accrue.
3127

3228
### How you're charged for Azure AI Search
3329

@@ -43,10 +39,10 @@ The following table lists premium features and their billing units. All of these
4339

4440
| Feature | Unit |
4541
|-------|------|
46-
| Image extraction (AI enrichment) <sup>1</sup> | Per 1000 images. See the [pricing page](https://azure.microsoft.com/pricing/details/search/#pricing). |
47-
| [Custom Entity Lookup skill](cognitive-search-skill-custom-entity-lookup.md) (AI enrichment) | Per 1000 text records. See the [pricing page](https://azure.microsoft.com/pricing/details/search/#pricing) |
48-
| [Built-in skills](cognitive-search-predefined-skills.md) (AI enrichment) | Number of transactions, billed at the same rate as calling Azure AI services directly. You can process 20 documents per indexer per day for free. Larger or more frequent workloads require an Azure AI services multi-service resource key. |
49-
| [Semantic ranker](semantic-search-overview.md) | Number of queries of `queryType=semantic`, billed at a progressive rate. See the [pricing page](https://azure.microsoft.com/pricing/details/search/#pricing). |
42+
| Image extraction (AI enrichment) <sup>1</sup> | Per 1,000 images. See the [pricing page](https://azure.microsoft.com/pricing/details/search/#pricing). |
43+
| [Custom Entity Lookup skill](cognitive-search-skill-custom-entity-lookup.md) (AI enrichment) | Per 1,000 text records. See the [pricing page](https://azure.microsoft.com/pricing/details/search/#pricing) |
44+
| [Built-in skills](cognitive-search-predefined-skills.md) (AI enrichment) | Number of transactions. Billed at the same rate as calling Azure AI services directly. You can process 20 documents per indexer per day for free. Larger or more frequent workloads require an Azure AI services multi-service resource key. |
45+
| [Semantic ranker](semantic-search-overview.md) | Number of queries of `queryType=semantic`. Billed at a progressive rate. See the [pricing page](https://azure.microsoft.com/pricing/details/search/#pricing). |
5046
| [Shared private link](search-indexer-howto-access-private.md) | [Billed for bandwidth](https://azure.microsoft.com/pricing/details/private-link/) as long as the shared private link exists and is used. |
5147

5248
<sup>1</sup> Refers to images extracted from a file within the indexer pipeline. Text extraction is free. Image extraction is billed when you [enable the `indexAction` parameter](cognitive-search-concept-image-scenarios.md#configure-indexers-for-image-processing) for document cracking or when you call the [Document Extraction skill](cognitive-search-skill-document-extraction.md).
@@ -59,40 +55,76 @@ The following table lists premium features and their billing units. All of these
5955

6056
+ [Customer-managed keys](search-security-manage-encryption-keys.md), which provide double encryption of sensitive content, require a billable [Azure Key Vault](https://azure.microsoft.com/pricing/details/key-vault/).
6157

62-
+ A skillset can include [billable built-in skills](cognitive-search-predefined-skills.md), non-billable built-in utility skills, and custom skills. Non-billable utility skills include [Conditional](cognitive-search-skill-conditional.md), [Shaper](cognitive-search-skill-shaper.md), [Text Merge](cognitive-search-skill-textmerger.md), and [Text Split](cognitive-search-skill-textsplit.md). They don't have an API key requirement or 20-document limit.
58+
+ A skillset can include [billable built-in skills](cognitive-search-predefined-skills.md), nonbillable built-in utility skills, and custom skills. Nonbillable utility skills include [Conditional](cognitive-search-skill-conditional.md), [Shaper](cognitive-search-skill-shaper.md), [Text Merge](cognitive-search-skill-textmerger.md), and [Text Split](cognitive-search-skill-textsplit.md). They don't have an API key requirement or 20-document limit.
6359

6460
+ A custom skill is functionality you provide. Custom skills are only billable if they call other billable services. They don't have an API key requirement or 20-document limit.
6561

6662
> [!NOTE]
6763
> You aren't billed for the number of full-text or vector queries, query responses, or documents ingested, but [service limits](search-limits-quotas-capacity.md) apply to each pricing tier.
6864
65+
## Estimate and plan costs
66+
67+
Effective cost planning starts before you create an Azure AI Search service. Use the [pricing calculator](https://azure.microsoft.com/pricing/calculator/) to estimate your baseline costs. You can also compare the features and costs of each tier on the [Select a pricing tier](search-create-service-portal.md#choose-a-tier) page during service creation.
68+
69+
For initial testing, we reccomend that you index 1–5% of your representative content. Include any OCR, embeddings, or enrichment skills you plan to use. Measure the resulting index size, throughput, and costs, and then extrapolate them to estimate full-scale requirements. This approach helps you understand the index-to-source ratio and the impact of enrichment or vector features on both [capacity](search-capacity-planning.md) and cost.
70+
71+
<!-- If you're using [skillsets](cognitive-search-working-with-skillsets.md) with AI enrichment, computer vision, embedding requests, custom skills, or any other transformation or external service, each skill runs on a separate meter and has its own pricing. Review the pricing for each skill you plan to use. Enable incremental enrichment so unchanged skills are skipped after the first run, when applicable.
72+
73+
When planning, consider:
74+
75+
+ Indexing and enrichment volume. Estimate the frequency and volume of data ingestion and enrichment, as these drive both compute and potential AI service charges.
76+
77+
+ Data transfer and storage. Account for networking and storage costs, especially if your solution spans multiple regions or uses additional Azure resources.
78+
79+
+ Service limits. Review [service limits](search-limits-quotas-capacity.md) for your chosen tier to avoid unplanned scale-ups or throttling.
80+
81+
+ Skillset and enrichment metering. If you are using AI enrichment, image extraction, computer vision, embedding requests, custom skills, or any other transformation or external service, be aware that each skill runs on a separate meter and may have its own pricing. Review the pricing for each skill you plan to use. Enable incremental enrichment so unchanged skills are skipped after the first run, when applicable. -->
82+
6983
## Monitor costs
7084

71-
Cost management is built into the Azure infrastructure. Review [Billing and cost management](/azure/cost-management-billing/cost-management-billing-overview) for more information about tracking costs, tools, and APIs.
85+
At the Azure AI Search level, you can [monitor built-in metrics](search-monitor-queries.md) for queries per second (QPS), search latency, throttled queries, and index size. You can then [create an Azure Monitor dashboard](/azure-monitor/visualize/tutorial-logs-dashboards) that overlays QPS, latency, and cost data to determine when to add or remove replicas.
86+
87+
At the subscription or resource group level, [Cost Management](/azure/cost-management-billing/costs/overview-cost-management) provides tools to track, analyze, and control your costs. You can use Cost Management to:
88+
89+
+ [Create budgets](/azure/cost-management-billing/costs/tutorial-acm-create-budgets) that define and track progress against spending limits. For more granular monitoring, customize your budgets using [filters](/azure/cost-management-billing/costs/group-filter) for specific Azure resources or services. Filters prevent you from accidentally creating resources that incur extra costs.
90+
91+
+ [Create alerts](/azure/cost-management-billing/costs/cost-mgt-alerts-monitor-usage-spending) that automatically notify stakeholders of spending anomalies or overspending risks. Alerts are based on spending compared to budget and cost thresholds. Both budgets and alerts are created for subscriptions and resource groups, making them useful for monitoring overall costs.
92+
93+
+ [Export cost data](/azure/cost-management-billing/costs/tutorial-export-acm-data) to an Azure Storage account. This is helpful when you or others need to perform additional cost analysis. For example, a finance team can analyze the data using Excel or Power BI. You can export your costs on a daily, weekly, or monthly schedule and set a custom date range. Exporting cost data is the recommended method for retrieving cost datasets.
7294

7395
## Minimize costs
7496

75-
Follow these guidelines to minimize costs of an Azure AI Search solution.
97+
To minimize the costs of your Azure AI Search solution, use the following strategies across the service lifecycle.
98+
99+
### Deployment and configuration
100+
101+
+ Create a search service in a [region with more storage per partition](search-limits-quotas-capacity.md#service-limits).
102+
103+
+ Create all related Azure resources in the same region (or as few regions as possible) to minimize or eliminate bandwidth charges.
104+
105+
+ Choose the lightest [pricing tier](search-sku-tier.md) that meets your needs. Basic and S1 offer full access to the modern API at the lowest hourly rate per SU.
106+
107+
+ Use [Azure Web App](/azure/app-service/overview) for your front-end application to keep requests and responses within the data center boundary.
76108

77-
1. If possible, create a search service [in a region that has more storage per partition](search-limits-quotas-capacity.md#service-limits). If you're using multiple Azure resources in your solution, create them in the same region, or in as few regions as possible, to minimize or eliminate bandwidth charges.
109+
### Scaling
78110

79-
1. [Scale up](search-capacity-planning.md) for resource-intensive operations like indexing, and then readjust downwards for regular query workloads. If there are predictable patterns to your workloads, you might be able to synchronize scale up to coincide with the expected volume (you would need to write code to automate this).
111+
+ [Add partitions](search-capacity-planning.md#add-or-remove-partitions-and-replicas) only when the index size or ingestion throughput requires it.
80112

81-
When estimating the cost of a search solution, keep in mind that pricing and capacity aren't linear (doubling capacity more than doubles the cost on the same tier). Also, at some point, switching up to a higher tier can give you better and faster performance at roughly the same price point. For more information and an example, see [Switch to a Standard S2 tier](search-performance-tips.md#tip-switch-to-a-standard-s2-tier).
113+
+ [Add replicas](search-capacity-planning.md#add-or-remove-partitions-and-replicas) only when your queries per second increase, when complex queries are throttling your service, or when high availability is required.
82114

83-
1. Consider [Azure Web App](/azure/app-service/overview) for your front-end application so that requests and responses stay within the data center boundary.
115+
+ Scale up for resource-intensive operations, such as indexing, and then readjust downwards for regular query workloads.
84116

85-
1. If you're using [AI enrichment](cognitive-search-concept-intro.md), there's an extra charge for blob storage, but the cumulative cost goes down if you enable [enrichment caching](cognitive-search-incremental-indexing-conceptual.md).
117+
+ Write code to automate scaling for predictable workload patterns.
86118

87-
## Create budgets
119+
+ Remember that capacity and pricing aren't linear. Doubling capacity more than doubles costs on the same tier. For better performance at a similar price, consider [switching to a higher tier](search-performance-tips.md#tip-switch-to-a-standard-s2-tier).
88120

89-
You can create [budgets](/azure/cost-management-billing/costs/tutorial-acm-create-budgets?WT.mc_id=costmanagementcontent_docsacmhorizontal_-inproduct-learn) to manage costs and create [alerts](/azure/cost-management-billing/costs/cost-mgt-alerts-monitor-usage-spending?WT.mc_id=costmanagementcontent_docsacmhorizontal_-inproduct-learn) that automatically notify stakeholders of spending anomalies and overspending risks. Alerts are based on spending compared to budget and cost thresholds. Budgets and alerts are created for Azure subscriptions and resource groups, so they're useful as part of an overall cost monitoring strategy.
121+
### Indexing and enrichment
90122

91-
Budgets can be created with filters for specific resources or services in Azure if you want more granularity present in your monitoring. Filters help ensure that you don't accidentally create new resources that cost you extra money. For more information about the filter options available when you create a budget, see [Group and filter options](/azure/cost-management-billing/costs/group-filter?WT.mc_id=costmanagementcontent_docsacmhorizontal_-inproduct-learn).
123+
+ Use [incremental indexing](search-howto-reindex.md) to process only new or changed data.
92124

93-
## Export cost data
125+
+ Enable [enrichment caching](cognitive-search-incremental-indexing-conceptual.md) to reduce [AI enrichment](cognitive-search-concept-intro.md) costs. Although caching incurs a charge for Azure Blob Storage, it lowers the cumulative enrichment cost because storage is cheaper than image extraction and AI processing.
94126

95-
You can also [export your cost data](/azure/cost-management-billing/costs/tutorial-export-acm-data?WT.mc_id=costmanagementcontent_docsacmhorizontal_-inproduct-learn) to a storage account. This is helpful when you need or others to do more data analysis for costs. For example, a finance team can analyze the data using Excel or Power BI. You can export your costs on a daily, weekly, or monthly schedule and set a custom date range. Exporting cost data is the recommended way to retrieve cost datasets.
127+
+ Keep vector payloads compact. For vector search, see the [vector compression best practices](https://techcommunity.microsoft.com/blog/azure-ai-services-blog/azure-ai-search-cut-vector-costs-up-to-92-5-with-new-compression-techniques/4404866).
96128

97129
## FAQ
98130

0 commit comments

Comments
 (0)