You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/search/search-sku-manage-costs.md
+57-25Lines changed: 57 additions & 25 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,17 +8,13 @@ ms.author: haileytapia
8
8
ms.service: azure-ai-search
9
9
ms.custom:
10
10
- ignite-2023
11
-
ms.topic: conceptual
12
-
ms.date: 06/06/2025
11
+
ms.topic: how-to
12
+
ms.date: 06/09/2025
13
13
---
14
14
15
15
# Plan and manage costs of an Azure AI Search service
16
16
17
-
This article explains how Azure AI Search is billed and provides tips for planning, estimating, monitoring, and minimizing costs.
18
-
19
-
As a first step, use the [Azure pricing calculator](https://azure.microsoft.com/pricing/calculator/) to estimate your baseline costs. You can also find estimated costs and tier comparisons on the [Select a pricing tier](search-create-service-portal.md#choose-a-tier) page during service creation.
20
-
21
-
Through built-in cost management that cuts across service boundaries, Azure provides inclusive cost monitoring and the ability to set budgets and define alerts. The costs of running a search service vary depending on capacity and the features used. After you create a search service, optimize capacity to pay for only what you need.
17
+
This article explains how Azure AI Search is billed and provides tips for estimating, planning, monitoring, and minimizing costs. The cost of running a search service varies based on capacity and the features used. After you create a search service, optimize its capacity so that you only pay for what you need.
22
18
23
19
> [!NOTE]
24
20
> Higher-capacity partitions are available at the same billing rate on newer services created after April and May 2024. For more information about partition size upgrades, see [Service limits](search-limits-quotas-capacity.md#service-limits).
@@ -27,7 +23,7 @@ Through built-in cost management that cuts across service boundaries, Azure prov
27
23
28
24
## Understand the billing model
29
25
30
-
Azure AI Search runs on Azure infrastructure that accrues costs when you deploy new resources. Note that other infrastructure costs might also accrue.
26
+
Azure AI Search runs on Azure infrastructure that accrues costs when you deploy new resources. Other infrastructure costs might also accrue.
31
27
32
28
### How you're charged for Azure AI Search
33
29
@@ -43,10 +39,10 @@ The following table lists premium features and their billing units. All of these
43
39
44
40
| Feature | Unit |
45
41
|-------|------|
46
-
| Image extraction (AI enrichment) <sup>1</sup> | Per 1000 images. See the [pricing page](https://azure.microsoft.com/pricing/details/search/#pricing). |
47
-
|[Custom Entity Lookup skill](cognitive-search-skill-custom-entity-lookup.md) (AI enrichment) | Per 1000 text records. See the [pricing page](https://azure.microsoft.com/pricing/details/search/#pricing)|
48
-
|[Built-in skills](cognitive-search-predefined-skills.md) (AI enrichment) | Number of transactions, billed at the same rate as calling Azure AI services directly. You can process 20 documents per indexer per day for free. Larger or more frequent workloads require an Azure AI services multi-service resource key. |
49
-
|[Semantic ranker](semantic-search-overview.md)| Number of queries of `queryType=semantic`, billed at a progressive rate. See the [pricing page](https://azure.microsoft.com/pricing/details/search/#pricing). |
42
+
| Image extraction (AI enrichment) <sup>1</sup> | Per 1,000 images. See the [pricing page](https://azure.microsoft.com/pricing/details/search/#pricing). |
43
+
|[Custom Entity Lookup skill](cognitive-search-skill-custom-entity-lookup.md) (AI enrichment) | Per 1,000 text records. See the [pricing page](https://azure.microsoft.com/pricing/details/search/#pricing)|
44
+
|[Built-in skills](cognitive-search-predefined-skills.md) (AI enrichment) | Number of transactions. Billed at the same rate as calling Azure AI services directly. You can process 20 documents per indexer per day for free. Larger or more frequent workloads require an Azure AI services multi-service resource key. |
45
+
|[Semantic ranker](semantic-search-overview.md)| Number of queries of `queryType=semantic`. Billed at a progressive rate. See the [pricing page](https://azure.microsoft.com/pricing/details/search/#pricing). |
50
46
|[Shared private link](search-indexer-howto-access-private.md)|[Billed for bandwidth](https://azure.microsoft.com/pricing/details/private-link/) as long as the shared private link exists and is used. |
51
47
52
48
<sup>1</sup> Refers to images extracted from a file within the indexer pipeline. Text extraction is free. Image extraction is billed when you [enable the `indexAction` parameter](cognitive-search-concept-image-scenarios.md#configure-indexers-for-image-processing) for document cracking or when you call the [Document Extraction skill](cognitive-search-skill-document-extraction.md).
@@ -59,40 +55,76 @@ The following table lists premium features and their billing units. All of these
59
55
60
56
+[Customer-managed keys](search-security-manage-encryption-keys.md), which provide double encryption of sensitive content, require a billable [Azure Key Vault](https://azure.microsoft.com/pricing/details/key-vault/).
61
57
62
-
+ A skillset can include [billable built-in skills](cognitive-search-predefined-skills.md), non-billable built-in utility skills, and custom skills. Non-billable utility skills include [Conditional](cognitive-search-skill-conditional.md), [Shaper](cognitive-search-skill-shaper.md), [Text Merge](cognitive-search-skill-textmerger.md), and [Text Split](cognitive-search-skill-textsplit.md). They don't have an API key requirement or 20-document limit.
58
+
+ A skillset can include [billable built-in skills](cognitive-search-predefined-skills.md), nonbillable built-in utility skills, and custom skills. Nonbillable utility skills include [Conditional](cognitive-search-skill-conditional.md), [Shaper](cognitive-search-skill-shaper.md), [Text Merge](cognitive-search-skill-textmerger.md), and [Text Split](cognitive-search-skill-textsplit.md). They don't have an API key requirement or 20-document limit.
63
59
64
60
+ A custom skill is functionality you provide. Custom skills are only billable if they call other billable services. They don't have an API key requirement or 20-document limit.
65
61
66
62
> [!NOTE]
67
63
> You aren't billed for the number of full-text or vector queries, query responses, or documents ingested, but [service limits](search-limits-quotas-capacity.md) apply to each pricing tier.
68
64
65
+
## Estimate and plan costs
66
+
67
+
Effective cost planning starts before you create an Azure AI Search service. Use the [pricing calculator](https://azure.microsoft.com/pricing/calculator/) to estimate your baseline costs. You can also compare the features and costs of each tier on the [Select a pricing tier](search-create-service-portal.md#choose-a-tier) page during service creation.
68
+
69
+
For initial testing, we reccomend that you index 1–5% of your representative content. Include any OCR, embeddings, or enrichment skills you plan to use. Measure the resulting index size, throughput, and costs, and then extrapolate them to estimate full-scale requirements. This approach helps you understand the index-to-source ratio and the impact of enrichment or vector features on both [capacity](search-capacity-planning.md) and cost.
70
+
71
+
<!-- If you're using [skillsets](cognitive-search-working-with-skillsets.md) with AI enrichment, computer vision, embedding requests, custom skills, or any other transformation or external service, each skill runs on a separate meter and has its own pricing. Review the pricing for each skill you plan to use. Enable incremental enrichment so unchanged skills are skipped after the first run, when applicable.
72
+
73
+
When planning, consider:
74
+
75
+
+ Indexing and enrichment volume. Estimate the frequency and volume of data ingestion and enrichment, as these drive both compute and potential AI service charges.
76
+
77
+
+ Data transfer and storage. Account for networking and storage costs, especially if your solution spans multiple regions or uses additional Azure resources.
78
+
79
+
+ Service limits. Review [service limits](search-limits-quotas-capacity.md) for your chosen tier to avoid unplanned scale-ups or throttling.
80
+
81
+
+ Skillset and enrichment metering. If you are using AI enrichment, image extraction, computer vision, embedding requests, custom skills, or any other transformation or external service, be aware that each skill runs on a separate meter and may have its own pricing. Review the pricing for each skill you plan to use. Enable incremental enrichment so unchanged skills are skipped after the first run, when applicable. -->
82
+
69
83
## Monitor costs
70
84
71
-
Cost management is built into the Azure infrastructure. Review [Billing and cost management](/azure/cost-management-billing/cost-management-billing-overview) for more information about tracking costs, tools, and APIs.
85
+
At the Azure AI Search level, you can [monitor built-in metrics](search-monitor-queries.md) for queries per second (QPS), search latency, throttled queries, and index size. You can then [create an Azure Monitor dashboard](/azure-monitor/visualize/tutorial-logs-dashboards) that overlays QPS, latency, and cost data to determine when to add or remove replicas.
86
+
87
+
At the subscription or resource group level, [Cost Management](/azure/cost-management-billing/costs/overview-cost-management) provides tools to track, analyze, and control your costs. You can use Cost Management to:
88
+
89
+
+[Create budgets](/azure/cost-management-billing/costs/tutorial-acm-create-budgets) that define and track progress against spending limits. For more granular monitoring, customize your budgets using [filters](/azure/cost-management-billing/costs/group-filter) for specific Azure resources or services. Filters prevent you from accidentally creating resources that incur extra costs.
90
+
91
+
+[Create alerts](/azure/cost-management-billing/costs/cost-mgt-alerts-monitor-usage-spending) that automatically notify stakeholders of spending anomalies or overspending risks. Alerts are based on spending compared to budget and cost thresholds. Both budgets and alerts are created for subscriptions and resource groups, making them useful for monitoring overall costs.
92
+
93
+
+[Export cost data](/azure/cost-management-billing/costs/tutorial-export-acm-data) to an Azure Storage account. This is helpful when you or others need to perform additional cost analysis. For example, a finance team can analyze the data using Excel or Power BI. You can export your costs on a daily, weekly, or monthly schedule and set a custom date range. Exporting cost data is the recommended method for retrieving cost datasets.
72
94
73
95
## Minimize costs
74
96
75
-
Follow these guidelines to minimize costs of an Azure AI Search solution.
97
+
To minimize the costs of your Azure AI Search solution, use the following strategies across the service lifecycle.
98
+
99
+
### Deployment and configuration
100
+
101
+
+ Create a search service in a [region with more storage per partition](search-limits-quotas-capacity.md#service-limits).
102
+
103
+
+ Create all related Azure resources in the same region (or as few regions as possible) to minimize or eliminate bandwidth charges.
104
+
105
+
+ Choose the lightest [pricing tier](search-sku-tier.md) that meets your needs. Basic and S1 offer full access to the modern API at the lowest hourly rate per SU.
106
+
107
+
+ Use [Azure Web App](/azure/app-service/overview) for your front-end application to keep requests and responses within the data center boundary.
76
108
77
-
1. If possible, create a search service [in a region that has more storage per partition](search-limits-quotas-capacity.md#service-limits). If you're using multiple Azure resources in your solution, create them in the same region, or in as few regions as possible, to minimize or eliminate bandwidth charges.
109
+
### Scaling
78
110
79
-
1.[Scale up](search-capacity-planning.md) for resource-intensive operations like indexing, and then readjust downwards for regular query workloads. If there are predictable patterns to your workloads, you might be able to synchronize scale up to coincide with the expected volume (you would need to write code to automate this).
111
+
+[Add partitions](search-capacity-planning.md#add-or-remove-partitions-and-replicas) only when the index size or ingestion throughput requires it.
80
112
81
-
When estimating the cost of a search solution, keep in mind that pricing and capacity aren't linear (doubling capacity more than doubles the cost on the same tier). Also, at some point, switching up to a higher tier can give you better and faster performance at roughly the same price point. For more information and an example, see [Switch to a Standard S2 tier](search-performance-tips.md#tip-switch-to-a-standard-s2-tier).
113
+
+[Add replicas](search-capacity-planning.md#add-or-remove-partitions-and-replicas) only when your queries per second increase, when complex queries are throttling your service, or when high availability is required.
82
114
83
-
1. Consider [Azure Web App](/azure/app-service/overview)for your front-end application so that requests and responses stay within the data center boundary.
115
+
+ Scale up for resource-intensive operations, such as indexing, and then readjust downwards for regular query workloads.
84
116
85
-
1. If you're using [AI enrichment](cognitive-search-concept-intro.md), there's an extra charge for blob storage, but the cumulative cost goes down if you enable [enrichment caching](cognitive-search-incremental-indexing-conceptual.md).
117
+
+ Write code to automate scaling for predictable workload patterns.
86
118
87
-
## Create budgets
119
+
+ Remember that capacity and pricing aren't linear. Doubling capacity more than doubles costs on the same tier. For better performance at a similar price, consider [switching to a higher tier](search-performance-tips.md#tip-switch-to-a-standard-s2-tier).
88
120
89
-
You can create [budgets](/azure/cost-management-billing/costs/tutorial-acm-create-budgets?WT.mc_id=costmanagementcontent_docsacmhorizontal_-inproduct-learn) to manage costs and create [alerts](/azure/cost-management-billing/costs/cost-mgt-alerts-monitor-usage-spending?WT.mc_id=costmanagementcontent_docsacmhorizontal_-inproduct-learn) that automatically notify stakeholders of spending anomalies and overspending risks. Alerts are based on spending compared to budget and cost thresholds. Budgets and alerts are created for Azure subscriptions and resource groups, so they're useful as part of an overall cost monitoring strategy.
121
+
### Indexing and enrichment
90
122
91
-
Budgets can be created with filters for specific resources or services in Azure if you want more granularity present in your monitoring. Filters help ensure that you don't accidentally create new resources that cost you extra money. For more information about the filter options available when you create a budget, see [Group and filter options](/azure/cost-management-billing/costs/group-filter?WT.mc_id=costmanagementcontent_docsacmhorizontal_-inproduct-learn).
123
+
+ Use [incremental indexing](search-howto-reindex.md) to process only new or changed data.
92
124
93
-
## Export cost data
125
+
+ Enable [enrichment caching](cognitive-search-incremental-indexing-conceptual.md) to reduce [AI enrichment](cognitive-search-concept-intro.md) costs. Although caching incurs a charge for Azure Blob Storage, it lowers the cumulative enrichment cost because storage is cheaper than image extraction and AI processing.
94
126
95
-
You can also [export your cost data](/azure/cost-management-billing/costs/tutorial-export-acm-data?WT.mc_id=costmanagementcontent_docsacmhorizontal_-inproduct-learn) to a storage account. This is helpful when you need or others to do more data analysis for costs. For example, a finance team can analyze the data using Excel or Power BI. You can export your costs on a daily, weekly, or monthly schedule and set a custom date range. Exporting cost data is the recommended way to retrieve cost datasets.
127
+
+ Keep vector payloads compact. For vector search, see the [vector compression best practices](https://techcommunity.microsoft.com/blog/azure-ai-services-blog/azure-ai-search-cut-vector-costs-up-to-92-5-with-new-compression-techniques/4404866).
0 commit comments