Skip to content

Commit 5236449

Browse files
committed
Feedback
1 parent dca4e9b commit 5236449

File tree

1 file changed

+17
-17
lines changed

1 file changed

+17
-17
lines changed

articles/cosmos-db/mongodb/vcore/compute-storage.md

Lines changed: 17 additions & 17 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ ms.author: nlarin
77
ms.service: cosmos-db
88
ms.subservice: mongodb-vcore
99
ms.topic: conceptual
10-
ms.date: 06/30/2024
10+
ms.date: 07/07/2024
1111
---
1212

1313
# Compute and storage configurations for Azure Cosmos DB for MongoDB vCore clusters
@@ -77,7 +77,7 @@ For instance, if you need 8 TiB of storage per shard or more, make sure you sele
7777

7878
### Working set and memory considerations
7979

80-
In MongoDB, *the working set* refers to the portion of your data that is frequently accessed and used by your applications. It includes both the data and the indexes that are regularly read or written to during the application's typical operations. The concept of a working set is important for performance optimization because MongoDB, like many databases, performs best when the working set fits in RAM.
80+
In Azure Cosmos DB for MongoDB vCore, *the working set* refers to the portion of your data that is frequently accessed and used by your applications. It includes both the data and the indexes that are regularly read or written to during the application's typical operations. The concept of a working set is important for performance optimization because MongoDB, like many databases, performs best when the working set fits in RAM.
8181

8282
To define and understand your MongoDB database working set, consider the following components:
8383

@@ -89,7 +89,7 @@ By keeping the working set in RAM, you can minimize slower disk I/O operations,
8989

9090
### Choosing optimal configuration for a workload
9191

92-
Determining the right compute and storage configuration for your MongoDB workload involves evaluating several factors related to your application's requirements and usage patterns. The key steps and considerations to determine the optimal configuration include:
92+
Determining the right compute and storage configuration for your Azure Cosmos DB for MongoDB vCore workload involves evaluating several factors related to your application's requirements and usage patterns. The key steps and considerations to determine the optimal configuration include:
9393

9494
1. **Understand your workload**
9595
- **Data volume**: Estimate the total size of your data, including indexes.
@@ -98,7 +98,7 @@ Determining the right compute and storage configuration for your MongoDB workloa
9898
- **Concurrency**: Assess the number of concurrent operations your database needs to handle.
9999

100100
2. **Monitor current performance**
101-
- **Resource utilization**: Use MongoDB monitoring tools to track CPU, memory, disk I/O, and network usage before you move your MongoDB workload to Azure and [monitoring metrics](./how-to-monitor-diagnostics-logs.md) and MongoDB monitoring tools once you start running your MongoDB workload on an Azure Cosmos DB for MongoDB vCore cluster.
101+
- **Resource utilization**: Use monitoring tools to track CPU, memory, disk I/O, and network usage before you move your workload to Azure and [monitoring metrics](./how-to-monitor-diagnostics-logs.md) once you start running your MongoDB workload on an Azure Cosmos DB for MongoDB vCore cluster.
102102
- **Performance metrics**: Monitor key performance metrics such as latency, throughput, and cache hit ratios.
103103
- **Bottlenecks**: Identify any existing performance bottlenecks, such as high CPU usage, memory pressure, or slow disk I/O.
104104

@@ -107,38 +107,38 @@ Determining the right compute and storage configuration for your MongoDB workloa
107107
- **CPU**: Choose a CPU configuration that can handle your query load and concurrency requirements. CPU-intensive workloads may require more cores. Use 'CPU percent' metric with 'Max' aggregation on your Azure Cosmos DB for MongoDB vCore cluster to see historical compute usage patterns.
108108
- **Storage IOPS**: Select storage with sufficient IOPS to handle your read and write operations. Use 'IOPS' metric with 'Max' aggregation on your cluster to see historical storage IOPS usage.
109109
- **Network**: Ensure adequate network bandwidth to handle data transfer between your application and the database, especially for distributed setups. Make sure you configured host for your MongoDB application to support [accelerated networking](../../../virtual-network/accelerated-networking-overview.md) technologies such as SR-IOV.
110-
110+
111111
4. **Scale appropriately**
112112
- **Vertical scaling**: Scale compute / RAM up and down and scale storage up.
113-
- Compute: Increase the vCore / RAM on a cluster if your workload requires temporary increase or is often crossing over 90% of CPU utilization for prolonged periods.
114-
- Make sure you have appropriate data retention in your MongoDB database. Retention allows you to avoid unnecessary storage use. Monitor storage usage by setting alerts on the 'Storage percent' and/or 'Storage used' metrics with 'Max' aggregation. Consider increase storage as your workload size crosses 70% usage.
115-
- **Horizontal scaling**: Consider using multiple shards for your cluster to distribute your data across multiple MongoDB nodes for performance gains and better capacity management as your workload grows. This is especially useful for large datasets (over 2-4 TiB) and high-throughput applications.
116-
117-
6. **Test and iterate**
113+
- Compute: Increase the vCore / RAM on a cluster if your workload requires temporary increase or is often crossing over 70% of CPU utilization for prolonged periods.
114+
- Make sure you have appropriate data retention in your Azure Cosmos DB for MongoDB vCore database. Retention allows you to avoid unnecessary storage use. Monitor storage usage by setting alerts on the 'Storage percent' and/or 'Storage used' metrics with 'Max' aggregation. Consider increase storage as your workload size crosses 70% usage.
115+
- **Horizontal scaling**: Consider using multiple shards for your cluster to distribute your data across multiple Azure Cosmos DB for MongoDB vCore nodes for performance gains and better capacity management as your workload grows. This is especially useful for large datasets (over 2-4 TiB) and high-throughput applications.
116+
117+
5. **Test and iterate**
118118
- **Benchmarking**: Perform measurement for the most frequently used queries with different configurations to determine the impact on performance. Use CPU/RAM and IOPS metrics and application-level benchmarking.
119119
- **Load testing**: Conduct load testing to simulate production workloads and validate the performance of your chosen configuration.
120-
- **Continuous monitoring**: Continuously monitor your MongoDB deployment and adjust resources as needed based on changing workloads and usage patterns.
121-
120+
- **Continuous monitoring**: Continuously monitor your Azure Cosmos DB for MongoDB vCore deployment and adjust resources as needed based on changing workloads and usage patterns.
121+
122122
By systematically evaluating these factors and continuously monitoring and adjusting your configuration, you can ensure that your MongoDB deployment is well-optimized for your specific workload.
123123

124124
### Considerations for storage
125125

126-
Deciding on the appropriate storage size for your MongoDB workload involves several considerations to ensure optimal performance and scalability. Here are considerations for the storage size in Azure Cosmos DB for MongoDB vCore:
126+
Deciding on the appropriate storage size for your workload involves several considerations to ensure optimal performance and scalability. Here are considerations for the storage size in Azure Cosmos DB for MongoDB vCore:
127127

128128
1. **Estimate data size:**
129-
- Calculate the expected size of your MongoDB data. Consider:
130-
- **Current data size:** If migrating from an existing MongoDB instance.
129+
- Calculate the expected size of your Azure Cosmos DB for MongoDB vCore data. Consider:
130+
- **Current data size:** If migrating from an existing database.
131131
- **Growth rate:** Estimate how much data will be added over time.
132132
- **Document size and structure:** Understand your data schema and document sizes, as they affect storage efficiency.
133133

134134
2. **Factor in indexes:**
135-
- MongoDB uses **[indexes](./indexing.md)** for efficient querying. Indexes consume extra disk space.
135+
- Azure Cosmos DB for MongoDB vCore uses **[indexes](./indexing.md)** for efficient querying. Indexes consume extra disk space.
136136
- Estimate the size of indexes based on:
137137
- **Number of indexes**.
138138
- **Size of indexed fields**.
139139

140140
3. **Performance considerations:**
141-
- Disk performance impacts MongoDB operations, especially for workloads that can't fit their [working set](#working-set-and-memory-considerations) into RAM. Consider:
141+
- Disk performance impacts database operations, especially for workloads that can't fit their [working set](#working-set-and-memory-considerations) into RAM. Consider:
142142
- **I/O throughput:** IOPS, or Input/Output Operations Per Second, is the number of requests that are sent to storage disks in one second. The larger storage size comes with more IOPS. Ensure adequate throughput for read/write operations. Use 'IOPS' metric with 'Max' aggregation to monitor used IOPS on your cluster.
143143
- **Latency:** Latency is the time it takes an application to receive a single request, send it to storage disks, and send the response to the client. Latency is a critical measure of an application's performance in addition to IOPS and throughput. Latency is largely defined by the type of storage used and storage configuration. In a managed service like Azure Cosmos DB for MongoDB, the fast storage such as Premium SSD disks is used with settings optimized to reduce latency.
144144

0 commit comments

Comments
 (0)