articles/azure-cache-for-redis/cache-how-to-monitor.md
Lines changed: 30 additions & 17 deletions
Original file line number
Diff line number
Diff line change
@@ -5,7 +5,7 @@ author: flang-msft
5
5
ms.author: franlanglois
6
6
ms.service: cache
7
7
ms.topic: conceptual
8
-
ms.date: 05/06/2022
8
+
ms.date: 02/06/2022
9
9
10
10
---
11
11
# Monitor Azure Cache for Redis
@@ -127,7 +127,7 @@ The types **Count** and **“Sum** can be misleading for certain metrics (connec
127
127
128
128
For non-clustered caches, we recommend using the metrics without the suffix `Instance Based`. For example, to check server load for your cache instance, use the metric `Server Load`.
129
129
130
-
In contrast, for clustered caches, we recommend using the metrics with the suffix `Instance Based`, Then, add a split or filter on `ShardId`. For example, to check the server load of shard 1, use the metric "Server Load (Instance Based)", then apply filter `ShardId = 1`.
130
+
In contrast, for clustered caches, we recommend using the metrics with the suffix `Instance Based`. Then, add a split or filter on `ShardId`. For example, to check the server load of shard 1, use the metric "Server Load (Instance Based)", then apply filter `ShardId = 1`.
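As a rough illustration of the recommendation above, the following sketch picks the metric display name and optional shard filter for a cache. The helper `choose_metric` and its parameters are hypothetical and not part of any Azure SDK; it only encodes the naming rule described in this section.

```python
from typing import Optional, Tuple


def choose_metric(
    base_name: str, clustered: bool, shard_id: Optional[int] = None
) -> Tuple[str, Optional[str]]:
    """Return (metric display name, filter expression or None).

    Hypothetical helper: it mirrors the guidance in the text and does not
    call Azure Monitor.
    """
    if not clustered:
        # Non-clustered caches: use the metric without the suffix.
        return base_name, None
    # Clustered caches: use the "Instance Based" variant.
    name = f"{base_name} (Instance Based)"
    # Optionally narrow the result to a single shard.
    shard_filter = None if shard_id is None else f"ShardId = {shard_id}"
    return name, shard_filter


print(choose_metric("Server Load", clustered=False))
print(choose_metric("Server Load", clustered=True, shard_id=1))
```

The returned strings match what you would select in the portal's metrics chart; how they map to API-level metric names is outside the scope of this sketch.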
131
131
132
132
## List of metrics
133
133
@@ -168,34 +168,38 @@ In contrast, for clustered caches, we recommend using the metrics with the suffi
168
168
> Geo-replication metrics are affected by monthly internal maintenance operations. The Azure Cache for Redis service periodically patches all caches with the latest platform features and improvements. During these updates, each cache node is taken offline, which temporarily disables the geo-replication link. If your geo replication link is unhealthy, check to see if it was caused by a patching event on either the geo-primary or geo-secondary cache by using **Diagnose and Solve Problems** from the Resource menu in the portal. Depending on the amount of data in the cache, the downtime from patching can take anywhere from a few minutes to an hour. If the geo-replication link is unhealthy for over an hour, [file a support request](../azure-portal/supportability/how-to-create-azure-support-request.md).
169
169
>
170
170
171
-
- Geo Replication Connectivity Lag (preview)
171
+
> [!NOTE]
172
+
> The [Geo-Replication Dashboard](#organize-with-workbooks) workbook offers a simple way to view all Premium-tier geo-replication metrics in one place. The dashboard pulls together metrics that are only emitted by the geo-primary or the geo-secondary, so they can be viewed simultaneously.
173
+
>
174
+
175
+
- Geo Replication Connectivity Lag
172
176
- Depicts the time, in seconds, since the last successful data synchronization between geo-primary & geo-secondary. If the link goes down, this value continues to increase, indicating a problem.
173
177
- This metric is only emitted **from the geo-secondary** cache instance. On the geo-primary instance, this metric has no value.
174
178
- This metric is only available in the Premium tier for caches with geo-replication enabled.
175
-
- Geo Replication Data Sync Offset (preview)
179
+
- Geo Replication Data Sync Offset
176
180
- Depicts the approximate amount of data, in bytes, that has yet to be synchronized to geo-secondary cache.
177
-
- This metric is only emitted **from the geo-primary** cache instance. On the geo-secondary instance, this metric has no value.
181
+
- This metric is only emitted _from the geo-primary_ cache instance. On the geo-secondary instance, this metric has no value.
178
182
- This metric is only available in the Premium tier for caches with geo-replication enabled.
179
-
- Geo Replication Full Sync Event Finished (preview)
183
+
- Geo Replication Full Sync Event Finished
180
184
- Depicts the completion of full synchronization between geo-replicated caches. When you see lots of writes on geo-primary, and replication between the two caches can’t keep up, then a full sync is needed. A full sync involves copying the complete data from geo-primary to geo-secondary by taking an RDB snapshot rather than a partial sync that occurs on normal instances. See [this page](https://redis.io/docs/manual/replication/#how-redis-replication-works) for a more detailed explanation.
181
-
-This metric reports zero most of the time because geo-replication uses partial resynchronizations for any new data added after the initial full synchronization.
182
-
- This metric is only emitted **from the geo-secondary** cache instance. On the geo-primary instance, this metric has no value.
185
+
- The metric reports zero most of the time because geo-replication uses partial resynchronizations for any new data added after the initial full synchronization.
186
+
- This metric is only emitted _from the geo-secondary_ cache instance. On the geo-primary instance, this metric has no value.
183
187
- This metric is only available in the Premium tier for caches with geo-replication enabled.
184
188
185
-
- Geo Replication Full Sync Event Started (preview)
186
-
- Depicts the start of full synchronization between geo-replicated caches. When there are a lot of writes in geo-primary, and replication between the two caches can’t keep up, then a full sync is needed. A full sync involves copying the complete data from geo-primary to geo-secondary by taking an RDB snapshot rather than a partial sync that occurs on normal instances. See [this page](https://redis.io/docs/manual/replication/#how-redis-replication-works) for a more detailed explanation.
187
-
-This metric reports zero most of the time because geo-replication uses partial resynchronizations for any new data added after the initial full synchronization.
188
-
-This metric is only emitted **from the geo-secondary** cache instance. On the geo-primary instance, this metric has no value.
189
-
-This metric is only available in the Premium tier for caches with geo-replication enabled.
189
+
- Geo Replication Full Sync Event Started
190
+
- Depicts the start of full synchronization between geo-replicated caches. When there are many writes in geo-primary, and replication between the two caches can’t keep up, then a full sync is needed. A full sync involves copying the complete data from geo-primary to geo-secondary by taking an RDB snapshot rather than a partial sync that occurs on normal instances. See [this page](https://redis.io/docs/manual/replication/#how-redis-replication-works) for a more detailed explanation.
191
+
- The metric reports zero most of the time because geo-replication uses partial resynchronizations for any new data added after the initial full synchronization.
192
+
- The metric is only emitted _from the geo-secondary_ cache instance. On the geo-primary instance, this metric has no value.
193
+
- The metric is only available in the Premium tier for caches with geo-replication enabled.
190
194
191
195
- Geo Replication Healthy
192
196
- Depicts the status of the geo-replication link between caches. There can be two possible states that the replication link can be in:
193
197
- 0 – disconnected/unhealthy
194
198
- 1 – healthy
195
-
-This metric is only emitted **from the geo-secondary** cache instance. On the geo-primary instance, this metric has no value.
196
-
-This metric is only available in the Premium tier for caches with geo-replication enabled.
199
+
- The metric is available in Enterprise, Enterprise Flash, and Premium tier caches with geo-replication enabled.
200
+
- In caches on the Premium tier, this metric is only emitted *from the geo-secondary* cache instance. On the geo-primary instance, this metric has no value.
197
201
- This metric may indicate a disconnected/unhealthy replication status for several reasons, including: monthly patching, host OS updates, network misconfiguration, or failed geo-replication link provisioning.
198
-
- A value of 0 does not mean that data on the geo-replica is lost. It just means that the link between geo-primary and geo-secondary is unhealthy.
202
+
- A value of 0 doesn't mean that data on the geo-replica is lost. It just means that the link between geo-primary and geo-secondary is unhealthy.
199
203
- If the geo-replication link is unhealthy for over an hour, [file a support request](../azure-portal/supportability/how-to-create-azure-support-request.md).
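The one-hour guidance for Geo Replication Healthy can be sketched as a small check over metric samples. This is a minimal sketch that assumes the samples have already been retrieved as `(timestamp, value)` pairs sorted by time; the function names and the sample format are hypothetical and don't correspond to any Azure API.

```python
from datetime import datetime, timedelta
from typing import Iterable, Tuple

Sample = Tuple[datetime, int]  # (timestamp, 0 = unhealthy or 1 = healthy)


def unhealthy_duration(samples: Iterable[Sample], now: datetime) -> timedelta:
    """Return how long the link has been continuously unhealthy as of `now`.

    Assumes `samples` is sorted by timestamp in ascending order.
    """
    unhealthy_since = None
    for timestamp, value in samples:
        if value == 0:
            # Remember only the start of the current unhealthy streak.
            if unhealthy_since is None:
                unhealthy_since = timestamp
        else:
            # A healthy sample ends any earlier unhealthy streak.
            unhealthy_since = None
    if unhealthy_since is None:
        return timedelta(0)
    return now - unhealthy_since


def should_file_support_request(
    samples: Iterable[Sample],
    now: datetime,
    threshold: timedelta = timedelta(hours=1),
) -> bool:
    """True when the link has been unhealthy for longer than the threshold."""
    return unhealthy_duration(samples, now) > threshold
```

A short unhealthy streak, for example during monthly patching, stays below the threshold and returns `False`. Remember that a value of 0 describes the link only and doesn't indicate data loss on the geo-replica.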
200
204
201
205
- Gets
@@ -233,10 +237,19 @@ For more information about configuring and using Alerts, see [Overview of Alerts
233
237
234
238
## Organize with workbooks
235
239
236
-
Once you've defined a metric, you can send it to a workbook. Workbooks provide a way to organize your metrics into groups that provide the information in coherent way.
240
+
Once you've defined a metric, you can send it to a workbook. Workbooks provide a way to organize your metrics into groups that present the information in a coherent way. Azure Cache for Redis provides two workbooks by default in the **Azure Cache for Redis Insights** section:
241
+
242
+
:::image type="content" source="media/cache-how-to-monitor/cache-monitoring-workbook.png" alt-text="Screenshot showing the workbooks selected in the Resource menu.":::
237
243
238
244
For information on creating a metric, see [Create your own metrics](#create-your-own-metrics).
239
245
246
+
The two workbooks provided are:
247
+
- **Azure Cache For Redis Resource Overview** combines many of the most commonly used metrics so that the health and performance of the cache instance can be viewed at a glance.
248
+
:::image type="content" source="media/cache-how-to-monitor/cache-monitoring-resource-overview.png" alt-text="Screenshot of graphs showing a resource overview for the cache.":::
249
+
250
+
- **Geo-Replication Dashboard** pulls geo-replication health and status metrics from both the geo-primary and geo-secondary cache instances to give a complete picture of geo-replication health. We recommend using this dashboard because some geo-replication metrics are emitted only from the geo-primary or only from the geo-secondary.
251
+
:::image type="content" source="media/cache-how-to-monitor/cache-monitoring-geo-dashboard.png" alt-text="Screenshot showing the geo-replication dashboard with a geo-primary and geo-secondary cache set.":::
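To show why a combined view helps, the sketch below records which instance emits each Premium-tier geo-replication metric, following the metric list in this article, and groups the metrics by emitting instance. The dictionary and the helper `metrics_by_instance` are illustrative only and aren't part of the workbook or any Azure SDK.

```python
from typing import Dict, List

# Emitting instance per metric on the Premium tier, taken from the
# metric descriptions in this article.
EMITTED_FROM: Dict[str, str] = {
    "Geo Replication Connectivity Lag": "geo-secondary",
    "Geo Replication Data Sync Offset": "geo-primary",
    "Geo Replication Full Sync Event Finished": "geo-secondary",
    "Geo Replication Full Sync Event Started": "geo-secondary",
    "Geo Replication Healthy": "geo-secondary",
}


def metrics_by_instance(emitted_from: Dict[str, str]) -> Dict[str, List[str]]:
    """Group metric names by the cache instance that emits them."""
    grouped: Dict[str, List[str]] = {}
    for metric, instance in emitted_from.items():
        grouped.setdefault(instance, []).append(metric)
    return grouped


for instance, metrics in metrics_by_instance(EMITTED_FROM).items():
    print(instance, metrics)
```

Because no single instance emits every metric, checking only one cache gives an incomplete picture, which is the gap the dashboard is meant to close.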
252
+
240
253
## Next steps
241
254
242
255
- [Azure Monitor for Azure Cache for Redis](redis-cache-insights-overview.md)