Commit 8a143e3

Merge pull request #225247 from MSFTeegarden/patch-10
Update cache-how-to-monitor.md
2 parents 495484d + 5027414 commit 8a143e3

4 files changed: +30 -17 lines changed


articles/azure-cache-for-redis/cache-how-to-monitor.md

Lines changed: 30 additions & 17 deletions
@@ -5,7 +5,7 @@ author: flang-msft
 ms.author: franlanglois
 ms.service: cache
 ms.topic: conceptual
-ms.date: 05/06/2022
+ms.date: 02/06/2022
 
 ---
 # Monitor Azure Cache for Redis
@@ -127,7 +127,7 @@ The types **Count** and **“Sum** can be misleading for certain metrics (connec
 
 For non-clustered caches, we recommend using the metrics without the suffix `Instance Based`. For example, to check server load for your cache instance, use the metric `Server Load`.
 
-In contrast, for clustered caches, we recommend using the metrics with the suffix `Instance Based`, Then, add a split or filter on `ShardId`. For example, to check the server load of shard 1, use the metric "Server Load (Instance Based)", then apply filter `ShardId = 1`.
+In contrast, for clustered caches, we recommend using the metrics with the suffix `Instance Based`. Then, add a split or filter on `ShardId`. For example, to check the server load of shard 1, use the metric "Server Load (Instance Based)", then apply filter `ShardId = 1`.
 
 ## List of metrics
 
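Reviewer's sketch (not part of the commit): the clustered-cache guidance in the hunk above amounts to splitting or filtering instance-based samples on `ShardId`. The Python below illustrates that idea on already-exported samples. The record layout and the field names `shard_id` and `server_load` are hypothetical and do not correspond to any Azure SDK type.

```python
from collections import defaultdict


def filter_shard(samples, shard_id):
    """Keep only the samples that belong to one shard (the 'filter' case)."""
    return [s for s in samples if s["shard_id"] == shard_id]


def peak_load_by_shard(samples):
    """Group samples by shard and return the peak load per shard (the 'split' case).

    Each sample is assumed to be a dict with a hypothetical 'shard_id' key and a
    non-negative 'server_load' percentage.
    """
    peaks = defaultdict(float)  # unseen shards start at 0.0
    for s in samples:
        peaks[s["shard_id"]] = max(peaks[s["shard_id"]], s["server_load"])
    return dict(peaks)
```

Looking at each shard separately keeps a single hot shard visible, where one cache-wide number could mask it.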
@@ -168,34 +168,38 @@ In contrast, for clustered caches, we recommend using the metrics with the suffi
 > Geo-replication metrics are affected by monthly internal maintenance operations. The Azure Cache for Redis service periodically patches all caches with the latest platform features and improvements. During these updates, each cache node is taken offline, which temporarily disables the geo-replication link. If your geo replication link is unhealthy, check to see if it was caused by a patching event on either the geo-primary or geo-secondary cache by using **Diagnose and Solve Problems** from the Resource menu in the portal. Depending on the amount of data in the cache, the downtime from patching can take anywhere from a few minutes to an hour. If the geo-replication link is unhealthy for over an hour, [file a support request](../azure-portal/supportability/how-to-create-azure-support-request.md).
 >
 
-- Geo Replication Connectivity Lag (preview)
+> [!NOTE]
+> The [Geo-Replication Dashboard](#organize-with-workbooks) workbook is a simple and easy way to view all Premium-tier geo-replication metrics in the same place. This dashboard will pull together metrics that are only emitted by the geo-primary or geo-secondary, so they can be viewed simultaneously.
+>
+
+- Geo Replication Connectivity Lag
   - Depicts the time, in seconds, since the last successful data synchronization between geo-primary & geo-secondary. If the link goes down, this value continues to increase, indicating a problem.
   - This metric is only emitted **from the geo-secondary** cache instance. On the geo-primary instance, this metric has no value.
   - This metric is only available in the Premium tier for caches with geo-replication enabled.
-- Geo Replication Data Sync Offset (preview)
+- Geo Replication Data Sync Offset
   - Depicts the approximate amount of data, in bytes, that has yet to be synchronized to geo-secondary cache.
-  - This metric is only emitted **from the geo-primary** cache instance. On the geo-secondary instance, this metric has no value.
+  - This metric is only emitted _from the geo-primary_ cache instance. On the geo-secondary instance, this metric has no value.
   - This metric is only available in the Premium tier for caches with geo-replication enabled.
-- Geo Replication Full Sync Event Finished (preview)
+- Geo Replication Full Sync Event Finished
   - Depicts the completion of full synchronization between geo-replicated caches. When you see lots of writes on geo-primary, and replication between the two caches can’t keep up, then a full sync is needed. A full sync involves copying the complete data from geo-primary to geo-secondary by taking an RDB snapshot rather than a partial sync that occurs on normal instances. See [this page](https://redis.io/docs/manual/replication/#how-redis-replication-works) for a more detailed explanation.
-  - This metric reports zero most of the time because geo-replication uses partial resynchronizations for any new data added after the initial full synchronization.
-  - This metric is only emitted **from the geo-secondary** cache instance. On the geo-primary instance, this metric has no value.
+  - The metric reports zero most of the time because geo-replication uses partial resynchronizations for any new data added after the initial full synchronization.
+  - This metric is only emitted _from the geo-secondary_ cache instance. On the geo-primary instance, this metric has no value.
   - This metric is only available in the Premium tier for caches with geo-replication enabled.
 
-- Geo Replication Full Sync Event Started (preview)
-  - Depicts the start of full synchronization between geo-replicated caches. When there are a lot of writes in geo-primary, and replication between the two caches can’t keep up, then a full sync is needed. A full sync involves copying the complete data from geo-primary to geo-secondary by taking an RDB snapshot rather than a partial sync that occurs on normal instances. See [this page](https://redis.io/docs/manual/replication/#how-redis-replication-works) for a more detailed explanation.
-  - This metric reports zero most of the time because geo-replication uses partial resynchronizations for any new data added after the initial full synchronization.
-  - This metric is only emitted **from the geo-secondary** cache instance. On the geo-primary instance, this metric has no value.
-  - This metric is only available in the Premium tier for caches with geo-replication enabled.
+- Geo Replication Full Sync Event Started
+  - Depicts the start of full synchronization between geo-replicated caches. When there are many writes in geo-primary, and replication between the two caches can’t keep up, then a full sync is needed. A full sync involves copying the complete data from geo-primary to geo-secondary by taking an RDB snapshot rather than a partial sync that occurs on normal instances. See [this page](https://redis.io/docs/manual/replication/#how-redis-replication-works) for a more detailed explanation.
+  - The metric reports zero most of the time because geo-replication uses partial resynchronizations for any new data added after the initial full synchronization.
+  - The metric is only emitted _from the geo-secondary_ cache instance. On the geo-primary instance, this metric has no value.
+  - The metric is only available in the Premium tier for caches with geo-replication enabled.
 
 - Geo Replication Healthy
   - Depicts the status of the geo-replication link between caches. There can be two possible states that the replication link can be in:
     - 0 disconnected/unhealthy
     - 1 – healthy
-  - This metric is only emitted **from the geo-secondary** cache instance. On the geo-primary instance, this metric has no value.
-  - This metric is only available in the Premium tier for caches with geo-replication enabled.
+  - The metric is available in the Enterprise, Enterprise Flash tiers, and Premium tier caches with geo-replication enabled.
+  - In caches on the Premium tier, this metric is only emitted *from the geo-secondary* cache instance. On the geo-primary instance, this metric has no value.
   - This metric may indicate a disconnected/unhealthy replication status for several reasons, including: monthly patching, host OS updates, network misconfiguration, or failed geo-replication link provisioning.
-  - A value of 0 does not mean that data on the geo-replica is lost. It just means that the link between geo-primary and geo-secondary is unhealthy.
+  - A value of 0 doesn't mean that data on the geo-replica is lost. It just means that the link between geo-primary and geo-secondary is unhealthy.
   - If the geo-replication link is unhealthy for over an hour, [file a support request](../azure-portal/supportability/how-to-create-azure-support-request.md).
 
 - Gets
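Reviewer's sketch (not part of the commit): the hunk above says a `Geo Replication Healthy` value of 0 is expected during patching and only warrants a support request once the link has been unhealthy for over an hour. The Python below shows one way to apply that rule to exported samples. The `(timestamp, value)` tuple layout and both function names are assumptions made for illustration; this is not an Azure API.

```python
from datetime import datetime, timedelta


def unhealthy_duration(samples, now):
    """Return how long the link has been continuously unhealthy as of `now`.

    `samples` is assumed to be a list of (timestamp, value) tuples sorted by
    ascending timestamp, where value 1 means healthy and 0 means
    disconnected/unhealthy.
    """
    start = None  # start of the current unbroken run of 0 values
    for ts, value in samples:
        if value == 0:
            if start is None:
                start = ts
        else:
            start = None  # a healthy sample ends the unhealthy run
    return now - start if start is not None else timedelta(0)


def should_file_support_request(samples, now, threshold=timedelta(hours=1)):
    """Apply the 'unhealthy for over an hour' rule from the article text."""
    return unhealthy_duration(samples, now) > threshold
```

A brief dip to 0 during monthly patching stays below the threshold, so only a sustained outage returns `True`.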
@@ -233,10 +237,19 @@ For more information about configuring and using Alerts, see [Overview of Alerts
 
 ## Organize with workbooks
 
-Once you've defined a metric, you can send it to a workbook. Workbooks provide a way to organize your metrics into groups that provide the information in coherent way.
+Once you've defined a metric, you can send it to a workbook. Workbooks provide a way to organize your metrics into groups that provide the information in coherent way. Azure Cache for Redis provides two workbooks by default in the **Azure Cache for Redis Insights** section:
+
+:::image type="content" source="media/cache-how-to-monitor/cache-monitoring-workbook.png" alt-text="Screenshot showing the workbooks selected in the Resource menu.":::
 
 For information on creating a metric, see [Create your own metrics](#create-your-own-metrics).
 
+The two workbooks provided are:
+- **Azure Cache For Redis Resource Overview** combines many of the most commonly used metrics so that the health and performance of the cache instance can be viewed at a glance.
+:::image type="content" source="media/cache-how-to-monitor/cache-monitoring-resource-overview.png" alt-text="Screenshot of graphs showing a resource overview for the cache.":::
+
+- **Geo-Replication Dashboard** pulls geo-replication health and status metrics from both the geo-primary and geo-secondary cache instances to give a complete picture of geo-replcation health. Using this dashboard is recommended, as some geo-replication metrics are only emitted from either the geo-primary or geo-secondary.
+:::image type="content" source="media/cache-how-to-monitor/cache-monitoring-geo-dashboard.png" alt-text="Screenshot showing the geo-replication dashboard with a geo-primary and geo-secondary cache set.":::
+
 ## Next steps
 
 - [Azure Monitor for Azure Cache for Redis](redis-cache-insights-overview.md)
