You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/site-recovery/monitor-log-analytics.md
+54-24Lines changed: 54 additions & 24 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -3,7 +3,7 @@ title: Monitor Azure Site Recovery with Azure Monitor Logs
3
3
description: Learn how to monitor Azure Site Recovery with Azure Monitor Logs (Log Analytics)
4
4
ms.service: site-recovery
5
5
ms.topic: conceptual
6
-
ms.date: 08/31/2023
6
+
ms.date: 05/13/2024
7
7
ms.author: ankitadutta
8
8
author: ankitaduttaMSFT
9
9
@@ -19,7 +19,7 @@ For Site Recovery, you can use Azure Monitor Logs to help you do the following:
19
19
-**Monitor Site Recovery health and status**. For example, you can monitor replication health, test failover status, Site Recovery events, recovery point objectives (RPOs) for protected machines, and disk/data change rates.
20
20
-**Set up alerts for Site Recovery**. For example, you can configure alerts for machine health, test failover status, or Site Recovery job status.
21
21
22
-
Using Azure Monitor Logs with Site Recovery is supported for **Azure to Azure** replication and **VMware VM/physical server to Azure** replication.
22
+
Using Azure Monitor Logs with Site Recovery is supported for **Azure to Azure** replication and **VMware virtual machine/physical server to Azure** replication.
23
23
24
24
> [!NOTE]
25
25
> To get the churn data logs and upload rate logs for VMware and physical machines, you need to install a Microsoft monitoring agent on the Process Server. This agent sends the logs of the replicating machines to the workspace. This capability is available only for the 9.30 mobility agent version onwards.
@@ -34,6 +34,27 @@ Here's what you need:
34
34
35
35
We recommend that you review [common monitoring questions](monitoring-common-questions.md) before you start.
36
36
37
+
## Event logs available for Azure Site Recovery
38
+
39
+
Azure Site Recovery provides the following Resource specific and legacy tables. Each event provides detailed data on a specific set of site recovery related artifacts.
-[Azure Site Recovery Replicated Items Details](https://learn.microsoft.com/azure/azure-monitor/reference/tables/ASRReplicatedItems)
45
+
46
+
47
+
**Legacy tables**:
48
+
49
+
- Azure Site Recovery Events
50
+
- Azure Site Recovery Replicated Items
51
+
- Azure Site Recovery Replication Stats
52
+
- Azure Site Recovery Recovery Points
53
+
- Azure Site Recovery Replication Data Upload Rate
54
+
- Azure Site Recovery Protected Disk Data Churn
55
+
- Azure Site Recovery Replicated Item Details
56
+
57
+
37
58
## Configure Site Recovery to send logs
38
59
39
60
1. In the vault, select **Diagnostic settings** > **Add diagnostic setting**.
@@ -64,21 +85,30 @@ You can capture the data churn rate information and source data upload rate info
64
85
65
86
- ASRAnalytics(*)\SourceVmChurnRate
66
87
- ASRAnalytics(*)\SourceVmThrpRate
88
+
89
+
The churn and upload rate data will start feeding into the workspace.
90
+
9. The following Site Recovery counters are not searchable currently:
91
+
- ASRAnalytics(*)\SourceVmChurnRate
92
+
- ASRAnalytics(*)\SourceVmThrpRate
93
+
However, they can be added by pasting the names in full.
94
+
95
+

67
96
68
-
The churn and upload rate data will start feeding into the workspace.
69
97
98
+
-`ASRAnalytics(*)\SourceVmChurnRate` provides insights into the churn rate for replicated virtual machines.
99
+
-`ASRAnalytics(*)\SourceVmThrpRate` represents the throughput rate for replicated virtual machines that is indicator of the data transfer speed between the source and target during replication.
70
100
71
101
## Query the logs - examples
72
102
73
103
You retrieve data from logs using log queries written with the [Kusto query language](../azure-monitor/logs/get-started-queries.md). This section provides a few examples of common queries you might use for Site Recovery monitoring.
74
104
75
105
> [!NOTE]
76
-
> Some of the examples use **replicationProviderName_s** set to **A2A**. This retrieves Azure VMs that are replicated to a secondary Azure region using Site Recovery. In these examples, you can replace **A2A** with **InMageRcm**, if you want to retrieve on-premises VMware VMs or physical servers that are replicated to Azure using Site Recovery.
106
+
> Some of the examples use **replicationProviderName_s** set to **A2A**. This retrieves Azure virtual machines that are replicated to a secondary Azure region using Site Recovery. In these examples, you can replace **A2A** with **InMageRcm**, if you want to retrieve on-premises VMware virtual machines or physical servers that are replicated to Azure using Site Recovery.
77
107
78
108
79
109
### Query replication health
80
110
81
-
This query plots a pie chart for the current replication health of all protected Azure VMs, broken down into three states: Normal, Warning, or Critical.
111
+
This query plots a pie chart for the current replication health of all protected Azure virtual machines, broken down into three states: Normal, Warning, or Critical.
82
112
83
113
```
84
114
AzureDiagnostics
@@ -91,7 +121,7 @@ AzureDiagnostics
91
121
```
92
122
### Query Mobility service version
93
123
94
-
This query plots a pie chart for Azure VMs replicated with Site Recovery, broken down by the version of the Mobility agent that they're running.
124
+
This query plots a pie chart for Azure virtual machines replicated with Site Recovery, broken down by the version of the Mobility agent that they're running.
95
125
96
126
```
97
127
AzureDiagnostics
@@ -105,7 +135,7 @@ AzureDiagnostics
105
135
106
136
### Query RPO time
107
137
108
-
This query plots a bar chart of Azure VMs replicated with Site Recovery, broken down by recovery point objective (RPO): Less than 15 minutes, between 15-30 minutes, more than 30 minutes.
138
+
This query plots a bar chart of Azure virtual machines replicated with Site Recovery, broken down by recovery point objective (RPO): Less than 15 minutes, between 15-30 minutes, more than 30 minutes.

217
+

188
218
189
-
### Query data change rate (churn) and upload rate for an Azure VM
219
+
### Query data change rate (churn) and upload rate for an Azure virtual machine
190
220
191
-
This query plots a trend graph for a specific Azure VM (ContosoVM123), that represents the data change rate (Write Bytes per Second), and data upload rate.
221
+
This query plots a trend graph for a specific Azure virtual machine (ContosoVM123), that represents the data change rate (Write Bytes per Second), and data upload rate.

234
+

205
235
206
236
### Query data change rate (churn) and upload rate for a VMware or physical machine
207
237
@@ -222,7 +252,7 @@ Process Server pushes this data every 5 minutes to the Log Analytics workspace.
222
252
223
253
### Query disaster recovery summary (Azure to Azure)
224
254
225
-
This query plots a summary table for Azure VMs replicated to a secondary Azure region. It shows the VM name, replication, and protection status, RPO, test failover status, Mobility agent version, any active replication errors, and the source location.
255
+
This query plots a summary table for Azure virtual machines replicated to a secondary Azure region. It shows the virtual machine name, replication, and protection status, RPO, test failover status, Mobility agent version, any active replication errors, and the source location.
This query plots a summary table for VMware VMs and physical servers replicated to Azure. It shows the machine name, replication and protection status, RPO, test failover status, Mobility agent version, any active replication errors, and the relevant process server.
267
+
This query plots a summary table for VMware virtual machines and physical servers replicated to Azure. It shows the machine name, replication and protection status, RPO, test failover status, Mobility agent version, any active replication errors, and the relevant process server.
238
268
239
269
```
240
270
AzureDiagnostics
@@ -249,11 +279,11 @@ AzureDiagnostics
249
279
You can set up Site Recovery alerts based on Azure Monitor data. [Learn more](../azure-monitor/alerts/alerts-log.md#create-a-new-log-alert-rule-in-the-azure-portal) about setting up log alerts.
250
280
251
281
> [!NOTE]
252
-
> Some of the examples use **replicationProviderName_s** set to **A2A**. This sets alerts for Azure VMs that are replicated to a secondary Azure region. In these examples, you can replace **A2A** with **InMageRcm** if you want to set alerts for on-premises VMware VMs or physical servers replicated to Azure.
282
+
> Some of the examples use **replicationProviderName_s** set to **A2A**. This sets alerts for Azure virtual machines that are replicated to a secondary Azure region. In these examples, you can replace **A2A** with **InMageRcm** if you want to set alerts for on-premises VMware virtual machines or physical servers replicated to Azure.
253
283
254
284
### Multiple machines in a critical state
255
285
256
-
Set up an alert if more than 20 replicated Azure VMs go into a Critical state.
286
+
Set up an alert if more than 20 replicated Azure virtual machines go into a Critical state.
257
287
258
288
```
259
289
AzureDiagnostics
@@ -267,7 +297,7 @@ For the alert, set **Threshold value** to `20`.
267
297
268
298
### Single machine in a critical state
269
299
270
-
Set up an alert if a specific replicated Azure VM goes into a Critical state.
300
+
Set up an alert if a specific replicated Azure virtual machine goes into a Critical state.
271
301
272
302
```
273
303
AzureDiagnostics
@@ -282,7 +312,7 @@ For the alert, set **Threshold value** to `1`.
282
312
283
313
### Multiple machines exceed RPO
284
314
285
-
Set up an alert if the RPO for more than 20 Azure VMs exceeds 30 minutes.
315
+
Set up an alert if the RPO for more than 20 Azure virtual machines exceeds 30 minutes.
286
316
```
287
317
AzureDiagnostics
288
318
| where replicationProviderName_s == "A2A"
@@ -296,7 +326,7 @@ For the alert, set **Threshold value** to `20`.
296
326
297
327
### Single machine exceeds RPO
298
328
299
-
Set up an alert if the RPO for a single Azure VM exceeds 30 minutes.
329
+
Set up an alert if the RPO for a single Azure virtual machine exceeds 30 minutes.
300
330
301
331
```
302
332
AzureDiagnostics
@@ -312,7 +342,7 @@ For the alert, set **Threshold value** to `1`.
312
342
313
343
### Test failover for multiple machines exceeds 90 days
314
344
315
-
Set up an alert if the last successful test failover was more than 90 days, for more than 20 VMs.
345
+
Set up an alert if the last successful test failover was more than 90 days, for more than 20 virtual machines.
316
346
317
347
```
318
348
AzureDiagnostics
@@ -327,7 +357,7 @@ For the alert, set **Threshold value** to `20`.
327
357
328
358
### Test failover for a single machine exceeds 90 days
329
359
330
-
Set up an alert if the last successful test failover for a specific VM was more than 90 days ago.
360
+
Set up an alert if the last successful test failover for a specific virtual machine was more than 90 days ago.
0 commit comments