Skip to content

Commit 7f9b05c

Browse files
author
Ankita Dutta
committed
images
1 parent 1498c99 commit 7f9b05c

File tree

5 files changed

+65
-48
lines changed

5 files changed

+65
-48
lines changed
18.1 KB
Loading
110 KB
Loading
31.2 KB
Loading

articles/site-recovery/monitor-log-analytics.md

Lines changed: 54 additions & 24 deletions
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@ title: Monitor Azure Site Recovery with Azure Monitor Logs
33
description: Learn how to monitor Azure Site Recovery with Azure Monitor Logs (Log Analytics)
44
ms.service: site-recovery
55
ms.topic: conceptual
6-
ms.date: 08/31/2023
6+
ms.date: 05/13/2024
77
ms.author: ankitadutta
88
author: ankitaduttaMSFT
99

@@ -19,7 +19,7 @@ For Site Recovery, you can use Azure Monitor Logs to help you do the following:
1919
- **Monitor Site Recovery health and status**. For example, you can monitor replication health, test failover status, Site Recovery events, recovery point objectives (RPOs) for protected machines, and disk/data change rates.
2020
- **Set up alerts for Site Recovery**. For example, you can configure alerts for machine health, test failover status, or Site Recovery job status.
2121

22-
Using Azure Monitor Logs with Site Recovery is supported for **Azure to Azure** replication and **VMware VM/physical server to Azure** replication.
22+
Using Azure Monitor Logs with Site Recovery is supported for **Azure to Azure** replication and **VMware virtual machine/physical server to Azure** replication.
2323

2424
> [!NOTE]
2525
> To get the churn data logs and upload rate logs for VMware and physical machines, you need to install a Microsoft monitoring agent on the Process Server. This agent sends the logs of the replicating machines to the workspace. This capability is available only for the 9.30 mobility agent version onwards.
@@ -34,6 +34,27 @@ Here's what you need:
3434

3535
We recommend that you review [common monitoring questions](monitoring-common-questions.md) before you start.
3636

37+
## Event logs available for Azure Site Recovery
38+
39+
Azure Site Recovery provides the following Resource specific and legacy tables. Each event provides detailed data on a specific set of site recovery related artifacts.
40+
41+
**Resource Specific tables**:
42+
43+
- [AzureSiteRecoveryJobs](https://learn.microsoft.com/azure/azure-monitor/reference/tables/asrjobs)
44+
- [Azure Site Recovery Replicated Items Details](https://learn.microsoft.com/azure/azure-monitor/reference/tables/ASRReplicatedItems)
45+
46+
47+
**Legacy tables**:
48+
49+
- Azure Site Recovery Events
50+
- Azure Site Recovery Replicated Items
51+
- Azure Site Recovery Replication Stats
52+
- Azure Site Recovery Recovery Points
53+
- Azure Site Recovery Replication Data Upload Rate
54+
- Azure Site Recovery Protected Disk Data Churn
55+
- Azure Site Recovery Replicated Item Details
56+
57+
3758
## Configure Site Recovery to send logs
3859

3960
1. In the vault, select **Diagnostic settings** > **Add diagnostic setting**.
@@ -64,21 +85,30 @@ You can capture the data churn rate information and source data upload rate info
6485

6586
- ASRAnalytics(*)\SourceVmChurnRate
6687
- ASRAnalytics(*)\SourceVmThrpRate
88+
89+
The churn and upload rate data will start feeding into the workspace.
90+
9. The following Site Recovery counters are not searchable currently:
91+
- ASRAnalytics(*)\SourceVmChurnRate
92+
- ASRAnalytics(*)\SourceVmThrpRate
93+
However, they can be added by pasting the names in full.
94+
95+
![Screenshot of the Windows performance counter.](./media/monitoring-log-analytics/performance-counter.png)
6796

68-
The churn and upload rate data will start feeding into the workspace.
6997

98+
- `ASRAnalytics(*)\SourceVmChurnRate` provides insights into the churn rate for replicated virtual machines.
99+
- `ASRAnalytics(*)\SourceVmThrpRate` represents the throughput rate for replicated virtual machines that is indicator of the data transfer speed between the source and target during replication.
70100

71101
## Query the logs - examples
72102

73103
You retrieve data from logs using log queries written with the [Kusto query language](../azure-monitor/logs/get-started-queries.md). This section provides a few examples of common queries you might use for Site Recovery monitoring.
74104

75105
> [!NOTE]
76-
> Some of the examples use **replicationProviderName_s** set to **A2A**. This retrieves Azure VMs that are replicated to a secondary Azure region using Site Recovery. In these examples, you can replace **A2A** with **InMageRcm**, if you want to retrieve on-premises VMware VMs or physical servers that are replicated to Azure using Site Recovery.
106+
> Some of the examples use **replicationProviderName_s** set to **A2A**. This retrieves Azure virtual machines that are replicated to a secondary Azure region using Site Recovery. In these examples, you can replace **A2A** with **InMageRcm**, if you want to retrieve on-premises VMware virtual machines or physical servers that are replicated to Azure using Site Recovery.
77107
78108

79109
### Query replication health
80110

81-
This query plots a pie chart for the current replication health of all protected Azure VMs, broken down into three states: Normal, Warning, or Critical.
111+
This query plots a pie chart for the current replication health of all protected Azure virtual machines, broken down into three states: Normal, Warning, or Critical.
82112

83113
```
84114
AzureDiagnostics 
@@ -91,7 +121,7 @@ AzureDiagnostics 
91121
```
92122
### Query Mobility service version
93123

94-
This query plots a pie chart for Azure VMs replicated with Site Recovery, broken down by the version of the Mobility agent that they're running.
124+
This query plots a pie chart for Azure virtual machines replicated with Site Recovery, broken down by the version of the Mobility agent that they're running.
95125

96126
```
97127
AzureDiagnostics 
@@ -105,7 +135,7 @@ AzureDiagnostics 
105135

106136
### Query RPO time
107137

108-
This query plots a bar chart of Azure VMs replicated with Site Recovery, broken down by recovery point objective (RPO): Less than 15 minutes, between 15-30 minutes, more than 30 minutes.
138+
This query plots a bar chart of Azure virtual machines replicated with Site Recovery, broken down by recovery point objective (RPO): Less than 15 minutes, between 15-30 minutes, more than 30 minutes.
109139

110140
```
111141
AzureDiagnostics 
@@ -119,7 +149,7 @@ rpoInSeconds_d <= 1800, "15-30Min", ">30Min") 
119149
| render barchart
120150
```
121151

122-
![Screenshot showing a bar chart of Azure VMs replicated with Site Recovery.](./media/monitoring-log-analytics/example1.png)
152+
![Screenshot showing a bar chart of Azure virtual machines replicated with Site Recovery.](./media/monitoring-log-analytics/example1.png)
123153

124154
### Query Site Recovery jobs
125155

@@ -145,7 +175,7 @@ AzureDiagnostics  
145175

146176
### Query test failover state (pie chart)
147177

148-
This query plots a pie chart for the test failover status of Azure VMs replicated with Site Recovery.
178+
This query plots a pie chart for the test failover status of Azure virtual machines replicated with Site Recovery.
149179

150180
```
151181
AzureDiagnostics 
@@ -160,7 +190,7 @@ AzureDiagnostics 
160190

161191
### Query test failover state (table)
162192

163-
This query plots a table for the test failover status of Azure VMs replicated with Site Recovery.
193+
This query plots a table for the test failover status of Azure virtual machines replicated with Site Recovery.
164194

165195
```
166196
AzureDiagnostics  
@@ -173,7 +203,7 @@ AzureDiagnostics  
173203

174204
### Query machine RPO
175205

176-
This query plots a trend graph that tracks the RPO of a specific Azure VM (ContosoVM123) for the last 72 hours.
206+
This query plots a trend graph that tracks the RPO of a specific Azure virtual machine (ContosoVM123) for the last 72 hours.
177207

178208
```
179209
AzureDiagnostics  
@@ -184,11 +214,11 @@ AzureDiagnostics  
184214
| project TimeGenerated, name_s , RPO_in_seconds = rpoInSeconds_d  
185215
| render timechart
186216
```
187-
![Screenshot of a trend graph tracking the RPO of a specific Azure VM.](./media/monitoring-log-analytics/example2.png)
217+
![Screenshot of a trend graph tracking the RPO of a specific Azure virtual machine.](./media/monitoring-log-analytics/example2.png)
188218

189-
### Query data change rate (churn) and upload rate for an Azure VM
219+
### Query data change rate (churn) and upload rate for an Azure virtual machine
190220

191-
This query plots a trend graph for a specific Azure VM (ContosoVM123), that represents the data change rate (Write Bytes per Second), and data upload rate.
221+
This query plots a trend graph for a specific Azure virtual machine (ContosoVM123), that represents the data change rate (Write Bytes per Second), and data upload rate.
192222

193223
```
194224
AzureDiagnostics  
@@ -201,7 +231,7 @@ Category contains "Upload", "UploadRate", "none") 
201231
| project TimeGenerated , InstanceWithType , Churn_MBps = todouble(Value_s)/1048576  
202232
| render timechart 
203233
```
204-
![screenshot of a trend graph for a specific Azure VM.](./media/monitoring-log-analytics/example3.png)
234+
![screenshot of a trend graph for a specific Azure virtual machine.](./media/monitoring-log-analytics/example3.png)
205235

206236
### Query data change rate (churn) and upload rate for a VMware or physical machine
207237

@@ -222,7 +252,7 @@ Process Server pushes this data every 5 minutes to the Log Analytics workspace.
222252

223253
### Query disaster recovery summary (Azure to Azure)
224254

225-
This query plots a summary table for Azure VMs replicated to a secondary Azure region. It shows the VM name, replication, and protection status, RPO, test failover status, Mobility agent version, any active replication errors, and the source location.
255+
This query plots a summary table for Azure virtual machines replicated to a secondary Azure region. It shows the virtual machine name, replication, and protection status, RPO, test failover status, Mobility agent version, any active replication errors, and the source location.
226256

227257
```
228258
AzureDiagnostics 
@@ -234,7 +264,7 @@ AzureDiagnostics 
234264

235265
### Query disaster recovery summary (VMware/physical servers)
236266

237-
This query plots a summary table for VMware VMs and physical servers replicated to Azure. It shows the machine name, replication and protection status, RPO, test failover status, Mobility agent version, any active replication errors, and the relevant process server.
267+
This query plots a summary table for VMware virtual machines and physical servers replicated to Azure. It shows the machine name, replication and protection status, RPO, test failover status, Mobility agent version, any active replication errors, and the relevant process server.
238268

239269
```
240270
AzureDiagnostics 
@@ -249,11 +279,11 @@ AzureDiagnostics 
249279
You can set up Site Recovery alerts based on Azure Monitor data. [Learn more](../azure-monitor/alerts/alerts-log.md#create-a-new-log-alert-rule-in-the-azure-portal) about setting up log alerts.
250280

251281
> [!NOTE]
252-
> Some of the examples use **replicationProviderName_s** set to **A2A**. This sets alerts for Azure VMs that are replicated to a secondary Azure region. In these examples, you can replace **A2A** with **InMageRcm** if you want to set alerts for on-premises VMware VMs or physical servers replicated to Azure.
282+
> Some of the examples use **replicationProviderName_s** set to **A2A**. This sets alerts for Azure virtual machines that are replicated to a secondary Azure region. In these examples, you can replace **A2A** with **InMageRcm** if you want to set alerts for on-premises VMware virtual machines or physical servers replicated to Azure.
253283
254284
### Multiple machines in a critical state
255285

256-
Set up an alert if more than 20 replicated Azure VMs go into a Critical state.
286+
Set up an alert if more than 20 replicated Azure virtual machines go into a Critical state.
257287

258288
```
259289
AzureDiagnostics  
@@ -267,7 +297,7 @@ For the alert, set **Threshold value** to `20`.
267297

268298
### Single machine in a critical state
269299

270-
Set up an alert if a specific replicated Azure VM goes into a Critical state.
300+
Set up an alert if a specific replicated Azure virtual machine goes into a Critical state.
271301

272302
```
273303
AzureDiagnostics  
@@ -282,7 +312,7 @@ For the alert, set **Threshold value** to `1`.
282312

283313
### Multiple machines exceed RPO
284314

285-
Set up an alert if the RPO for more than 20 Azure VMs exceeds 30 minutes.
315+
Set up an alert if the RPO for more than 20 Azure virtual machines exceeds 30 minutes.
286316
```
287317
AzureDiagnostics  
288318
| where replicationProviderName_s == "A2A"  
@@ -296,7 +326,7 @@ For the alert, set **Threshold value** to `20`.
296326

297327
### Single machine exceeds RPO
298328

299-
Set up an alert if the RPO for a single Azure VM exceeds 30 minutes.
329+
Set up an alert if the RPO for a single Azure virtual machine exceeds 30 minutes.
300330

301331
```
302332
AzureDiagnostics  
@@ -312,7 +342,7 @@ For the alert, set **Threshold value** to `1`.
312342

313343
### Test failover for multiple machines exceeds 90 days
314344

315-
Set up an alert if the last successful test failover was more than 90 days, for more than 20 VMs.
345+
Set up an alert if the last successful test failover was more than 90 days, for more than 20 virtual machines.
316346

317347
```
318348
AzureDiagnostics 
@@ -327,7 +357,7 @@ For the alert, set **Threshold value** to `20`.
327357

328358
### Test failover for a single machine exceeds 90 days
329359

330-
Set up an alert if the last successful test failover for a specific VM was more than 90 days ago.
360+
Set up an alert if the last successful test failover for a specific virtual machine was more than 90 days ago.
331361
```
332362
AzureDiagnostics 
333363
| where replicationProviderName_s == "A2A"  

0 commit comments

Comments
 (0)