Commit 8e8b13d

Merge pull request #235933 from SnehaSudhirG/26Apr-TroubleshootHRWDocUpdate
Added info on Job suspension
2 parents 8e727a8 + b72f5fd commit 8e8b13d

2 files changed: 78 additions and 2 deletions

articles/automation/troubleshoot/extension-based-hybrid-runbook-worker.md

Lines changed: 36 additions & 1 deletion
@@ -2,7 +2,7 @@
22
title: Troubleshoot extension-based Hybrid Runbook Worker issues in Azure Automation
33
description: This article tells how to troubleshoot and resolve issues that arise with Azure Automation extension-based Hybrid Runbook Workers.
44
services: automation
5-
ms.date: 02/09/2023
5+
ms.date: 04/26/2023
66
ms.topic: troubleshooting
77
ms.custom: devx-track-azurepowershell
88
---
@@ -40,6 +40,41 @@ To help troubleshoot issues with extension-based Hybrid Runbook Workers:
4040
Logs are in `C:\HybridWorkerExtensionLogs`.
4141
- For Linux: Logs are in folders </br>`/var/log/azure/Microsoft.Azure.Automation.HybridWorker.HybridWorkerForLinux` and `/home/hweautomation`.
4242

43+
### Scenario: Job failed to start as the Hybrid Worker was not available when the scheduled job started
44+
45+
#### Issue
46+
Job fails to start on a Hybrid Worker and you see the following error:
47+
48+
*Failed to start, as hybrid worker was not available when scheduled job started, the hybrid worker was last active at mm/dd/yyyy*.
49+
50+
#### Cause
51+
This error can occur due to the following reasons:
52+
- The machine doesn't exist anymore.
53+
- The machine is turned off and is unreachable.
54+
- The machine has a network connectivity issue.
55+
- The Hybrid Runbook Worker extension has been uninstalled from the machine.
56+
57+
#### Resolution
58+
- Ensure that the machine exists and that the Hybrid Runbook Worker extension is installed on it. The Hybrid Worker should be healthy and should send a heartbeat. Troubleshoot any network issues by checking the Microsoft-SMA event logs on the Workers in the Hybrid Runbook Worker Group that tried to run this job.
59+
- You can also monitor the [HybridWorkerPing](../../azure-monitor/essentials/metrics-supported.md#microsoftautomationautomationaccounts) metric, which provides the number of pings from a Hybrid Worker and can help you check for ping-related issues.
60+
61+
### Scenario: Job was suspended as it exceeded the job limit for a Hybrid Worker
62+
63+
#### Issue
64+
Job gets suspended with the following error message:
65+
66+
*Job was suspended as it exceeded the job limit for a Hybrid Worker. Add more Hybrid Workers to the Hybrid Worker group to overcome this issue.*
67+
68+
#### Cause
69+
Jobs might get suspended due to any of the following reasons:
70+
- Each active Hybrid Worker in the group polls the Automation service every 30 seconds to see if any jobs are available. Workers pick up jobs on a first-come, first-served basis: whichever Hybrid Worker within the Hybrid Worker Group pings the Automation service first after a job is pushed picks up that job. A single Hybrid Worker can generally pick up four jobs per ping (that is, every 30 seconds). If you push jobs at a rate higher than four per 30 seconds and no other Worker picks up the job, the job might get suspended.
71+
- The Hybrid Worker might not be polling every 30 seconds as expected. This can happen if the Worker isn't healthy or if there are network issues.
72+
73+
#### Resolution
74+
- If jobs are pushed to a Hybrid Worker faster than the limit of four jobs per 30 seconds, add more Hybrid Workers to the Hybrid Worker group for high availability and load balancing. You can also schedule jobs so that they don't exceed the limit of four jobs per 30 seconds. The time needed to process the job queue depends on the Hybrid Worker hardware profile and load. Ensure that the Hybrid Worker is healthy and sends a heartbeat.
75+
- Troubleshoot any network issues by checking the Microsoft-SMA event logs on the Workers in the Hybrid Runbook Worker Group that tried to run this job.
76+
- You can also monitor the [HybridWorkerPing](../../azure-monitor/essentials/metrics-supported.md#microsoftautomationautomationaccounts) metric that provides the number of pings from a Hybrid Worker and can help to check ping-related issues.
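
The sizing guidance above can be turned into a rough back-of-the-envelope estimate. The following Python sketch is illustrative only: the function name `min_workers_needed` is hypothetical, and it assumes every Worker is healthy, polls exactly every 30 seconds, and picks up exactly four jobs per poll, which the article describes only as the general case.

```python
import math

# Figures taken from the Cause section above; treat them as approximations.
JOBS_PER_POLL = 4            # a Worker can generally pick up four jobs per ping
POLL_INTERVAL_SECONDS = 30   # each active Worker polls every 30 seconds


def min_workers_needed(jobs_submitted: int, window_seconds: float) -> int:
    """Estimate how many Hybrid Workers a group needs so that jobs
    submitted within `window_seconds` can all be picked up in that window.

    Hypothetical helper for illustration; it is not part of any Azure SDK.
    """
    if jobs_submitted < 0 or window_seconds <= 0:
        raise ValueError("jobs_submitted must be >= 0 and window_seconds > 0")

    # Number of polls one Worker completes in the window.
    polls_per_worker = window_seconds / POLL_INTERVAL_SECONDS
    # Jobs one Worker can pick up in the window.
    capacity_per_worker = JOBS_PER_POLL * polls_per_worker
    # Always keep at least one Worker in the group.
    return max(1, math.ceil(jobs_submitted / capacity_per_worker))


if __name__ == "__main__":
    # Ten jobs pushed within a single 30-second polling window.
    print(min_workers_needed(10, 30))
    # Twenty jobs spread over one minute.
    print(min_workers_needed(20, 60))
```

The estimate covers only the pick-up rate. It says nothing about how long each job runs, which depends on the Hybrid Worker hardware profile and load.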
77+
4378
### Scenario: Hybrid Worker deployment fails with Private Link error
4479

4580
#### Issue

articles/automation/troubleshoot/hybrid-runbook-worker.md

Lines changed: 42 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,7 +2,7 @@
22
title: Troubleshoot agent-based Hybrid Runbook Worker issues in Azure Automation
33
description: This article tells how to troubleshoot and resolve issues that arise with Azure Automation agent-based Hybrid Runbook Workers.
44
services: automation
5-
ms.date: 03/15/2023
5+
ms.date: 04/26/2023
66
ms.topic: troubleshooting
77
---
88

@@ -55,6 +55,47 @@ The Hybrid Runbook Worker jobs failed to refresh when communicating through a Lo
5555

5656
Verify the Log Analytics Gateway server is online and is accessible from the machine hosting the Hybrid Runbook Worker role. For additional troubleshooting information, see [Troubleshoot Log Analytics Gateway](../../azure-monitor/agents/gateway.md#troubleshooting).
5757

58+
59+
### Scenario: Job failed to start as the Hybrid Worker was not available when the scheduled job started
60+
61+
#### Issue
62+
Job fails to start on a Hybrid Worker and you see the following error:
63+
64+
*Failed to start, as hybrid worker was not available when scheduled job started, the hybrid worker was last active at mm/dd/yyyy*.
65+
66+
#### Cause
67+
This error can occur due to the following reasons:
68+
- The machine doesn't exist anymore.
69+
- The machine is turned off and is unreachable.
70+
- The machine has a network connectivity issue.
71+
- The Hybrid Runbook Worker extension has been uninstalled from the machine.
72+
73+
#### Resolution
74+
- Ensure that the machine exists and that the Hybrid Runbook Worker extension is installed on it. The Hybrid Worker should be healthy and should send a heartbeat. Troubleshoot any network issues by checking the Microsoft-SMA event logs on the Workers in the Hybrid Runbook Worker Group that tried to run this job.
75+
- You can also monitor the [HybridWorkerPing](../../azure-monitor/essentials/metrics-supported.md#microsoftautomationautomationaccounts) metric, which provides the number of pings from a Hybrid Worker and can help you check for ping-related issues.
76+
77+
### Scenario: Job was suspended as it exceeded the job limit for a Hybrid Worker
78+
79+
#### Issue
80+
Job gets suspended with the following error message:
81+
82+
*Job was suspended as it exceeded the job limit for a Hybrid Worker. Add more Hybrid Workers to the Hybrid Worker group to overcome this issue.*
83+
84+
#### Cause
85+
Jobs might get suspended due to any of the following reasons:
86+
- Each active Hybrid Worker in the group polls the Automation service every 30 seconds to see if any jobs are available. Workers pick up jobs on a first-come, first-served basis: whichever Hybrid Worker within the Hybrid Worker Group pings the Automation service first after a job is pushed picks up that job. A single Hybrid Worker can generally pick up four jobs per ping (that is, every 30 seconds). If you push jobs at a rate higher than four per 30 seconds and no other Worker picks up the job, the job might get suspended.
87+
- The Hybrid Worker might not be polling every 30 seconds as expected. This can happen if the Worker isn't healthy or if there are network issues.
88+
89+
#### Resolution
90+
- If jobs are pushed to a Hybrid Worker faster than the limit of four jobs per 30 seconds, add more Hybrid Workers to the Hybrid Worker group for high availability and load balancing. You can also schedule jobs so that they don't exceed the limit of four jobs per 30 seconds. The time needed to process the job queue depends on the Hybrid Worker hardware profile and load. Ensure that the Hybrid Worker is healthy and sends a heartbeat.
91+
- Troubleshoot any network issues by checking the Microsoft-SMA event logs on the Workers in the Hybrid Runbook Worker Group that tried to run this job.
92+
- You can also monitor the [HybridWorkerPing](../../azure-monitor/essentials/metrics-supported.md#microsoftautomationautomationaccounts) metric that provides the number of pings from a Hybrid Worker and can help to check ping-related issues.
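
The advice to schedule jobs so that they stay within four jobs per 30 seconds can be sketched as a simple staggering calculation. The Python example below is a minimal illustration, not an Azure Automation API: the function name `stagger_start_times` and its parameters are hypothetical, and it assumes each Worker in the group picks up exactly four jobs per 30-second poll.

```python
from datetime import datetime, timedelta
from typing import List

# Figures taken from the Cause section above; treat them as approximations.
JOBS_PER_POLL = 4            # a Worker can generally pick up four jobs per ping
POLL_INTERVAL_SECONDS = 30   # each active Worker polls every 30 seconds


def stagger_start_times(first_start: datetime, job_count: int,
                        workers: int = 1) -> List[datetime]:
    """Return one start time per job so that no 30-second window receives
    more jobs than the group can pick up in a single polling round.

    Hypothetical helper for illustration; it only computes timestamps and
    does not create schedules in Azure Automation.
    """
    if job_count < 0 or workers < 1:
        raise ValueError("job_count must be >= 0 and workers >= 1")

    # Jobs the whole group can pick up in one polling interval.
    jobs_per_window = JOBS_PER_POLL * workers
    return [
        # Job i goes into window i // jobs_per_window.
        first_start + timedelta(
            seconds=POLL_INTERVAL_SECONDS * (i // jobs_per_window))
        for i in range(job_count)
    ]


if __name__ == "__main__":
    start = datetime(2023, 4, 26, 9, 0, 0)
    # Six jobs on a single Worker: four in the first window, two in the next.
    for when in stagger_start_times(start, 6):
        print(when.isoformat())
```

The resulting timestamps can then be used as start times when you define schedules for the runbooks. Actual pick-up still depends on the Worker being healthy and polling as expected.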
93+
94+
95+
96+
97+
98+
5899
### <a name="cannot-connect-signalr"></a>Scenario: Event 15011 in the Hybrid Runbook Worker
59100

60101
#### Issue
