You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/batch/batch-pool-node-error-checking.md
+11-2Lines changed: 11 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -5,7 +5,7 @@ services: batch
5
5
ms.service: batch
6
6
author: mscurrell
7
7
ms.author: markscu
8
-
ms.date: 9/25/2018
8
+
ms.date: 05/28/2019
9
9
ms.topic: conceptual
10
10
---
11
11
@@ -79,19 +79,28 @@ You can specify one or more application packages for a pool. Batch downloads the
79
79
80
80
The node [errors](https://docs.microsoft.com/rest/api/batchservice/computenode/get#computenodeerror) property reports a failure to download and uncompress an application package. Batch sets the node state to **unusable**.
81
81
82
+
### Container download failure
83
+
84
+
You can specify one or more container references on a pool. Batch downloads the specified containers to each node. The node [errors](https://docs.microsoft.com/rest/api/batchservice/computenode/get#computenodeerror) property reports a failure to download a container and sets the node state to **unusable**.
85
+
82
86
### Node in unusable state
83
87
84
88
Azure Batch might set the [node state](https://docs.microsoft.com/rest/api/batchservice/computenode/get#computenodestate) to **unusable** for many reasons. With the node state set to **unusable**, tasks can't be scheduled to the node, but it still incurs charges.
85
89
86
-
Batch always tries to recover unusable nodes, but recovery may or may not be possible depending on the cause.
90
+
Nodes in an **unsuable**, but without [errors](https://docs.microsoft.com/rest/api/batchservice/computenode/get#computenodeerror) state means that Batch is unable to communicate with the VM. In this case, Batch always tries to recover the VM. Batch will not automatically attempt to recover VMs which failed to install application packages or containers even though their state is **unusable**.
87
91
88
92
If Batch can determine the cause, the node [errors](https://docs.microsoft.com/rest/api/batchservice/computenode/get#computenodeerror) property reports it.
89
93
90
94
Additional examples of causes for **unusable** nodes include:
91
95
92
96
- A custom VM image is invalid. For example, an image that's not properly prepared.
97
+
93
98
- A VM is moved because of an infrastructure failure or a low-level upgrade. Batch recovers the node.
94
99
100
+
- A VM image has been deployed on hardware which doesn’t support it. For example an “HPC” VM image running on non-HPC hardware. For example, trying to run a CentOS HPC image on a [Standard_D1_v2](../virtual-machines/linux/sizes-general.md#dv2-series) VM.
101
+
102
+
- The VMs are in an [Azure virtual network](batch-virtual-network.md), and traffic has been blocked to key ports.
103
+
95
104
### Node agent log files
96
105
97
106
The Batch agent process that runs on each pool node can provide log files which might be helpful if you need to contact support about a pool node issue. Log files for a node can be uploaded via the Azure portal, Batch Explorer, or an [API](https://docs.microsoft.com/rest/api/batchservice/computenode/uploadbatchservicelogs). It's useful to upload and save the log files. Afterward, you can delete the node or pool to save the cost of the running nodes.
0 commit comments