Skip to content

Commit ee082da

Browse files
committed
Create hyper-v-start-state-access-failures-clustered-standalone.md
1 parent 01a0c60 commit ee082da

File tree

1 file changed

+168
-0
lines changed

1 file changed

+168
-0
lines changed
Lines changed: 168 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,168 @@
1+
---
2+
title: Troubleshooting Hyper-V virtual machine start, state, and access failures in clustered and standalone environments
3+
description: Provides a detailed troubleshooting guide to help you resolve issues related to Hyper-V virtual machines (VMs) that fail to start, become stuck in transitional states, or become inaccessible in both clustered and standalone environments.
4+
ms.date: 08/22/2025
5+
manager: dcscontentpm
6+
audience: itpro
7+
ms.topic: troubleshooting
8+
ms.reviewer: kaushika, jeffhugh, v-lianna
9+
ms.custom:
10+
- sap:virtualization and hyper-v\virtual machine state
11+
- pcy:WinComm Storage High Avail
12+
---
13+
# Troubleshooting Hyper-V virtual machine start, state, and access failures in clustered and standalone environments
14+
15+
This article provides a detailed troubleshooting guide to help you resolve issues related to Hyper-V virtual machines (VMs) that fail to start, become stuck in transitional states (such as starting, stopping, saved, or paused), or become inaccessible in both clustered and standalone environments. Common causes include VM configuration file corruption, storage or network problems, process lockups, checkpoint or AVHDX (automatic virtual hard disk) issues, and permission or driver errors. Timely identification and resolution of these problems are essential to minimize VM downtime, prevent business disruption, and avoid data loss in production environments.
16+
17+
## Symptoms
18+
19+
You may encounter various symptoms when dealing with Hyper-V VM issues, including the following:
20+
21+
### End-user and technical symptoms
22+
23+
- VMs fail to start or power on in Hyper-V Manager or Failover Cluster Manager.
24+
- VMs are stuck in states like "starting," "stopping," "saved-critical," "paused," or "restoring."
25+
- VMs are missing or not visible in Hyper-V Manager or the output of Get-VM.
26+
- VM states are displayed as "running critical," "stopping," or "online pending."
27+
- VM consoles are inaccessible, and remote desktop connections are unavailable.
28+
- VMs fail to migrate successfully between cluster nodes.
29+
- Hyper-V Manager or Failover Cluster Manager cannot change VM states or report their status.
30+
- VMMS (virtual machine management service) or VMM services are stuck in a "Stopping" state.
31+
- Storage volumes, such as Cluster Shared Volumes (CSVs), appear as RAW or offline, and VHDX files are inaccessible or locked.
32+
33+
### Error messages, event logs, and codes
34+
35+
- “A virtual machine or container with the specified identifier already exists in Hyper-V.”
36+
- “Failed to start worker process: Catastrophic failure 0x8000FFFF.”
37+
- “Virtual machine failed to generate VHD tree: The system cannot find the file specified (0x80070002).”
38+
- “The process cannot access the file because it is being used by another process.”
39+
- “Failed to perform the Cleaning up stale reference point(s) operation. The virtual machine is currently performing: Turning Off.”
40+
- “The file or directory is corrupted and unreadable. (0x80070570)”
41+
- Event IDs: 21502, 1069, 1205, 5120, 1135, 225, 15500, 1793, 1795, 7034, 7031, 7036, 16300, 14102, 4092, 18012, 18016, 20848, 20864, 12620, 12240, 153, 20848, 18524, 1146, 1230.
42+
- Cluster resources stuck in "online pending" or "failed" states.
43+
- VMs are unavailable after patching, host restarts, or storage and network events.
44+
45+
## Cause
46+
47+
Hyper-V VM failures may originate from several root causes categorized as follows:
48+
49+
### 1. Configuration and metadata corruption
50+
51+
- Corrupt or missing VM configuration files (e.g., .VMCX, .XML) prevent the VM from being recognized or started by Hyper-V, often after failed migrations, storage issues, or abrupt shutdowns.
52+
- Checkpoint (AVHDX) chain corruption or missing differencing disks prevent the VM from starting.
53+
- Orphaned checkpoints, incomplete merges, or invalid entries in configuration files block VM operations.
54+
- Duplicate VM GUIDs or object entries, particularly with System Center Virtual Machine Manager (SCVMM), can cause "already exists" errors and prevent VM imports or starts.
55+
56+
### 2. Storage and file system issues
57+
58+
- CSVs or volumes are offline, RAW, or inaccessible due to storage subsystem failures, disk corruption, or drive letter conflicts.
59+
- VHD or VHDX files are locked or in use by another process, such as a backup or antivirus program.
60+
- Missing or corrupted VM runtime state files (VMRS) impede VM operations.
61+
- BitLocker-locked disks prevent VMs from starting after patching or reboots.
62+
63+
### 3. Process and service lockups
64+
65+
- Stale VMWP (VM Worker Process) or VMMS (VM Management Service) processes are stuck due to storage or network issues or deadlocks.
66+
- Failed attempts to terminate VM processes via Task Manager, taskkill, or Process Explorer persist due to kernel or resource locks.
67+
68+
### 4. Permissions, security, and driver problems
69+
70+
- Permissions issues restrict the Hyper-V service account from accessing VM files or folders.
71+
- Antivirus or third-party filter drivers interfere with Hyper-V, blocking file access or causing merge failures.
72+
- Outdated or misconfigured storage or network drivers lead to connectivity loss or failover events.
73+
74+
### 5. Cluster, network, and failover issues
75+
76+
- CSV or network communication failures, such as cluster node isolation, result in mass VM failovers or reboots.
77+
- Improper cluster configurations or inconsistent patching across nodes cause instability.
78+
- Live migration or failover failures occur due to insufficient memory, incompatible settings, or node misconfigurations.
79+
80+
## Resolution
81+
82+
To resolve these issues, follow the troubleshooting steps below:
83+
84+
### A. Initial checks
85+
86+
1. Identify error messages, event IDs, and affected VMs using Hyper-V Manager, Failover Cluster Manager, or PowerShell.
87+
2. Review system logs, Hyper-V logs, and cluster event logs for relevant entries.
88+
89+
### B. File system and storage checks
90+
91+
1. Verify storage volumes:
92+
93+
- Use Disk Management or diskpart to ensure volumes are online and properly assigned.
94+
- If volumes are RAW or missing, reassign drive letters and repair disk corruption using chkdsk:
95+
96+
chkdsk <drive_letter>: /f /r</drive_letter>
97+
2. Check VM configuration and disk file presence:
98+
99+
- Confirm the existence of .VMCX, .VMRS, .VHDX, and .AVHDX files in the VM folder.
100+
- For missing or corrupt configuration files, rebuild the VM using existing VHDX files or restore from a backup.
101+
- For missing or corrupt AVHDX files:
102+
103+
Set-VHD -Path <vhdx path> -ParentPath <parent vhdx path> -IgnoreIDMismatch</parent></vhdx>
104+
- If BitLocker is enabled, unlock the disk:
105+
106+
manage-bde -unlock D: -RecoveryPassword <yourrecoverypassword></yourrecoverypassword>
107+
- For locked or in-use files, use Process Explorer to identify and terminate the locking process, or reboot the host to release the lock.
108+
109+
### C. Process and service recovery
110+
111+
1. If the VM is stuck in transitional states:
112+
113+
- End the VM process:
114+
115+
taskkill /PID <pid> /F</pid>
116+
- Restart VMMS or the host if processes remain stuck.
117+
2. Remove saved states or checkpoints:
118+
119+
Get-VMSnapshot <vmname> | Remove-VMSavedState<br>Remove-VMSavedState <vmname></vmname></vmname>
120+
121+
### D. Permission and security configuration
122+
123+
1. Ensure the Hyper-V service account has full control over VM files and folders.
124+
2. Apply antivirus exclusions as per Microsoft’s Hyper-V documentation.
125+
3. Identify and unload problematic filter drivers:
126+
127+
fltmc
128+
fltmc unload <drivername></drivername>
129+
130+
### E. Cluster and network remediation
131+
132+
1. Validate cluster health and configuration using the cluster validation wizard or:
133+
134+
Test-Cluster
135+
2. Resolve network issues by reviewing Event IDs (e.g., 5120, 1135) and adjusting parameters:
136+
137+
(Get-Cluster).SameSubnetThreshold = <value></value>
138+
3. Ensure consistent patching and proper network/storage configurations across nodes.
139+
140+
### F. VM configuration repairs and rebuilds
141+
142+
1. For corrupt configuration files, edit the .VMCX file or create a new VM with existing disks.
143+
2. Address saved state or checkpoint issues by removing invalid checkpoints or reattaching disks.
144+
145+
### G. Escalation and bug reference
146+
147+
1. If known bugs or product defects are involved (e.g., UEFI firmware bugs or cluster communication issues), review vendor advisories and apply recommended updates or fixes.
148+
149+
## Data collection
150+
151+
Gather the following logs and diagnostics to assist with troubleshooting:
152+
153+
- Hyper-V event logs:
154+
155+
Get-WinEvent -LogName Microsoft-Windows-Hyper-V-VMMS-Admin | Export-Csv -Path <unc path></unc>
156+
- Cluster logs:
157+
158+
Get-ClusterLog -UseLocalTime -Destination <folder></folder>
159+
- Process dumps for stuck services:
160+
161+
procdump -ma <pid> <output_path></output_path></pid>
162+
163+
## References
164+
165+
For additional details, consult the following resources:
166+
167+
- [Hyper-V performance tuning guide](https://learn.microsoft.com/en-us/windows-server/virtualization/hyper-v)
168+
- [Failover cluster troubleshooting](https://learn.microsoft.com/en-us/windows-server/failover-clustering)

0 commit comments

Comments
 (0)