|
| 1 | +--- |
| 2 | +title: Troubleshooting Hyper-V virtual machine start, state, and access failures in clustered and standalone environments |
| 3 | +description: Provides a detailed troubleshooting guide to help you resolve issues related to Hyper-V virtual machines (VMs) that fail to start, become stuck in transitional states, or become inaccessible in both clustered and standalone environments. |
| 4 | +ms.date: 08/22/2025 |
| 5 | +manager: dcscontentpm |
| 6 | +audience: itpro |
| 7 | +ms.topic: troubleshooting |
| 8 | +ms.reviewer: kaushika, jeffhugh, v-lianna |
| 9 | +ms.custom: |
| 10 | +- sap:virtualization and hyper-v\virtual machine state |
| 11 | +- pcy:WinComm Storage High Avail |
| 12 | +--- |
| 13 | +# Troubleshooting Hyper-V virtual machine start, state, and access failures in clustered and standalone environments |
| 14 | + |
| 15 | +This article provides a detailed troubleshooting guide to help you resolve issues related to Hyper-V virtual machines (VMs) that fail to start, become stuck in transitional states (such as starting, stopping, saved, or paused), or become inaccessible in both clustered and standalone environments. Common causes include VM configuration file corruption, storage or network problems, process lockups, checkpoint or AVHDX (automatic virtual hard disk) issues, and permission or driver errors. Timely identification and resolution of these problems are essential to minimize VM downtime, prevent business disruption, and avoid data loss in production environments. |
| 16 | + |
| 17 | +## Symptoms |
| 18 | + |
| 19 | +You may encounter various symptoms when dealing with Hyper-V VM issues, including the following: |
| 20 | + |
| 21 | +### End-user and technical symptoms |
| 22 | + |
| 23 | +- VMs fail to start or power on in Hyper-V Manager or Failover Cluster Manager. |
| 24 | +- VMs are stuck in states like "starting," "stopping," "saved-critical," "paused," or "restoring." |
| 25 | +- VMs are missing or not visible in Hyper-V Manager or the output of Get-VM. |
| 26 | +- VM states are displayed as "running critical," "stopping," or "online pending." |
| 27 | +- VM consoles are inaccessible, and remote desktop connections are unavailable. |
| 28 | +- VMs fail to migrate successfully between cluster nodes. |
| 29 | +- Hyper-V Manager or Failover Cluster Manager cannot change VM states or report their status. |
| 30 | +- VMMS (virtual machine management service) or VMM services are stuck in a "Stopping" state. |
| 31 | +- Storage volumes, such as Cluster Shared Volumes (CSVs), appear as RAW or offline, and VHDX files are inaccessible or locked. |
| 32 | + |
| 33 | +### Error messages, event logs, and codes |
| 34 | + |
| 35 | +- “A virtual machine or container with the specified identifier already exists in Hyper-V.” |
| 36 | +- “Failed to start worker process: Catastrophic failure 0x8000FFFF.” |
| 37 | +- “Virtual machine failed to generate VHD tree: The system cannot find the file specified (0x80070002).” |
| 38 | +- “The process cannot access the file because it is being used by another process.” |
| 39 | +- “Failed to perform the Cleaning up stale reference point(s) operation. The virtual machine is currently performing: Turning Off.” |
| 40 | +- “The file or directory is corrupted and unreadable. (0x80070570)” |
| 41 | +- Event IDs: 21502, 1069, 1205, 5120, 1135, 225, 15500, 1793, 1795, 7034, 7031, 7036, 16300, 14102, 4092, 18012, 18016, 20848, 20864, 12620, 12240, 153, 20848, 18524, 1146, 1230. |
| 42 | +- Cluster resources stuck in "online pending" or "failed" states. |
| 43 | +- VMs are unavailable after patching, host restarts, or storage and network events. |
| 44 | + |
| 45 | +## Cause |
| 46 | + |
| 47 | +Hyper-V VM failures may originate from several root causes categorized as follows: |
| 48 | + |
| 49 | +### 1. Configuration and metadata corruption |
| 50 | + |
| 51 | +- Corrupt or missing VM configuration files (e.g., .VMCX, .XML) prevent the VM from being recognized or started by Hyper-V, often after failed migrations, storage issues, or abrupt shutdowns. |
| 52 | +- Checkpoint (AVHDX) chain corruption or missing differencing disks prevent the VM from starting. |
| 53 | +- Orphaned checkpoints, incomplete merges, or invalid entries in configuration files block VM operations. |
| 54 | +- Duplicate VM GUIDs or object entries, particularly with System Center Virtual Machine Manager (SCVMM), can cause "already exists" errors and prevent VM imports or starts. |
| 55 | + |
| 56 | +### 2. Storage and file system issues |
| 57 | + |
| 58 | +- CSVs or volumes are offline, RAW, or inaccessible due to storage subsystem failures, disk corruption, or drive letter conflicts. |
| 59 | +- VHD or VHDX files are locked or in use by another process, such as a backup or antivirus program. |
| 60 | +- Missing or corrupted VM runtime state files (VMRS) impede VM operations. |
| 61 | +- BitLocker-locked disks prevent VMs from starting after patching or reboots. |
| 62 | + |
| 63 | +### 3. Process and service lockups |
| 64 | + |
| 65 | +- Stale VMWP (VM Worker Process) or VMMS (VM Management Service) processes are stuck due to storage or network issues or deadlocks. |
| 66 | +- Failed attempts to terminate VM processes via Task Manager, taskkill, or Process Explorer persist due to kernel or resource locks. |
| 67 | + |
| 68 | +### 4. Permissions, security, and driver problems |
| 69 | + |
| 70 | +- Permissions issues restrict the Hyper-V service account from accessing VM files or folders. |
| 71 | +- Antivirus or third-party filter drivers interfere with Hyper-V, blocking file access or causing merge failures. |
| 72 | +- Outdated or misconfigured storage or network drivers lead to connectivity loss or failover events. |
| 73 | + |
| 74 | +### 5. Cluster, network, and failover issues |
| 75 | + |
| 76 | +- CSV or network communication failures, such as cluster node isolation, result in mass VM failovers or reboots. |
| 77 | +- Improper cluster configurations or inconsistent patching across nodes cause instability. |
| 78 | +- Live migration or failover failures occur due to insufficient memory, incompatible settings, or node misconfigurations. |
| 79 | + |
| 80 | +## Resolution |
| 81 | + |
| 82 | +To resolve these issues, follow the troubleshooting steps below: |
| 83 | + |
| 84 | +### A. Initial checks |
| 85 | + |
| 86 | +1. Identify error messages, event IDs, and affected VMs using Hyper-V Manager, Failover Cluster Manager, or PowerShell. |
| 87 | +2. Review system logs, Hyper-V logs, and cluster event logs for relevant entries. |
| 88 | + |
| 89 | +### B. File system and storage checks |
| 90 | + |
| 91 | +1. Verify storage volumes: |
| 92 | + |
| 93 | + - Use Disk Management or diskpart to ensure volumes are online and properly assigned. |
| 94 | + - If volumes are RAW or missing, reassign drive letters and repair disk corruption using chkdsk: |
| 95 | + |
| 96 | + chkdsk <drive_letter>: /f /r</drive_letter> |
| 97 | +2. Check VM configuration and disk file presence: |
| 98 | + |
| 99 | + - Confirm the existence of .VMCX, .VMRS, .VHDX, and .AVHDX files in the VM folder. |
| 100 | + - For missing or corrupt configuration files, rebuild the VM using existing VHDX files or restore from a backup. |
| 101 | + - For missing or corrupt AVHDX files: |
| 102 | + |
| 103 | + Set-VHD -Path <vhdx path> -ParentPath <parent vhdx path> -IgnoreIDMismatch</parent></vhdx> |
| 104 | + - If BitLocker is enabled, unlock the disk: |
| 105 | + |
| 106 | + manage-bde -unlock D: -RecoveryPassword <yourrecoverypassword></yourrecoverypassword> |
| 107 | + - For locked or in-use files, use Process Explorer to identify and terminate the locking process, or reboot the host to release the lock. |
| 108 | + |
| 109 | +### C. Process and service recovery |
| 110 | + |
| 111 | +1. If the VM is stuck in transitional states: |
| 112 | + |
| 113 | + - End the VM process: |
| 114 | + |
| 115 | + taskkill /PID <pid> /F</pid> |
| 116 | + - Restart VMMS or the host if processes remain stuck. |
| 117 | +2. Remove saved states or checkpoints: |
| 118 | + |
| 119 | + Get-VMSnapshot <vmname> | Remove-VMSavedState<br>Remove-VMSavedState <vmname></vmname></vmname> |
| 120 | + |
| 121 | +### D. Permission and security configuration |
| 122 | + |
| 123 | +1. Ensure the Hyper-V service account has full control over VM files and folders. |
| 124 | +2. Apply antivirus exclusions as per Microsoft’s Hyper-V documentation. |
| 125 | +3. Identify and unload problematic filter drivers: |
| 126 | + |
| 127 | + fltmc |
| 128 | +fltmc unload <drivername></drivername> |
| 129 | + |
| 130 | +### E. Cluster and network remediation |
| 131 | + |
| 132 | +1. Validate cluster health and configuration using the cluster validation wizard or: |
| 133 | + |
| 134 | + Test-Cluster |
| 135 | +2. Resolve network issues by reviewing Event IDs (e.g., 5120, 1135) and adjusting parameters: |
| 136 | + |
| 137 | + (Get-Cluster).SameSubnetThreshold = <value></value> |
| 138 | +3. Ensure consistent patching and proper network/storage configurations across nodes. |
| 139 | + |
| 140 | +### F. VM configuration repairs and rebuilds |
| 141 | + |
| 142 | +1. For corrupt configuration files, edit the .VMCX file or create a new VM with existing disks. |
| 143 | +2. Address saved state or checkpoint issues by removing invalid checkpoints or reattaching disks. |
| 144 | + |
| 145 | +### G. Escalation and bug reference |
| 146 | + |
| 147 | +1. If known bugs or product defects are involved (e.g., UEFI firmware bugs or cluster communication issues), review vendor advisories and apply recommended updates or fixes. |
| 148 | + |
| 149 | +## Data collection |
| 150 | + |
| 151 | +Gather the following logs and diagnostics to assist with troubleshooting: |
| 152 | + |
| 153 | +- Hyper-V event logs: |
| 154 | + |
| 155 | + Get-WinEvent -LogName Microsoft-Windows-Hyper-V-VMMS-Admin | Export-Csv -Path <unc path></unc> |
| 156 | +- Cluster logs: |
| 157 | + |
| 158 | + Get-ClusterLog -UseLocalTime -Destination <folder></folder> |
| 159 | +- Process dumps for stuck services: |
| 160 | + |
| 161 | + procdump -ma <pid> <output_path></output_path></pid> |
| 162 | + |
| 163 | +## References |
| 164 | + |
| 165 | +For additional details, consult the following resources: |
| 166 | + |
| 167 | +- [Hyper-V performance tuning guide](https://learn.microsoft.com/en-us/windows-server/virtualization/hyper-v) |
| 168 | +- [Failover cluster troubleshooting](https://learn.microsoft.com/en-us/windows-server/failover-clustering) |
0 commit comments