Skip to content

Commit a020233

Browse files
author
Simonx Xu
authored
Merge pull request #9637 from v-lianna/CI_6878
2 parents f173b70 + 443ad94 commit a020233

File tree

2 files changed

+214
-0
lines changed

2 files changed

+214
-0
lines changed

support/windows-server/toc.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2800,6 +2800,8 @@ items:
28002800
items:
28012801
- name: 'Troubleshooting guidance: Live Migration'
28022802
href: ./virtualization/troubleshoot-live-migration-guidance.md
2803+
- name: 'Troubleshooting guide: Hyper-V virtual machine live migration'
2804+
href: ./virtualization/hyper-v-virtual-machine-live-migration.md
28032805
- name: Can't migrate a virtual machine from one host to another
28042806
href: ./virtualization/unable-migrate-virtual-machine-processor-not-compatible-host.md
28052807
- name: Starting or live migrating Hyper-V VMs fails
Lines changed: 212 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,212 @@
1+
---
2+
title: Hyper-V Virtual Machine Live Migration Troubleshooting Guide
3+
description: Provides a comprehensive troubleshooting process, detailed solutions for common migration issues, structured data collection procedures, and quick reference resources.
4+
ms.date: 09/05/2025
5+
manager: dcscontentpm
6+
audience: itpro
7+
ms.topic: troubleshooting
8+
ms.reviewer: kaushika, jeffhugh, v-lianna
9+
ms.custom:
10+
- sap:virtualization and hyper-v\migration
11+
- pcy:WinComm Storage High Avail
12+
---
13+
# Troubleshooting guide: Hyper-V virtual machine live migration
14+
15+
Live migration in Hyper-V allows virtual machines (VMs) to be moved between hosts with minimal downtime—a critical feature for high availability and maintenance in Windows Server environments. Despite its capabilities, live migration can fail due to a wide array of factors—hardware incompatibility, authentication, network configuration, VM settings, and storage issues. Properly diagnosing and resolving live migration failures is vital for business continuity, especially in clustered and production settings. This guide provides a comprehensive troubleshooting process, detailed solutions for common migration issues, structured data collection procedures, and quick reference resources.
16+
17+
## Troubleshooting checklist
18+
19+
Use this checklist as a stepwise process before and during troubleshooting:
20+
21+
1. Baseline health checks
22+
23+
- Confirm VM integration services are up-to-date.
24+
- Ensure all hosts are fully patched and running supported Windows Server versions.
25+
- Confirm the VM is in a supported state for migration (for example, not in "Backing up" or "Stopping").
26+
27+
1. Cluster and host configuration
28+
29+
- Check cluster node/host compatibility (CPU, BIOS/firmware versions, configuration versions).
30+
- Verify all cluster nodes are online, healthy, and visible in Failover Cluster Manager.
31+
- Validate VM configuration version consistency across hosts.
32+
33+
1. Network and storage
34+
35+
- Confirm live migration, management, and storage networks are correctly configured and reachable.
36+
- Ensure the VM storage is accessible on the destination host.
37+
- Check firewall rules and port accessibility (for example, UDP 3343 for WSFC).
38+
39+
1. Authentication or permissions
40+
41+
- Verify Kerberos or CredSSP is enabled and delegated properly for live migrations.
42+
- Ensure the required Service Principal Names (SPNs) are registered.
43+
- Confirm permission levels of migration accounts.
44+
45+
1. Virtual switches or networks
46+
47+
- Ensure the required VM switches exist and are identically configured across hosts.
48+
- Validate network teaming (SET or LBFO) consistency.
49+
50+
1. VM-specific features
51+
52+
- If using vTPM/Shielded VMs, confirm certificate requirements between hosts.
53+
- Check for snapshots/checkpoints—merge or delete as appropriate.
54+
55+
Here are common issues and their respective resolutions:
56+
57+
## Hardware or CPU incompatibility
58+
59+
- Error message:
60+
61+
> The virtual machine uses processor-specific features not supported on the physical computer
62+
- Event ID 21502
63+
64+
### Resolution
65+
66+
- In Hyper-V Manager, set the VM's processor settings. Enable **Migrate to a physical computer with a different processor version**.
67+
- Ensure the target host supports the VM's CPU features.
68+
69+
> [!TIP]
70+
> Always start VMs for the first time on the oldest (least capable) CPU host.
71+
72+
## VM configuration version mismatch
73+
74+
- Migration works one way but not in reverse (especially after moving to a newer OS).
75+
- Error message:
76+
77+
> Live migration or quick migration fails with compatibility issues.
78+
- Event IDs 10698 and 21502
79+
80+
### Resolution
81+
82+
- Upgrade the VM configuration version when moving to a newer host (Hyper-V Manager: **Action** > **Upgrade VM Configuration Version**).
83+
84+
> [!NOTE]
85+
> This action can't be rolled back. VMs with the upgraded version can't be migrated back to older hosts.
86+
87+
- Verify the configuration version with the `Get-VM * | select Name, Version` cmdlet.
88+
89+
## Network connectivity or configuration issues
90+
91+
- Live migration fails before/during the transfer.
92+
- Migration fails with: "The client cannot connect to the destination specified in the request" or "WinRM protocol errors."
93+
- Event IDs: 20406 and 280
94+
95+
### Resolution
96+
97+
- Ensure hosts can resolve and reach each other via hostname/IP (test with `ping`).
98+
- Use `winrm quickconfig` to verify/configure WinRM on both hosts.
99+
- Update TrustedHosts as needed: `Set-Item WSMan:\localhost\Client\TrustedHosts -Value "<Host1>,<Host2>"`.
100+
- Verify firewall allows the necessary ports (default SMB, WinRM, and clustering ports).
101+
- Match network configurations (switch names/types, teaming) across hosts.
102+
103+
## Authentication or delegation issues
104+
105+
- Error code: "General access denied error (0x80070005)"
106+
- Errors 0x8009030E and 0x8009030D (delegation/Kerberos/SPN issues)
107+
108+
### Resolution
109+
110+
- Enable Kerberos authentication: **Hyper-V Settings** > **Live Migrations** > **Advanced** > **Authentication protocol: Kerberos**.
111+
- Configure constrained delegation in Active Directory (AD): **Computer Properties** > **Delegation** tab > **Trust this computer for delegation to specified services only** > **Add cifs and Microsoft Virtual System Migration Service**.
112+
- Register missing SPNs: `setspn -s Microsoft Virtual System Migration Service/<FQDN> <ComputerName>`.
113+
- Purge old Kerberos tickets: `KLIST PURGE -li 0x3e7`.
114+
115+
## Storage or shared disk issues
116+
117+
- Migration fails for VMs with shared VHDX/shared disks.
118+
- Error: Shared disk missing in migration options.
119+
120+
### Resolution
121+
122+
- Shared virtual disks can only be used in clustered VM roles and can't be migrated using standard methods.
123+
- Manually move and reattach shared disks at the destination where required.
124+
125+
## VTPM or shielded VM certificate issues
126+
127+
- Error: "The key protector for the virtual machine could not be unwrapped."
128+
- Migration fails only for vTPM-protected VMs.
129+
130+
### Resolution
131+
132+
- Export shielding or key protector certificates from the source host and import them on the destination.
133+
- Windows certificates snap-in (**certmgr.msc**) can be used.
134+
- PowerShell cmdlets: `Export-PfxCertificate` and `Import-PfxCertificate`.
135+
136+
## State file or checkpoint issues
137+
138+
- Error message:
139+
140+
> Cannot restore this virtual machine because the saved state data cannot be read. Delete the saved state data and then try to start the virtual machine. (0xC0370027)
141+
- VMs are stuck in the "Backing up" state.
142+
143+
### Resolution
144+
145+
- In Hyper-V Manager: Select **VM** > **Delete Saved State**.
146+
- Manually delete `.bin` and `.vsv` files from the VM folder if necessary.
147+
- Remove or merge corrupted checkpoints.
148+
149+
## Cluster or migration limit issues
150+
151+
- Error message:
152+
153+
> Virtual machine migration limit 1 was reached, please wait for completion of an ongoing migration operation.
154+
- VMs enter the "Saved" state during a node drain or reboot.
155+
156+
### Resolution
157+
158+
- Increase Simultaneous Migrations: **Hyper-V Settings** > **Live Migrations** > **Simultaneous Migrations**.
159+
- PowerShell cmdlet:
160+
161+
```powershell
162+
Set-VMHost -MaximumVirtualMachineMigrations N
163+
```
164+
165+
- Always drain roles before rebooting with `Suspend-ClusterNode -Name <NodeName> -Drain`.
166+
167+
## Code defects or bugs
168+
169+
VMs are stuck in the "Stopping" state after migration; orphaned vports are reported in live dump analysis.
170+
171+
### Resolution
172+
173+
- Update to the latest Windows Server Cumulative Update.
174+
- If an upgrade isn't possible, request Microsoft support for a hotfix if the bug is referenced.
175+
176+
## Data collection
177+
178+
Standard data collection checklist:
179+
180+
- Event logs: `Get-WinEvent -LogName "Microsoft-Windows-Hyper-V-VMMS/Admin"`.
181+
- Cluster logs: `Get-ClusterLog -UseLocalTime -Destination <Path>`.
182+
- Hyper-V migration logs: `Get-WinEvent -LogName Microsoft-Windows-Hyper-V-Worker-Admin`.
183+
- Network diagnostics: `Get-NetAdapter`, `ipconfig /all`, and ensure all network paths are up.
184+
- Delegation and SPN checks: run `setspn -L <HostName>`, and check the **Delegation** tab in AD.
185+
- VM and system configuration: `Get-VM * | select Name, Version`, `Get-VMSwitch`, and `Get-VMProcessor -VMName <VMName>`.
186+
- WinRM configuration: `winrm quickconfig Set-Item WSMan:\localhost\Client\TrustedHosts -Value "<Host1>,<Host2>"`.
187+
- PowerShell to gather live migration traces: `TSS.ps1 -CollectLog SHA_support-all -noBasicLog -noUpdate -NewSession`.
188+
- Snapshot or checkpoint chain: `Get-VHDChain -Path <PathToVHD>`.
189+
190+
## Common issue quick reference table
191+
192+
| Symptom or error | Possible root cause | Resolution summary |
193+
| --- | --- | --- |
194+
| "The VM uses processor-specific features not supported..." | CPU feature/BIOS/firmware mismatch | Enable migration compatibility; check the host CPU features and BIOS. |
195+
| VM migration fails with a version error. | Configuration version mismatch | Upgrade the VM configuration version on the new host; can't migrate back to the old host. |
196+
| "General access denied error (0x80070005)," 0x8009030E/0x8009030D | SPN/delegation/Kerberos misconfiguration | Configure constrained delegation, register SPNs, set Kerberos, and check AD. |
197+
| "The server does not support WS-Management Identify operations..." | WinRM/proxy/trustedhosts misconfiguration | Set WinRM/trusted hosts, update hosts files, and bypass the proxy. |
198+
| "Cannot restore this virtual machine because the saved state data..." | Corrupt checkpoints/saved state files | Delete the "Saved" state, remove/merge checkpoints, and manually delete .bin/.vsv files. |
199+
| vTPM-enabled VM fails with "The key protector... could not be unwrapped." | Certificate/Shielded VM configuration | Export/import key protector certificates. |
200+
| Migration fails—the shared disk isn't available in the migration wizard. | Unsupported scenario | Manually reattach disks; follow the shared disk migration documentation. |
201+
| Live migration limit reached; VMs enter the "Saved" state in draining. | Low concurrency settings; improper drain | Raise the migration limit; drain roles before rebooting. |
202+
| "No matching virtual switch found..." | Inconsistent network switch configuration | Ensure the identical virtual switch configuration on all hosts. |
203+
| Live migration events show failures post-update. | Patch/Firmware/Speculation settings | Ensure all nodes are updated/patched; check speculation control settings. |
204+
| VM backs up forever and can't move in/out of the "Backing up" state. | Backup software lock | Restart the backup service and VMMS service, or power on the VM to merge checkpoints. |
205+
206+
Ensuring successful Hyper-V live migrations requires diligent configuration management—including CPU, firmware, network, authentication, storage, and permissions—across all participating hosts. By following the troubleshooting checklist, addressing the known root causes as outlined in the resolution sections, and using targeted data collection, most migration failures can be swiftly resolved or appropriately escalated. Staying current with patches, standardizing cluster configurations, and understanding log outputs minimizes downtime and operational risk for mission-critical VM workloads.
207+
208+
## References
209+
210+
- WinRM and `TrustedHosts` configuration: [Installation and configuration for Windows Remote Management](/windows/win32/winrm/installation-and-configuration-for-windows-remote-management)
211+
- Cluster Shared Volumes and migration limits: [Cluster Shared Volumes overview](/windows-server/failover-clustering/failover-cluster-csvs)
212+
- SCVMM VMware-to-Hyper-V conversion: [Convert a VMware VM to Hyper-V in the VMM fabric](/system-center/vmm/vm-convert-vmware)

0 commit comments

Comments
 (0)