Skip to content

Commit 70886ea

Browse files
author
Simonx Xu
authored
Merge pull request #9568 from v-lianna/CI_6875
AB#6875 Create hyper-v-cluster-connectivity-management-configuration.md
2 parents 5448f71 + 045bbee commit 70886ea

File tree

2 files changed

+234
-0
lines changed

2 files changed

+234
-0
lines changed

support/windows-server/toc.yml

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2802,6 +2802,10 @@ items:
28022802
href: ./virtualization/starting-or-live-migrating-hyper-v-vms-fails.md
28032803
- name: Troubleshoot live migration issues
28042804
href: ./virtualization/troubleshoot-live-migration-issues.md
2805+
- name: Remote administration of the Hyper-V role
2806+
items:
2807+
- name: Troubleshoot Hyper-V cluster connectivity, management, and configuration failures
2808+
href: ./virtualization/hyper-v-cluster-connectivity-management-configuration.md
28052809
- name: Shielded VMs
28062810
items:
28072811
- name: Troubleshoot a guarded fabric
Lines changed: 230 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,230 @@
1+
---
2+
title: Troubleshoot Hyper-V Cluster Connectivity, Management, and Configuration Failures
3+
description: Provides a comprehensive guide to troubleshooting various issues related to Hyper-V clusters and management environments on Windows Server 2019 and Windows Server 2022.
4+
ms.date: 09/01/2025
5+
manager: dcscontentpm
6+
audience: itpro
7+
ms.topic: troubleshooting
8+
ms.reviewer: kaushika, jeffhugh, v-lianna
9+
ms.custom:
10+
- sap:virtualization and hyper-v\remote administration of the hyper-v role
11+
- pcy:WinComm Storage High Avail
12+
---
13+
# Troubleshoot Hyper-V cluster connectivity, management, and configuration failures
14+
15+
This article provides a comprehensive guide to troubleshooting various issues related to Hyper-V clusters and management environments on Windows Server 2019, Windows Server 2022, and Windows Server 2025. These issues can manifest as failures in cluster management consoles, virtual machine (VM) migration, remote administration, authentication, storage configuration, and network connectivity. The guide identifies symptoms and root causes, offering actionable resolution steps to address the problems. Effective troubleshooting of these issues is critical to maintaining high availability, business continuity, and compliance in enterprise virtualization environments.
16+
17+
## End-user symptoms
18+
19+
- Inability to connect to Hyper-V Manager or Failover Cluster Manager consoles.
20+
- Failures in remote VM management operations (for example, start, stop, or migrate) from certain hosts or accounts.
21+
- Virtual machines fail to migrate between cluster nodes or become inaccessible after migration or upgrade.
22+
- Hyper-V Manager displays "Loading virtual machines…" or "Connecting to Virtual Machine Management service…".
23+
- Certain user accounts can't perform administrative actions while built-in administrators succeed.
24+
- Backup or cluster operations hang, fail, or report errors.
25+
- VMs report duplicate universally unique identifiers (UUIDs) detected by third-party tools.
26+
27+
## Technical/system symptoms
28+
29+
- Error messages:
30+
31+
- > Cannot connect to the virtual machine. Try to connect again. If the problem persists, contact your system administrator.
32+
- > You do not have the required permission to complete this task. Contact the administrator of the authorization policy for the computer.
33+
- > The operation on computer failed: WinRM process the request. The error code 0x80090311 occurred while using Kerberos authentication.
34+
- > SEC_E_NO_AUTHENTICATING_AUTHORITY: No authority could be contacted for authentication.
35+
- > Start-VM: Hyper-V encountered an error trying to access an object on computer [host] because the object was not found.
36+
- > Get-vm: The paging file is too small for this operation to complete.
37+
38+
- Hyper-V Manager becomes unresponsive or doesn't display any VMs.
39+
- VMConnect fails from certain remote hosts but succeeds from others.
40+
- Netlogon and Domain Name System (DNS) logs show failed domain controller discovery.
41+
- Windows Management Instrumentation (WMI) errors, including **WMIPrvSE.exe** crashes.
42+
- Virtual Machine Management Service (VMMS) hangs or enters a deadlock state.
43+
- Duplicate UUIDs detected for VMs (for example, by vendor extraction tools).
44+
- No error messages, but failures in adding storage, configuring quorum disks, or managing updates.
45+
- Network traces show blocked ports between hosts and domain controllers.
46+
- Event logs and analytic logs might be missing or not generated for affected actions.
47+
48+
The issue occurs because of one or more of the following causes:
49+
50+
## Network and authentication issues
51+
52+
- Firewall blocking: Required ports (for example, UDP/TCP 389 for LDAP, port 2179 for VMConnect) are blocked, disrupting authentication, domain controller discovery, or management traffic.
53+
- DNS/Netlogon failures: UDP pings or DNS queries to domain controllers fail, which breaks Kerberos authentication and remote management.
54+
55+
## Permission and group membership misconfiguration
56+
57+
- Insufficient privileges: Users lack membership in necessary administrative groups (for example, Hyper-V Administrators, Storage Replica Administrators, or Remote Management Users).
58+
- Group Policy restrictions: Policies block credential delegation or Windows Remote Management (WinRM) operations in workgroup or multi-domain environments.
59+
60+
## Configuration and state file corruption
61+
62+
- Corrupt VM files: Corrupted `.VMRS`, `.VMCX`, or configuration files caused by improper shutdowns or service crashes.
63+
- Deadlocks in VMMS: Service deadlocks triggered by pending network operations or driver issues.
64+
65+
## Software/code defects
66+
67+
High availability (HA) VM settings bugs: Known defects in Hyper-V HA settings, especially in Windows Server 2019 and Windows Server 2022, that affect storage management or cluster operations.
68+
69+
## Environmental and update problems
70+
71+
- Outdated operating system (OS): Missing Windows updates cause instability in Hyper-V role installation or operation.
72+
- Backup or third-party software impact: Backup operations or third-party tools cause services (for example, WMI) to crash or malfunction.
73+
74+
## Storage and hardware identifier issues
75+
76+
- Duplicate UUIDs: Duplicate VM UUIDs due to manufacturer settings or cloning.
77+
- Storage configuration gaps: Misconfigured or unrecognized shared storage and storage pools.
78+
79+
## Console/management tool failures
80+
81+
Failover Cluster GUI/console issues: Misconfigurations or duplicate case scenarios prevent the management interface from operating properly.
82+
83+
Here are the resolutions for each scenario respectively:
84+
85+
## Scenario 1: Cluster or Hyper-V Manager console fails to connect or authenticate
86+
87+
1. Verify network connectivity and firewall rules:
88+
89+
1. Ensure UDP/TCP port 389 is open between Hyper-V hosts and domain controllers.
90+
1. Open port 2179 for VMConnect/console access.
91+
1. Use network trace tools to identify blocked traffic.
92+
1. Check DNS and Netlogon settings:
93+
94+
1. Review Netlogon logs for failed domain controller discovery.
95+
1. Verify DNS settings and confirm domain controller reachability.
96+
1. Confirm group memberships and permissions:
97+
98+
1. Ensure users are members of:
99+
- Hyper-V Administrators
100+
- Remote Management Users
101+
- Storage Replica Administrators (if applicable)
102+
1. Remove affected users from the default Users group if necessary.
103+
104+
1. Update Group Policy and WinRM settings:
105+
1. Enable PowerShell remoting by using the following cmdlet:
106+
107+
```powershell
108+
Enable-PSRemoting
109+
```
110+
111+
1. Configure CredSSP for authentication by using the following cmdlet:
112+
113+
```powershell
114+
Enable-WSManCredSSP -Role serverEnable-WSManCredSSP -Role client -DelegateComputer "<Hyper-V host>"
115+
```
116+
117+
1. Adjust trusted hosts and firewall rules:
118+
119+
1. Set trusted hosts for WinRM by using the following cmdlet:
120+
121+
```powershell
122+
Set-Item wsman:localhost\client\trustedhosts <Hyper-V host IP>
123+
```
124+
125+
1. Enable relevant firewall rules:
126+
127+
- Remote Administration
128+
- Remote Desktop
129+
- Remote Volume Management
130+
- Windows Management Instrumentation (WMI)
131+
132+
## Scenario 2: Virtual machine fails to migrate, start, or is inaccessible after upgrade
133+
134+
1. Check storage and network configuration:
135+
1. Verify shared storage and storage pools are configured and accessible from all cluster nodes.
136+
1. Ensure consistent network settings across all nodes.
137+
1. Validate OS updates:
138+
139+
Run Windows Update on all Hyper-V hosts before installing or migrating the Hyper-V role.
140+
141+
1. Investigate VM state and configuration files:
142+
143+
If a VM is unresponsive:
144+
145+
1. Shut down all healthy VMs.
146+
1. Reboot the Hyper-V host.
147+
1. Delete the problematic VM from Hyper-V Manager (retain the VHDX file).
148+
1. Back up and, if needed, delete or rename the `.vmcx` and `.vmrs` files.
149+
1. Re-create the VM using the existing VHDX disk.
150+
151+
1. Repair corrupted files:
152+
1. Collect a process dump and analyze for VMMS deadlocks.
153+
1. Use tools like Process Explorer or TSS scripts (from [aka.ms/getTSS](https://aka.ms/getTSS)) to collect logs and terminate stuck processes.
154+
155+
## Scenario 3: Permission/access denied errors when managing VMs
156+
157+
1. Update user group memberships: Add the user to the appropriate administrative groups on the Hyper-V host.
158+
1. Recompile WMI classes: Run the `MOFCOMP %SYSTEMROOT%\System32\WindowsVirtualization.V2.mof` command.
159+
1. Reconfigure credential delegation in Group Policy: Allow delegation of fresh credentials with NTLM-only server authentication.
160+
161+
## Scenario 4: Duplicate VM UUIDs detected by vendor tools
162+
163+
- For VMware:
164+
165+
Edit the VM's `.vmx` file: Locate the **uuid.bios** entry and assign it a unique value.
166+
167+
- For Hyper-V:
168+
169+
1. Use a third-party tool to randomize or update the BIOS GUID.
170+
2. If not feasible, create a new VM from scratch to ensure a unique UUID.
171+
172+
## Scenario 5: Cluster or Hyper-V role installation fails
173+
174+
1. Install the latest Windows updates: Fully patch the server before attempting the Hyper-V feature installation.
175+
1. Retry role installation by using Server Manager, PowerShell, or Deployment Image Servicing and Management (DISM) tools:
176+
177+
```powershell
178+
Install-WindowsFeature -Name Hyper-V -IncludeManagementTools
179+
```
180+
181+
## Scenario 6: Backup or WMI-related cluster node failures
182+
183+
1. Check WMI service status: If **WMIPrvSE.exe** crashes, restart the service or reboot the node.
184+
1. Collect logs during the incident: Gather SDP, TSS, and cluster logs for root cause analysis.
185+
186+
## Scenario 7: Failover Cluster console/GUI not working
187+
188+
1. Check for duplicate cases: Verify the issue isn't already tracked in an open support case.
189+
1. Verify cluster configuration: Review cluster logs and configurations for inconsistencies.
190+
191+
## Scenario 8: Hardening and best practices
192+
193+
1. Run Best Practices Analyzer (BPA):
194+
195+
Use Server Manager or the following PowerShell cmdlet:
196+
197+
```powershell
198+
Invoke-BpaModel -ModelId Microsoft/Windows/Hyper-V
199+
```
200+
201+
1. Implement hardening recommendations: Apply security recommendations from Microsoft documentation and BPA output.
202+
203+
## Data collection: Logs and commands
204+
205+
- TSS script for Hyper-V diagnostics:
206+
207+
Download and run [TSS](https://aka.ms/getTSS). For example:
208+
209+
```powershell
210+
Set-ExecutionPolicy -scope Process -ExecutionPolicy RemoteSigned.\TSS.ps1 -SDP HyperV
211+
```
212+
213+
- Event log collection: Copy and zip event log folders under **%SystemRoot%\\System32\\winevt**.
214+
- WMI and VM UUID queries:
215+
216+
```powershell
217+
Get-CimInstance -ClassName Win32_ComputerSystemProduct | Select UUIDGet-WmiObject Win32_ComputerSystemProduct | Select-Object -ExpandProperty UUID
218+
```
219+
220+
- WinRM and PowerShell remoting setup:
221+
222+
```powershell
223+
Enable-PSRemotingEnable-WSManCredSSP -Role server/clientSet-Item wsman:localhost\client\trustedhosts <host IP>
224+
```
225+
226+
- VM operations via remote PowerShell:
227+
228+
```powershell
229+
$cred = Get-CredentialInvoke-Command -Credential $cred -ComputerName <Hyper-V host IP> -ScriptBlock { Start-VM <VM Name> }
230+
```

0 commit comments

Comments
 (0)