You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
@@ -399,15 +399,16 @@ If an upgrade is stuck in an `Upgrading System Service` state for an extended pe
399
399
400
400
---
401
401
402
-
### 8. The `registry.suse.com/harvester-beta/vmdp:latest` image is not available in airgapped environment
402
+
### 8. The `registry.suse.com/harvester-beta/vmdp:latest` image is not available in air-gapped environment
403
403
404
404
Harvester does not package the `registry.suse.com/harvester-beta/vmdp:latest` image in the ISO file as of v1.1.0. For Windows VMs before v1.1.0, they used this image as a container disk. However, kubelet may remove old images to free up bytes. Windows VMs can't access an air-gapped environment when this image is removed. You can fix this issue by changing the image to `registry.suse.com/suse/vmdp/vmdp:2.5.4.2` and restarting the Windows VMs.
405
405
406
406
- Related issue:
407
407
- [[BUG] VMDP Image wrong after upgrade to Harvester 1.2.0](https://github.com/harvester/harvester/issues/4534)
408
408
409
409
---
410
-
### 9. Upgrade stuck in the Post-draining state
410
+
411
+
### 9. An Upgrade is stuck in the Post-draining state
411
412
412
413
The node might be stuck in the OS upgrade process if you encounter the **Post-draining** state, as shown below.
413
414
@@ -483,3 +484,111 @@ After performing the steps above, you should pass post-draining with the next re
483
484
- [A potential bug in NewElementalPartitionsFromList which caused upgrade error code 33](https://github.com/rancher/elemental-toolkit/issues/1827)
4. The pod `fleet-agent-*` has following error log:
530
+
531
+
```
532
+
fleet-agent pod log:
533
+
534
+
time="2023-09-19T12:18:10Z" level=error msg="Failed to register agent: looking up secret cattle-fleet-local-system/fleet-agent-bootstrap: Post \"https://192.168.122.199/apis/fleet.cattle.io/ v1alpha1/namespaces/fleet-local/clusterregistrations\": tls: failed to verify certificate: x509: cannot validate certificate for 192.168.122.199 because it doesn't contain any IP SANs"
535
+
```
536
+
537
+
5. Check the `ssl-certificates` settings in Harvester:
538
+
539
+
From the command line:
540
+
541
+
```
542
+
# kubectl get settings.harvesterhci.io ssl-certificates
6. Check the `server-url` setting, it is the value of VIP:
552
+
553
+
```
554
+
# kubectl get settings.management.cattle.io -n cattle-system server-url
555
+
NAME VALUE
556
+
server-url https://192.168.122.199
557
+
```
558
+
559
+
7. The root cause:
560
+
561
+
User sets the self-signed `ssl-certificates` with FQDN in the Harvester settings, but the `server-url` points to the VIP, the `fleet-agent` pod fails to register.
562
+
563
+
```
564
+
For example: create self-signed certificate for (*).example.com
# kubectl get settings.management.cattle.io -n cattle-system server-url
583
+
NAME VALUE
584
+
server-url https://harv31.example.com
585
+
```
586
+
587
+
After the workaround is applied, the `fleet-agent` pod is replaced by Rancher automatically and registers successfully, the upgrade continues.
588
+
589
+
- Related issue:
590
+
- [[BUG] Upgrade to Harvester 1.2.0 fails in fleet-agent due to customer provided SSL certificate without IP SAN](https://github.com/harvester/harvester/issues/4519)
@@ -330,7 +330,7 @@ If you notice the upgrade is stuck in the **Upgrading System Service** state for
330
330
331
331
---
332
332
333
-
### 7. Upgrade stuck in the `Upgrading System Service` state
333
+
### 7. An upgrade is stuck in the `Upgrading System Service` state
334
334
335
335
If an upgrade is stuck in an `Upgrading System Service` state for an extended period, some system services' certificates may have expired. To investigate and resolve this issue, follow these steps:
336
336
@@ -399,15 +399,16 @@ If an upgrade is stuck in an `Upgrading System Service` state for an extended pe
399
399
400
400
---
401
401
402
-
### 8. The `registry.suse.com/harvester-beta/vmdp:latest` image is not available in airgapped environment
402
+
### 8. The `registry.suse.com/harvester-beta/vmdp:latest` image is not available in air-gapped environment
403
403
404
404
Harvester does not package the `registry.suse.com/harvester-beta/vmdp:latest` image in the ISO file as of v1.1.0. For Windows VMs before v1.1.0, they used this image as a container disk. However, kubelet may remove old images to free up bytes. Windows VMs can't access an air-gapped environment when this image is removed. You can fix this issue by changing the image to `registry.suse.com/suse/vmdp/vmdp:2.5.4.2` and restarting the Windows VMs.
405
405
406
406
- Related issue:
407
407
- [[BUG] VMDP Image wrong after upgrade to Harvester 1.2.0](https://github.com/harvester/harvester/issues/4534)
408
408
409
409
---
410
-
### 9. Upgrade stuck in the Post-draining state
410
+
411
+
### 9. An Upgrade is stuck in the Post-draining state
411
412
412
413
The node might be stuck in the OS upgrade process if you encounter the **Post-draining** state, as shown below.
413
414
@@ -484,3 +485,110 @@ After performing the steps above, you should pass post-draining with the next re
4. The pod `fleet-agent-*` has following error log:
530
+
531
+
```
532
+
fleet-agent pod log:
533
+
534
+
time="2023-09-19T12:18:10Z" level=error msg="Failed to register agent: looking up secret cattle-fleet-local-system/fleet-agent-bootstrap: Post \"https://192.168.122.199/apis/fleet.cattle.io/ v1alpha1/namespaces/fleet-local/clusterregistrations\": tls: failed to verify certificate: x509: cannot validate certificate for 192.168.122.199 because it doesn't contain any IP SANs"
535
+
```
536
+
537
+
5. Check the `ssl-certificates` settings in Harvester:
538
+
539
+
From the command line:
540
+
541
+
```
542
+
# kubectl get settings.harvesterhci.io ssl-certificates
6. Check the `server-url` setting, it is the value of VIP:
552
+
553
+
```
554
+
# kubectl get settings.management.cattle.io -n cattle-system server-url
555
+
NAME VALUE
556
+
server-url https://192.168.122.199
557
+
```
558
+
559
+
7. The root cause:
560
+
561
+
User sets the self-signed `ssl-certificates` with FQDN in the Harvester settings, but the `server-url` points to the VIP, the `fleet-agent` pod fails to register.
562
+
563
+
```
564
+
For example: create self-signed certificate for (*).example.com
# kubectl get settings.management.cattle.io -n cattle-system server-url
583
+
NAME VALUE
584
+
server-url https://harv31.example.com
585
+
```
586
+
587
+
After the workaround is applied, the `fleet-agent` pod is replaced by Rancher automatically and registers successfully, the upgrade continues.
588
+
589
+
- Related issue:
590
+
- [[BUG] Upgrade to Harvester 1.2.0 fails in fleet-agent due to customer provided SSL certificate without IP SAN](https://github.com/harvester/harvester/issues/4519)
0 commit comments