Skip to content

Commit c16c37e

Browse files
committed
OCPBUGS-9096: when trying this on a customer cluster, we also needed to define api-int.. for the workers to be able to resolve.
1 parent 83d1499 commit c16c37e

File tree

2 files changed

+49
-0
lines changed

2 files changed

+49
-0
lines changed

installing/installing_bare_metal_ipi/ipi-install-troubleshooting.adoc

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -36,9 +36,11 @@ include::modules/ipi-install-troubleshooting-bootstrap-vm-inspecting-logs.adoc[l
3636
include::modules/ipi-install-troubleshooting-cluster-nodes-will-not-pxe.adoc[leveloffset=+1]
3737
include::modules/ipi-install-troubleshooting_unable-to-discover-new-bare-metal-hosts-using-the-bmc.adoc[leveloffset=+1]
3838
include::modules/ipi-install-troubleshooting-api-not-accessible.adoc[leveloffset=+1]
39+
include::modules/ipi-install-troubleshooting_proc_worker-nodes-cannot-join-the-cluster.adoc[leveloffset=+1]
3940
include::modules/ipi-install-troubleshooting-cleaning-up-previous-installations.adoc[leveloffset=+1]
4041
include::modules/ipi-install-troubleshooting-registry-issues.adoc[leveloffset=+1]
4142
include::modules/ipi-install-troubleshooting-misc-issues.adoc[leveloffset=+1]
4243
include::modules/ipi-install-troubleshooting-failed-ignition-during-firstboot.adoc[leveloffset=+2]
4344
include::modules/ipi-install-troubleshooting-ntp-out-of-sync.adoc[leveloffset=+2]
4445
include::modules/ipi-install-troubleshooting-reviewing-the-installation.adoc[leveloffset=+1]
46+
Lines changed: 47 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,47 @@
1+
// This module is included in the following assemblies:
2+
//
3+
// installing/installing_bare_metal_ipi/ipi-install-troubleshooting.adoc
4+
5+
:_content-type: PROCEDURE
6+
[id="worker-nodes-cannot-join-the-cluster_{context}"]
7+
= Troubleshooting worker nodes that cannot join the cluster
8+
9+
Installer-provisioned clusters deploy with a DNS server that includes a DNS entry for the `api-int.<cluster_name>.<base_domain>` URL. If the nodes within the cluster use an external or upstream DNS server to resolve the `api-int.<cluster_name>.<base_domain>` URL and there is no such entry, worker nodes might fail to join the cluster. Ensure that all nodes in the cluster can resolve the domain name.
10+
11+
.Procedure
12+
13+
. Add a DNS A/AAAA or CNAME record to internally identify the API load balancer. For example, when using dnsmasq, modify the `dnsmasq.conf` configuration file:
14+
+
15+
[source,terminal,options="nowrap",role="white-space-pre"]
16+
----
17+
$ sudo nano /etc/dnsmasq.conf
18+
----
19+
+
20+
[source,terminal,options="nowrap",role="white-space-pre"]
21+
----
22+
address=/api-int.<cluster_name>.<base_domain>/<IP_address>
23+
address=/api-int.mycluster.example.com/192.168.1.10
24+
address=/api-int.mycluster.example.com/2001:0db8:85a3:0000:0000:8a2e:0370:7334
25+
----
26+
27+
. Add a DNS PTR record to internally identify the API load balancer. For example, when using dnsmasq, modify the `dnsmasq.conf` configuration file:
28+
+
29+
[source,terminal,options="nowrap",role="white-space-pre"]
30+
----
31+
$ sudo nano /etc/dnsmasq.conf
32+
----
33+
+
34+
[source,terminal,options="nowrap",role="white-space-pre"]
35+
----
36+
ptr-record=<IP_address>.in-addr.arpa,api-int.<cluster_name>.<base_domain>
37+
ptr-record=10.1.168.192.in-addr.arpa,api-int.mycluster.example.com
38+
----
39+
40+
. Restart the DNS server. For example, when using dnsmasq, execute the following command:
41+
+
42+
[source,terminal,subs="+quotes",options="nowrap",role="white-space-pre"]
43+
----
44+
$ sudo systemctl restart dnsmasq
45+
----
46+
47+
These records must be resolvable from all the nodes within the cluster.

0 commit comments

Comments
 (0)