You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: playbooks/roles/healthchecks/files/meshpinger_readme.md
+18-14Lines changed: 18 additions & 14 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,9 +2,9 @@
2
2
# OCI Meshpinger
3
3
4
4
Meshpinger is a tool for validating network layer connectivity between RDMA NICs on a
5
-
cluster network in OCI. The tool is capable of initiating ICMP ping from every RDMA NIC
5
+
cluster network in OCI. The tool initiates an ICMP ping from every RDMA NIC
6
6
port on the cluster network to every other RDMA NIC port on the same cluster network and
7
-
reporting back the success/failure status of the pings performed in the form of logs
7
+
reports back the success/failure status of the pings performed in the form of logs
8
8
9
9
Running the tool before starting workload on a cluster network should serve as a good precheck
10
10
step to gain confidence on the network reachability between RDMA NICs. Typical causes for
@@ -18,23 +18,23 @@ reachability failures that the tool can help pinpoint are,
18
18
19
19
4. Host rdma interface enumeration issues
20
20
21
-
5.ping failure between a <src,dst> pair of IPs
21
+
5.Network connectivity issues between <src,dst> pair of IPs
22
22
23
23
# Running Meshpinger
24
24
25
-
Meshpinger is installed on controller node of the hpc cluster and can be run in following ways after logging into the controller node
25
+
Meshpinger is installed on the controller host of the hpc cluster. Once user is logged into the controller host, they can trigger meshpinger using the following options,
26
26
27
-
- Run meshpinger on all nodes in the cluster, cluster is auto-detected in this case
27
+
- Run meshpinger on all hosts in the cluster. The cluster is auto-detected in this option.
28
28
```
29
29
/opt/oci-hpc/healthchecks/run_meshpinger.sh
30
30
```
31
31
32
-
- Run meshpinger on all nodes in the cluster explicitly specified by clustername
32
+
- Run meshpinger on all hosts in the cluster explicitly specified by clustername
Run meshpinger on a list of nodes specified in a file. A host can be specified by its ssh IP address or hostname but it should be SSH-able from controller node
37
+
Run meshpinger on a list of hosts specified in a file. A host can be specified by its IP address or hostname. It is expected that the host will be SSH-able from the controller host
Logfile of the current run that enumerates all <srcInterface,dstInterface> combinations that failed ping is printed like,
66
66
67
67
```
68
-
<src,dst> interfaces that failed ping is listed at end of the log file meshpinger_log_20241008220615_ocid1.tenancy.oc1..aaaaaaaabddc4obuhgvifcrh6esmw6554ityaqrvxulcksl255gbwehtcq.txt
68
+
<src,dst> interfaces that failed ping are listed at end of the log file meshpinger_log_20241008220615_ocid1.tenancy.oc1..aaaaaaaabddc4obuhgvifcrh6esmw6554ityaqrvxulcksl255gbwehtcq.txt
69
69
```
70
70
71
71
@@ -74,6 +74,10 @@ Logfile of the current run that enumerates all <srcInterface,dstInterface> combi
74
74
```
75
75
All pings succeeded!!
76
76
```
77
+
- Cluster information that includes rdma interface details gathered from the run is stored in a file cluster_info.txt in the current directory, same is printed as below,
78
+
```
79
+
clusterinfo file - cluster_info.txt
80
+
```
77
81
78
82
# Options
79
83
Other options supported are shown in the help text below.
@@ -114,9 +118,9 @@ optional arguments:
114
118
--objectstoreurl OBJECTSTOREURL
115
119
ObjectStore PAR URL where mesh pinger logs will be
116
120
uploaded
117
-
--singlesubnet Include this argument if all RDMA NICs are on a single
118
-
subnetted cluster network. If so pinger will do a full
119
-
mesh ping
121
+
--enable_inter_rail_ping
122
+
Include this argument to perform pings across the rails.
123
+
If so pinger will do a full mesh ping
120
124
--threads_per_intf THREADS_PER_INTF
121
125
parallel ping threads per local rdma interface,
122
126
default is 16
@@ -162,11 +166,11 @@ NIC model to use (e.g MT2910 for CX-7) for filtering out RDMA interfaces from fr
162
166
163
167
**--objectstoreurl**
164
168
165
-
Pre-Authenticated Request(PAR) url where meshpinger logs will be uploaded. This can be used by customers to easily share meshpinger logs to OCI operator. OCI operator can create a PAR to objectstore bucket owned by them and share it with customer to enable them to share the logs
169
+
Pre-Authenticated Request(PAR) url where meshpinger logs will be uploaded. This can be used by customers to easily share meshpinger logs with OCI during any incidents. OCI can provide a PAR to objectstore bucket and share it with customer to enable sharing of meshpinger logs.
166
170
167
-
**--singlesubnet**
171
+
**--enable_inter_rail_ping**
168
172
169
-
This option specifies all rdma interfaces on hosts in the hostlist file are part of a single subnet. In this case meshpinger will do pings to all remote IPs from all local interfaces on a given host. It is to be noted that when this option is chosen net.ipv4.neigh.default.gc_threshX [X=1-3] sysctl setting on every host may need to be bumped up to hold the necessary arp entries per local interface. Eg. For running meshpinger on a 512 host cluster with each host having 16 rdma interface, size of the arp table should be atleast 130816(511 * 16 * 16). Accordingly it is recommended to set all the 3 sysctl thresholds - net.ipv4.neigh.default.gc_threshX[X=1-3] to 130816. Be default meshpinger assumes each rdma interface on a host is on a separate subnet and performs pings between rdma interfaces that have the same pci address.
173
+
This option specifies all rdma interfaces on hosts in the hostlist file are part of a single subnet. In this case meshpinger will do pings to all remote IPs from all local interfaces on a given host. It is to be noted that when this option is chosen net.ipv4.neigh.default.gc_threshX [X=1-3] sysctl setting on every host may need to be bumped up to hold the necessary arp entries per local interface. Eg. For running meshpinger on a 512 host cluster with each host having 16 rdma interface, size of the arp table should be atleast 130816(511 * 16 * 16). Accordingly it is recommended to set all the 3 sysctl thresholds - net.ipv4.neigh.default.gc_threshX[X=1-3] to 130816. By default, meshpinger only pings along the rails.
0 commit comments