articles/operator-nexus/howto-replace-network-devices.md
76 additions & 29 deletions
Original file line number
Diff line number
Diff line change
@@ -9,9 +9,12 @@ ms.date: 08/12/2024
9
9
ms.custom: template-how-to, devx-track-azurecli
10
10
---
11
11
12
-
# Replace a device in Azure Operator Nexus Network Fabric (NNF)
12
+
# Replace a network device in Azure Operator Nexus Network Fabric (NNF)
13
13
14
-
This article describes how to replace a faulty or underperforming device in Azure Operator Nexus Network Fabric (NNF) using the RMA (Return Material Authorization) process which ensures minimal disruption and safe reintegration of the replacement hardware into the fabric.
14
+
This article explains how to replace a faulty or underperforming network device in Azure Operator Nexus Network Fabric (NNF).
15
+
It covers devices such as the Top of Rack (TOR) switch, Customer Edge (CE) switch, Network Packet Broker (NPB), and the Management Switch.
16
+
The replacement is performed using the Return Material Authorization (RMA) process.
17
+
This process is designed to minimize service disruption and safely reintegrate the new hardware into the fabric.
15
18
16
19
## Scenarios for device replacement
17
20
@@ -25,15 +28,20 @@ Device replacement may be required in the following situations:
25
28
26
29
## Prerequisites
27
30
28
-
- Azure CLI installed and configured.
31
+
To ensure a smooth and timely RMA process, verify the following prerequisites before initiating deployment:
29
32
30
-
- Required permissions to manage Microsoft.ManagedNetworkFabric resources.
33
+
- Azure CLI is installed and properly configured
31
34
32
-
- Replacement device powered on and connected physically.
35
+
- Permissions are granted to manage Microsoft.ManagedNetworkFabric resources
33
36
34
-
- Replacement device must support Zero Touch Provisioning (ZTP).
37
+
- Replacement device is powered on and physically connected
35
38
36
-
- To ensure a smooth and timely RMA process, please verify the following before initiating deployment:
39
+
- Replacement device supports Zero Touch Provisioning (ZTP)
40
+
41
+
- If the device is rebooting continuously because of hardware issues, power it off before initiating the RMA process. Otherwise, the device disable action can fail.
42
+
43
+
- Before initiating the RMA deployment, perform the following checks:
44
+
37
45
38
46
- Interface Speed Validation
39
47
@@ -42,14 +50,20 @@ Device replacement may be required in the following situations:
42
50
- If the speed is below 100 Mbps, update it accordingly to prevent delays or potential timeouts during the RMA process.
43
51
44
52
- Device Storage Check
45
-
- Ensure the device has a minimum of 2 GB of free space available.
53
+
- Ensure the device has a minimum of 3 GB of free space available.
46
54
47
-
- This is required to successfully download and stage the necessary image files.
55
+
- This free space is required to download and stage the necessary image files.
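
The two pre-RMA checks above can be sketched as a small shell helper. This is a minimal sketch, not part of the product tooling: the function name `rma_preflight` is hypothetical, and it assumes you have already collected the interface speed (in Mbps) and the free storage (in whole GB) from the device by whatever means you normally use.

```bash
# Hypothetical helper: validates the two pre-RMA checks described above.
# Arguments: interface speed in Mbps, free storage in GB (whole numbers).
# Collecting these values from the device is outside the scope of this sketch.
rma_preflight() {
  speed_mbps=$1
  free_gb=$2
  status=0
  if [ "$speed_mbps" -lt 100 ]; then
    echo "FAIL: interface speed ${speed_mbps} Mbps is below 100 Mbps"
    status=1
  fi
  if [ "$free_gb" -lt 3 ]; then
    echo "FAIL: only ${free_gb} GB free; at least 3 GB is required"
    status=1
  fi
  if [ "$status" -eq 0 ]; then
    echo "OK: preflight checks passed"
  fi
  return "$status"
}
```

For example, `rma_preflight 1000 5` passes both checks, while `rma_preflight 10 5` reports the interface speed failure and returns a nonzero exit code.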
48
56
57
+
## Device types supported
58
+
59
+
- Customer Edge (CE)
60
+
- Top of Rack (TOR)
61
+
- Management Switch (Mgmt Switch)
62
+
- Network Packet Broker (NPB)
49
63
50
64
## Steps to replace a device
51
65
52
-
1. Disable administrative state.
66
+
### Step 1: Disable administrative state
53
67
54
68
Use the following command to disable the administrative state of the device:
55
69
@@ -60,19 +74,25 @@ az networkfabric device update-admin-state \
60
74
--resource-group "resource-group-name"
61
75
```
62
76
63
-
This action:
77
+
This action sets the following states:
64
78
65
-
-Moves the device to a degraded state: EnabledDegraded.
79
+
- Device Administrative State: `Disabled`
66
80
67
-
-Excludes the device from all control plane actions such as:
81
+
- Fabric Administrative State: `EnabledDegraded`
68
82
69
-
- Certificate rotations
70
-
71
-
- Password rotations
72
-
73
-
- Fabric upgrades
83
+
> [!Note]
84
+
> The service does not permit this action if any of the following operations are in progress at the fabric level:
85
+
> - Device upgrade
86
+
> - Configuration push
87
+
> - Secret or certificate updates
88
+
> - Administrative lock
89
+
> - Terminal Server (TS) reprovisioning
90
+
91
+
### Step 2: Update the serial number
74
92
75
-
2. Update the serial number.
93
+
Execution conditions:
94
+
- Device Administrative State must be `Disabled`
95
+
- Fabric Administrative State must be `EnabledDegraded`
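
The execution conditions above can be checked in a script before updating the serial number. The following is a minimal sketch: `can_update_serial` is a hypothetical helper, and the commented wiring assumes the device state is exposed as `administrativeState` in the `show` output, which you should confirm against your CLI extension version.

```bash
# Hypothetical helper: returns 0 only when both execution conditions hold.
# Arguments: device administrative state, fabric administrative state.
can_update_serial() {
  device_state=$1
  fabric_state=$2
  [ "$device_state" = "Disabled" ] && [ "$fabric_state" = "EnabledDegraded" ]
}

# Example wiring (assumption: the property is named `administrativeState`):
#   device_state=$(az networkfabric device show \
#     --resource-name "device-name" \
#     --resource-group "resource-group-name" \
#     --query administrativeState -o tsv)
#   can_update_serial "$device_state" "$fabric_state" || echo "Not ready for Step 2"
```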
76
96
77
97
Once the replacement device is physically installed, update its serial number in the fabric resource:
78
98
@@ -83,14 +103,26 @@ az networkfabric device update \
83
103
--resource-group "resource-group-name"
84
104
```
85
105
86
-
3. Ensure device is in ZTP Mode.
106
+
Error recovery guidance:
107
+
108
+
- If the RMA fails because of an incorrect serial number, you can update the serial number again without opening a support ticket.
109
+
110
+
- If validation fails after device bootstrap, the system returns the status `Device Unable to Boot Up - Failed`.
111
+
112
+
This action performs the following tasks:
113
+
114
+
- Updates the serial number stored in the Azure Resource Manager (ARM) resource
115
+
116
+
- Keeps the Device Administrative State as `Disabled` and the Fabric Administrative State as `EnabledDegraded`
117
+
118
+
### Step 3: Ensure device is in ZTP Mode
87
119
88
120
Verify that the replacement device is in ZTP mode. If not, configure the device for ZTP before continuing.
89
121
90
122
> [!Note]
91
123
> ZTP enables automatic configuration retrieval during the RMA process.
92
124
93
-
4. Set RMA State.
125
+
### Step 4: Initiate RMA process
94
126
95
127
Initiate the RMA process using the following command:
96
128
@@ -101,23 +133,33 @@ az networkfabric device update-admin-state \
101
133
--resource-group "resource-group-name"
102
134
```
103
135
104
-
This will:
136
+
- The Network Fabric Controller pushes all required configuration files to the replacement device. If transient failures occur, retry the operation until success is confirmed.
137
+
138
+
- The device boots into its base configuration using the maintenance profile. This condition applies only to TOR and CE device types.
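
The retry guidance above can be sketched as a generic shell wrapper. This is an illustrative sketch only: `retry_until_success` is a hypothetical helper, and the attempt count and delay are arbitrary values you would tune for your environment.

```bash
# Hypothetical helper: run a command until it succeeds or attempts run out.
# Usage: retry_until_success <max_attempts> <delay_seconds> <command> [args...]
retry_until_success() {
  max_attempts=$1
  delay_seconds=$2
  shift 2
  attempt=1
  while :; do
    # Run the wrapped command exactly as passed in.
    if "$@"; then
      echo "Succeeded on attempt ${attempt}"
      return 0
    fi
    if [ "$attempt" -ge "$max_attempts" ]; then
      echo "Giving up after ${attempt} attempts" >&2
      return 1
    fi
    attempt=$((attempt + 1))
    sleep "$delay_seconds"
  done
}
```

To use it, pass the same `az networkfabric device update-admin-state` command and arguments shown above after the attempt count and delay, for example `retry_until_success 5 60 az networkfabric device update-admin-state ...`.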
139
+
140
+
This action sets the following states:
105
141
106
-
-Trigger the Network Fabric Controller to push all required configuration files to the replacement device.
142
+
- Device Administrative State: `UnderMaintenance`
107
143
108
-
-Retry the operation if there is transient failures until success is confirmed.
144
+
- Fabric Administrative State: `EnabledDegraded`
109
145
110
-
5. Refresh configuration
146
+
### Step 5: Refresh configuration
111
147
112
148
This step pushes the latest configuration to the device after it enters maintenance mode (applicable only for CE and TOR).
113
149
114
150
```Azure CLI
115
151
az networkfabric device refresh-configuration --resource-name <resource-name> --resource-group <rg-name>
116
152
```
117
153
118
-
This will push the latest config to the device.
154
+
This action pushes the latest configuration to the device.
119
155
120
-
6. Enable administrative state.
156
+
This action keeps the device in the following states:
157
+
158
+
- Device Administrative State: UnderMaintenance
159
+
160
+
- Fabric Administrative State: EnabledDegraded
161
+
162
+
### Step 6: Enable administrative state
121
163
122
164
Once configuration is applied successfully, bring the device back into active service:
123
165
@@ -128,9 +170,14 @@ az networkfabric device update-admin-state \
128
170
--resource-group "resource-group-name"
129
171
```
130
172
131
-
This will:
173
+
This action sets the following states once the device is fully healthy and synchronized with the fabric:
174
+
175
+
- Device Administrative State: `Enabled`
176
+
177
+
- Fabric Administrative State: `Enabled`
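
The state changes listed for each step can be collected into one lookup for use in verification scripts. This is a sketch under stated assumptions: `expected_states_after_step` is a hypothetical helper, and the entry for step 3 assumes the states from step 2 carry over, because step 3 is a device-side check and the article lists no state change for it.

```bash
# Expected states after each step, as listed in this article.
# Prints "<device administrative state> <fabric administrative state>".
expected_states_after_step() {
  case "$1" in
    # Step 3 is assumed to leave the states from step 2 unchanged.
    1|2|3) echo "Disabled EnabledDegraded" ;;
    4|5)   echo "UnderMaintenance EnabledDegraded" ;;
    # Step 6 assumes no other device in the fabric is still Disabled.
    6)     echo "Enabled Enabled" ;;
    *)     echo "unknown step: $1" >&2; return 1 ;;
  esac
}
```

A script can compare this output with the states reported by the service after each step and stop if they differ.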
132
178
133
-
- Sets device state to Enabled once it's fully healthy and synchronized with the fabric.
179
+
> [!Note]
180
+
> If any other device in the fabric is in the `Disabled` state, the Fabric Administrative State remains `EnabledDegraded`.
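
The relationship between device states and the fabric state can be modeled roughly as follows. This is a simplified sketch, not the service's actual logic: `fabric_state_for_devices` is a hypothetical helper that covers only the device states named in this article (`Enabled`, `Disabled`, `UnderMaintenance`) and does not model any others.

```bash
# Simplified model: the fabric is reported as EnabledDegraded while any
# device is Disabled or UnderMaintenance; otherwise it is Enabled.
# Arguments: one administrative state per device in the fabric.
fabric_state_for_devices() {
  for state in "$@"; do
    case "$state" in
      Disabled|UnderMaintenance)
        echo "EnabledDegraded"
        return 0
        ;;
    esac
  done
  echo "Enabled"
}
```

For example, `fabric_state_for_devices Enabled Disabled Enabled` prints `EnabledDegraded`, which matches the note above: enabling one replaced device does not return the fabric to `Enabled` while another device is still `Disabled`.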