Skip to content

Commit d28fd58

Browse files
committed
Updated with SME suggestions from Jia and Vibha.
1 parent a9cb93e commit d28fd58

File tree

1 file changed

+48
-62
lines changed

1 file changed

+48
-62
lines changed

articles/databox-online/azure-stack-edge-gpu-clustering-overview.md

Lines changed: 48 additions & 62 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ author: alkohli
77
ms.service: databox
88
ms.subservice: edge
99
ms.topic: conceptual
10-
ms.date: 03/07/2024
10+
ms.date: 03/19/2024
1111
ms.author: alkohli
1212
---
1313

@@ -56,11 +56,11 @@ The infrastructure cluster on your device provides persistent storage and is sho
5656
The Kubernetes cluster allows for application orchestration whereas the infrastructure cluster provides persistent storage.
5757

5858

59-
## Supported networking topologies
59+
## Supported network topologies
6060

61-
Based on the use-case and workloads, you can select how the two Azure Stack Edge device nodes will be connected. The networking topologies available will differ depending on whether you use an Azure Stack Edge Pro GPU device or an Azure Stack Edge Pro 2 device.
61+
Based on the use-case and workloads, you can select how the two Azure Stack Edge device nodes will be connected. Network topologies will differ depending on whether you use an Azure Stack Edge Pro GPU device or an Azure Stack Edge Pro 2 device.
6262

63-
The supported network topologies for each of the device types are described here.
63+
At a high level, supported network topologies for each of the device types are described here.
6464

6565
### [Azure Stack Edge Pro GPU](#tab/1)
6666

@@ -91,65 +91,51 @@ For more information, see how to [Choose a network topology for your device node
9191

9292
On your Azure Stack Edge Pro 2 device node, the following network topologies are supported:
9393

94-
- **Option 1** - Port 1 and Port 2 in separate subnets, Port 3 and Port 4 use and external virtual switch.
94+
- **Option 1** - Port 1 and Port 2 are in different subnets. Separate virtual switches will be created. Port 3 and Port 4 connect to an external virtual switch.
9595

96-
- **Option 2** - Port 1 and Port 2 in the same subnet, Port 3 and Port 4 use an external virtual switch.
97-
98-
- **Option 3** - Port 1 and Port 2 in separate subnets, Port 3 and Port 4 are connected back-to-back, switchless.
99-
100-
- **Option 4** - Port 1 and Port 2 in the same subnet, Port 3 and Port 4 are connected back-to-back, switchless.
101-
102-
Usage considerations:
103-
104-
- Port 1 is used for initial configuration. Port 1 is then reconfigured and assigned an IP address that may or may not be in the same subnet as the Port 2.
105-
- Port 1 and Port 2 are used for clustering, storage and management traffic.
106-
- If you select the **Using external switches** option, Port 1 and Port 2 are used for storage in both teaming and non-teaming modes.
107-
- **Switchless** and **Using external switches** options are for Port 3 and Port 4.
108-
- When using the **Switchless** option, Port 3 and Port 4 are connected back-to-back directly without a switch. These ports are dedicated to storage and Azure Stack Edge cluster traffic. Port 3 and Port 4 aren't available for workload traffic.
109-
- For Private Multi-Access Edge Computing (PMEC) workload deployments:
110-
- If you run PMEC workloads, use option 1 or option 2.
111-
- Port 3 and Port 4 are used for PMEC workload deployments or for storage traffic.
112-
- For PMEC/AP5GC workload deployments, select the **Using external switches** option. In this case, Port 3 and Port 4 are reserved for PMEC/AP5GC workloads.
113-
- Port 3 and Port 4 are used for PMEC workload deployments or for storage traffic.
114-
115-
116-
- **Switchless** - Use this option when you don't have high speed switches available in the environment for storage and cluster traffic. There are further sub-options:
117-
118-
- **With Port 1 and Port 2 in separate subnets** - This is the default option. In this case, Port 1 and Port 2 have separate virtual switches and are connected to separate subnets.
119-
120-
- **With Port 1 and Port 2 in same subnets** - In this case, Port 1 and Port 2 have a teamed virtual switch and both the ports are in the same subnet.
121-
122-
In each case, Port 3 and Port 4 are connected back-to-back directly without a switch. These ports are dedicated to storage and Azure Stack Edge cluster traffic and aren't available for workload traffic.
123-
124-
- **Using external switches** - Use this option when you have high speed switches (10 GbE switches) available for use with your device nodes for storage and cluster traffic. There are further sub-options:
125-
126-
- **With Port 1 and Port 2 in separate subnets** - This is the default option. In this case, Port 1 and Port 2 have separate virtual switches and are connected to separate subnets.
127-
128-
- **With Port 1 and Port 2 in same subnets** - In this case, Port 1 and Port 2 have a teamed virtual switch and both the ports are in the same subnet.
129-
130-
In each case, Port 3 and Port 4 are reserved for PMEC workload deployments.
131-
132-
The pros and cons for each of the above supported topologies can be summarized as follows:
133-
134-
| Local web UI option | Advantages | Disadvantages |
135-
|----------------------------------------------------------------|--------------------------------------------------------------------------|-----------------------------------------------------------------------|
136-
| Switchless, Port 1 and Port 2 in separate subnet, separate virtual switches | Redundant paths for management and storage traffic. | Clients need to reconnect if Port 1 or Port 2 fails. |
137-
| | No single point of failure within the device. | |
138-
| | Lots of bandwidth for storage and cluster traffic across the nodes. | |
139-
| | Can be deployed with Port 1 and Port 2 in different subnets. | |
140-
| | | |
141-
| Switchless, Port 1 and Port 2 in the same subnet, teamed virtual switch | Redundant paths for management and storage traffic. | Teamed virtual switch is a single point of failure in the software. |
142-
| | Lots of bandwidth for storage and cluster traffic across the nodes. | |
143-
| | Higher fault tolerance. | |
144-
| | | |
145-
| Using external switch, Port 1 and Port 2 in separate subnet, separate virtual switches | Two independent virtual switches and network paths provide redundancy. | Clients need to reconnect if Port 1 or Port 2 fails. |
146-
| | No single point of failure with the device. | |
147-
| | Port 1 and Port 2 can be connected to different subnets. | |
148-
| | | |
149-
| Using external switch, Port 1 and Port 2 in same subnet, teamed virtual switch | Load balancing. | Teamed switch is a single point of failure in software. |
150-
| | Higher fault toelerance. | Can't be deployed in an environment with different subnets. |
151-
| | Two independent, redundant paths between the nodes. | |
152-
| | Clients do not need to reconnect. | |
96+
- **Option 2** - Port 1 and Port 2 are in the same subnet. A teamed virtual switch will be created. Port 3 and Port 4 connect to an external virtual switch.
97+
98+
- **Option 3** - Port 1 and Port 2 are in separate subnets. A teamed virtual switch will be created. Port 3 and Port 4 are connected back-to-back, switchless for Port 3 and Port 4.
99+
100+
- **Option 4** - Port 1 and Port 2 are in the same subnet. A teamed virtual switch will be created. Port 3 and Port 4 are connected back-to-back, switchless for Port 3 and Port 4.
101+
102+
> [!NOTE]
103+
> If you run PMEC workloads, use Option 1 or Option 2.
104+
105+
Usage considerations on your Azure Stack Edge Pro 2 device nodes:
106+
107+
- **Switchless for Port 3 and Port 4** - Use this option when you don't have high speed switches available in the environment, or you want to dedicate Port 3 and Port 4 for storage and cluster traffic.
108+
- **Port 1 and Port 2 in separate subnets** - This is the default option. In this case, Port 1 and Port 2 have separate virtual switches and are connected to separate subnets.
109+
- **Port 1 and Port 2 in the same subnet** - In this case, Port 1 and Port 2 have a teamed virtual switch and both ports are in the same subnet.
110+
- **Using external switches for Port 3 and Port 4** - Use this option when you have high speed switches (>=10 GbE bandwidth) available for use with your device nodes and you want to allow a VM network adapter to connect to the virtual network created on Port 3 or Port 4, like a PMEC use case.
111+
- Additional options for **Switchless** scenarios that use **external switches**:
112+
- **Port 1 and Port 2 in separate subnets** - This is the default option. In this case, Port 1 and Port 2 have separate virtual switches and are connected to separate subnets.
113+
- **Port 1 and Port 2 in the same subnet** - In this case, Port 1 and Port 2 have a teamed virtual switch and both ports are in the same subnet.
114+
115+
Additional considerations:
116+
117+
- Port 1 is used for initial configuration. Port 1 is then reconfigured and assigned an IP address that may or may not be in the same subnet as Port 2.
118+
- If you select the **Using external switches** option, Port 1 and Port 2 are used for storage in both teaming and non-teaming modes.
119+
- When using the **Switchless** option, Port 3 and Port 4 are connected back-to-back directly, without a switch. These ports are dedicated to storage and Azure Stack Edge cluster traffic. Port 3 and Port 4 aren't available for workload traffic.
120+
121+
Pros and cons for supported topologies are summarized as follows:
122+
123+
| Local web UI option | Advantages | Disadvantages |
124+
|---------------------|------------|---------------|
125+
| Port 3 and Port 4 Switchless, Port 1 and Port 2 in separate subnet, separate virtual switches. | Redundant paths for management and storage traffic. | Clients must reconnect if Port 1 or Port 2 fails. |
126+
| | No single point of failure within the device. | VM workload can't leverage Port 3 or Port 4 to connect to network endpoints other than a peer Azure Stack Edge node. This is why PMEC workloads can't use this option. |
127+
| | Lots of bandwidth for storage and cluster traffic across nodes. | |
128+
| | Can be deployed with Port 1 and Port 2 in different subnets. | |
129+
| Port 3 and Port 4 are Switchless, Port 1 and Port 2 are in the same subnet, teamed virtual switch. | Redundant paths for management and storage traffic. | VM workload can't leverage Port 3 or Port 4 to connect to network endpoints other than a peer Azure Stack Edge node. This is why PMEC workloads can't use this option. |
130+
| | Lots of bandwidth for storage and cluster traffic across nodes. | |
131+
| | Higher fault tolerance. | |
132+
| Port 3 and Port 4 use an external switch with >=10Gbps link bandwidth, Port 1 and Port 2 in separate subnets, separate virtual switches | Two independent virtual switches and network paths provide redundancy. | Clients must reconnect if Port 1 or Port 2 fails. |
133+
| | No single point of failure with the device. | |
134+
| | Port 1 and Port 2 can be connected to different subnets. | |
135+
| Port 3 and Port 4 use an external switch with >=10Gbps link bandwidth, Port 1 and Port 2 in the same subnet, teamed virtual switch. | Load balancing. | |
136+
| | Higher fault tolerance. | Can't be deployed in an environment with different subnets. |
137+
| | Two independent, redundant paths between nodes. | |
138+
| | Clients do not need to reconnect. | |
153139

154140
---
155141

0 commit comments

Comments
 (0)