You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
fix inaccurate information of topology-manager admin guide
Topology Manager deal with Guaranteed QoS Pod.
=>Topology Manager deal with all Pod QoS Classes since 1.17
Topology Manager works on Nodes with the Static CPU Policy enabled
=> changing wording to make this clearer and adding examples
Signed-off-by: Byonggon Chun <[email protected]>
An increasing number of systems leverage a combination of CPUs and hardware accelerators to support latency-critical execution and high-throughput parallel computation. These include workloads in fields such as telecommunications, scientific computing, machine learning, financial services and data analytics. Such hybrid systems comprise a high performance environment.
19
20
@@ -53,11 +54,16 @@ Support for the Topology Manager requires `TopologyManager` [feature gate](/docs
53
54
54
55
The Topology Manager currently:
55
56
56
-
-Works on Nodes with the `static` CPU Manager Policy enabled. See [control CPU Management Policies](/docs/tasks/administer-cluster/cpu-management-policies/)
57
-
-Works on Pods making CPU requests or Device requests via extended resources
57
+
-Aligns Pods of all QoS classes.
58
+
-Aligns the requested resources that Hint Provider provides topology hints for.
58
59
59
60
If these conditions are met, Topology Manager will align the requested resources.
60
61
62
+
{{< note >}}
63
+
To align CPU resources with other requested resources in a Pod Spec, the CPU Manager should be enabled and proper CPU Manager policy should be configured on a Node. See [control CPU Management Policies](/docs/tasks/administer-cluster/cpu-management-policies/).
64
+
{{< /note >}}
65
+
66
+
61
67
Topology Manager supports four allocation policies. You can set a policy via a Kubelet flag, `--topology-manager-policy`.
62
68
There are four supported policies:
63
69
@@ -72,7 +78,7 @@ This is the default policy and does not perform any topology alignment.
72
78
73
79
### best-effort policy {#policy-best-effort}
74
80
75
-
For each container in a Guaranteed Pod, kubelet, with `best-effort` topology
81
+
For each container in a Pod, the kubelet, with `best-effort` topology
76
82
management policy, calls each Hint Provider to discover their resource availability.
77
83
Using this information, the Topology Manager stores the
78
84
preferred NUMA Node affinity for that container. If the affinity is not preferred,
@@ -83,7 +89,7 @@ resource allocation decision.
83
89
84
90
### restricted policy {#policy-restricted}
85
91
86
-
For each container in a Guaranteed Pod, kubelet, with `restricted` topology
92
+
For each container in a Pod, the kubelet, with `restricted` topology
87
93
management policy, calls each Hint Provider to discover their resource availability.
88
94
Using this information, the Topology Manager stores the
89
95
preferred NUMA Node affinity for that container. If the affinity is not preferred,
For each container in a Guaranteed Pod, kubelet, with `single-numa-node` topology
106
+
For each container in a Pod, the kubelet, with `single-numa-node` topology
101
107
management policy, calls each Hint Provider to discover their resource availability.
102
108
Using this information, the Topology Manager determines if a single NUMA Node affinity is possible.
103
109
If it is, Topology Manager will store this and the *Hint Providers* can then use this information when making the
@@ -135,8 +141,7 @@ spec:
135
141
136
142
This pod runs in the `Burstable` QoS class because requests are less than limits.
137
143
138
-
If the selected policy is anything other than `none` , Topology Manager would not consider either of these Pod
139
-
specifications.
144
+
If the selected policy is anything other than `none`, Topology Manager would consider these Pod specifications. The Topology Manager would consult the Hint Providers to get topology hints. In the case of the `static`, the CPU Manager policy would return default topology hint, because these Pods do not have explicity request CPU resources.
140
145
141
146
142
147
```yaml
@@ -155,7 +160,26 @@ spec:
155
160
example.com/device: "1"
156
161
```
157
162
158
-
This pod runs in the `Guaranteed` QoS class because `requests` are equal to `limits`.
163
+
This pod with integer CPU request runs in the `Guaranteed` QoS class because `requests` are equal to `limits`.
164
+
165
+
166
+
```yaml
167
+
spec:
168
+
containers:
169
+
- name: nginx
170
+
image: nginx
171
+
resources:
172
+
limits:
173
+
memory: "200Mi"
174
+
cpu: "300m"
175
+
example.com/device: "1"
176
+
requests:
177
+
memory: "200Mi"
178
+
cpu: "300m"
179
+
example.com/device: "1"
180
+
```
181
+
182
+
This pod with sharing CPU request runs in the `Guaranteed` QoS class because `requests` are equal to `limits`.
159
183
160
184
161
185
```yaml
@@ -173,10 +197,15 @@ spec:
173
197
```
174
198
This pod runs in the `BestEffort` QoS class because there are no CPU and memory requests.
175
199
176
-
The Topology Manager would consider both of the above pods. The Topology Manager would consult the Hint Providers, which are CPU and Device Manager to get topology hints for the pods.
177
-
In the case of the `Guaranteed` pod the `static` CPU Manager policy would return hints relating to the CPU request and the Device Manager would send back hints for the requested device.
200
+
The Topology Manager would consider the above pods. The Topology Manager would consult the Hint Providers, which are CPU and Device Manager to get topology hints for the pods.
201
+
202
+
In the case of the `Guaranteed` pod with integer CPU request, the `static` CPU Manager policy would return topology hints relating to the exclusive CPU and the Device Manager would send back hints for the requested device.
203
+
204
+
In the case of the `Guaranteed` pod with sharing CPU request, the `static` CPU Manager policy would return default topology hint as there is no exclusive CPU request and the Device Manager would send back hints for the requested device.
205
+
206
+
In the above two cases of the `Guaranteed` pod, the `none` CPU Manager policy would return default topology hint.
178
207
179
-
In the case of the `BestEffort` pod the CPU Manager would send back the default hint as there is no CPU request and the Device Manager would send back the hints for each of the requested devices.
208
+
In the case of the `BestEffort` pod, the `static` CPU Manager policy would send back the default topology hint as there is no CPU request and the Device Manager would send back the hints for each of the requested devices.
180
209
181
210
Using this information the Topology Manager calculates the optimal hint for the pod and stores this information, which will be used by the Hint Providers when they are making their resource assignments.
0 commit comments