@@ -86,6 +86,21 @@ Example usage:

    perf stat -a -e nvidia_nvlink_c2c0_pmu_3/event=0x0/

+The NVLink-C2C has two ports that can be connected to one GPU (occupying both
+ports) or to two GPUs (one GPU per port). Use the "port" bitmap parameter to
+select the port(s) to monitor. Bit N of the bitmap selects port N, e.g.
+"port=0x1" selects port 0 and "port=0x3" selects ports 0 and 1.
+
+Examples of port filtering:
+
+* Count event id 0x0 from the GPU connected with socket 0 on port 0::
+
+   perf stat -a -e nvidia_nvlink_c2c0_pmu_0/event=0x0,port=0x1/
+
+* Count event id 0x0 from the GPUs connected with socket 0 on port 0 and port 1::
+
+   perf stat -a -e nvidia_nvlink_c2c0_pmu_0/event=0x0,port=0x3/
+
 NVLink-C2C1 PMU
 -------------------

@@ -116,6 +131,21 @@ Example usage:

    perf stat -a -e nvidia_nvlink_c2c1_pmu_3/event=0x0/

+The NVLink-C2C has two ports that can be connected to one GPU (occupying both
+ports) or to two GPUs (one GPU per port). Use the "port" bitmap parameter to
+select the port(s) to monitor. Bit N of the bitmap selects port N, e.g.
+"port=0x1" selects port 0 and "port=0x3" selects ports 0 and 1.
+
+Examples of port filtering:
+
+* Count event id 0x0 from the GPU connected with socket 0 on port 0::
+
+   perf stat -a -e nvidia_nvlink_c2c1_pmu_0/event=0x0,port=0x1/
+
+* Count event id 0x0 from the GPUs connected with socket 0 on port 0 and port 1::
+
+   perf stat -a -e nvidia_nvlink_c2c1_pmu_0/event=0x0,port=0x3/
+
 CNVLink PMU
 ---------------

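
The "port" bitmap described in both hunks can be illustrated with a small shell sketch. It assumes only what the added text states: there are two ports, and bit N selects port N. The helper names `port_mask` and `perf_cmd` are hypothetical and are not part of perf or of this patch. The script prints the resulting command rather than running it, since running it requires the hardware.

```shell
#!/bin/sh
# Hypothetical helper (not part of perf or this patch): build the "port"
# bitmap from a list of port numbers by setting bit N for port N.
port_mask() {
    mask=0
    for p in "$@"; do
        case "$p" in
            0|1) mask=$(( mask | (1 << p) )) ;;
            *) echo "invalid port: $p (expected 0 or 1)" >&2; return 1 ;;
        esac
    done
    printf '0x%x\n' "$mask"
}

# Hypothetical helper: print (do not run) the perf command for a PMU name
# and a list of ports. Event id 0x0 is taken from the examples in the patch.
perf_cmd() {
    pmu=$1; shift
    m=$(port_mask "$@") || return 1
    echo "perf stat -a -e ${pmu}/event=0x0,port=${m}/"
}

perf_cmd nvidia_nvlink_c2c0_pmu_0 0
# prints: perf stat -a -e nvidia_nvlink_c2c0_pmu_0/event=0x0,port=0x1/
perf_cmd nvidia_nvlink_c2c1_pmu_0 0 1
# prints: perf stat -a -e nvidia_nvlink_c2c1_pmu_0/event=0x0,port=0x3/
```

Rejecting port numbers other than 0 and 1 is a choice made for this sketch, based on the two-port description in the patch. How the driver itself treats other bits is not stated in the diff.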