@@ -119,6 +119,22 @@ REPORT OPTIONS
119
119
And the default sort keys are changed to local_weight, mem, sym, dso,
120
120
symbol_daddr, dso_daddr, snoop, tlb, locked, blocked, local_ins_lat.
121
121
122
+ -F::
123
+ --fields=::
124
+ Specify output field - multiple keys can be specified in CSV format.
125
+ Please see linkperf:perf-report[1] for details.
126
+
127
+ In addition to the default fields, 'perf mem report' will provide the
128
+ following fields to break down sample periods.
129
+
130
+ - op: operation in the sample instruction (load, store, prefetch, ...)
131
+ - cache: location in CPU cache (L1, L2, ...) where the sample hit
132
+ - mem: location in memory or other places the sample hit
133
+ - dtlb: location in Data TLB (L1, L2) where the sample hit
134
+ - snoop: snoop result for the sampled data access
135
+
136
+ Please take a look at the OUTPUT FIELD SELECTION section for caveats.
137
+
122
138
-T::
123
139
--type-profile::
124
140
Show data-type profile result instead of code symbols. This requires
@@ -156,6 +172,40 @@ but one sample with weight 180 and the other with weight 20:
156
172
90% [k] memcpy
157
173
10% [.] strcmp
158
174
175
+ OUTPUT FIELD SELECTION
176
+ ----------------------
177
+ "perf mem report" adds a number of new output fields specific to data source
178
+ information in the sample. Some of them have the same name with the existing
179
+ sort keys ("mem" and "snoop"). So unlike other fields and sort keys, they'll
180
+ behave differently when it's used by -F/--fields or -s/--sort.
181
+
182
+ Using those two as output fields will aggregate samples altogether and show
183
+ breakdown.
184
+
185
+ $ perf mem report -F mem,snoop
186
+ ...
187
+ # ------ Memory ------- --- Snoop ----
188
+ # RAM Uncach Other HitM Other
189
+ # ..................... ..............
190
+ #
191
+ 3.5% 0.0% 96.5% 25.1% 74.9%
192
+
193
+ But using the same name for sort keys will aggregate samples for each type
194
+ separately.
195
+
196
+ $ perf mem report -s mem,snoop
197
+ # Overhead Samples Memory access Snoop
198
+ # ........ ............ ....................................... ............
199
+ #
200
+ 47.99% 1509 L2 hit N/A
201
+ 25.08% 338 core, same node Any cache hit HitM
202
+ 10.24% 54374 N/A N/A
203
+ 6.77% 35938 L1 hit N/A
204
+ 6.39% 101 core, same node Any cache hit N/A
205
+ 3.50% 69 RAM hit N/A
206
+ 0.03% 158 LFB/MAB hit N/A
207
+ 0.00% 2 Uncached hit N/A
208
+
159
209
SEE ALSO
160
210
--------
161
211
linkperf:perf-record[1], linkperf:perf-report[1], linkperf:perf-arm-spe[1]
0 commit comments