Skip to content

Commit 906944c

Browse files
authored
Fix XPU and CUDA tables in profiler recipe.
1 parent 06f9c4b commit 906944c

File tree

1 file changed

+16
-20
lines changed

1 file changed

+16
-20
lines changed

recipes_source/recipes/profiler_recipe.py

Lines changed: 16 additions & 20 deletions
Original file line numberDiff line numberDiff line change
@@ -209,8 +209,6 @@
209209
# Self CPU time total: 23.015ms
210210
# Self CUDA time total: 11.666ms
211211
#
212-
######################################################################
213-
214212

215213
######################################################################
216214
# (Note: the first use of XPU profiling may bring an extra overhead.)
@@ -220,26 +218,24 @@
220218
#
221219
# .. code-block:: sh
222220
#
223-
#------------------------------------------------------- ------------ ------------ ------------ ------------ ------------
224-
# Name Self XPU Self XPU % XPU total XPU time avg # of Calls
225-
# ------------------------------------------------------- ------------ ------------ ------------ ------------ ------------
226-
# model_inference 0.000us 0.00% 2.567ms 2.567ms 1
227-
# aten::conv2d 0.000us 0.00% 1.871ms 93.560us 20
228-
# aten::convolution 0.000us 0.00% 1.871ms 93.560us 20
229-
# aten::_convolution 0.000us 0.00% 1.871ms 93.560us 20
230-
# aten::convolution_overrideable 1.871ms 72.89% 1.871ms 93.560us 20
231-
# gen_conv 1.484ms 57.82% 1.484ms 74.216us 20
232-
# aten::batch_norm 0.000us 0.00% 432.640us 21.632us 20
233-
# aten::_batch_norm_impl_index 0.000us 0.00% 432.640us 21.632us 20
234-
# aten::native_batch_norm 432.640us 16.85% 432.640us 21.632us 20
235-
# conv_reorder 386.880us 15.07% 386.880us 6.448us 60
236-
# ------------------------------------------------------- ------------ ------------ ------------ ------------ ------------
237-
# Self CPU time total: 712.486ms
238-
# Self XPU time total: 2.567ms
239-
221+
# ------------------------------------------------------- ------------ ------------ ------------ ------------ ------------
222+
# Name Self XPU Self XPU % XPU total XPU time avg # of Calls
223+
# ------------------------------------------------------- ------------ ------------ ------------ ------------ ------------
224+
# model_inference 0.000us 0.00% 2.567ms 2.567ms 1
225+
# aten::conv2d 0.000us 0.00% 1.871ms 93.560us 20
226+
# aten::convolution 0.000us 0.00% 1.871ms 93.560us 20
227+
# aten::_convolution 0.000us 0.00% 1.871ms 93.560us 20
228+
# aten::convolution_overrideable 1.871ms 72.89% 1.871ms 93.560us 20
229+
# gen_conv 1.484ms 57.82% 1.484ms 74.216us 20
230+
# aten::batch_norm 0.000us 0.00% 432.640us 21.632us 20
231+
# aten::_batch_norm_impl_index 0.000us 0.00% 432.640us 21.632us 20
232+
# aten::native_batch_norm 432.640us 16.85% 432.640us 21.632us 20
233+
# conv_reorder 386.880us 15.07% 386.880us 6.448us 60
234+
# ------------------------------------------------------- ------------ ------------ ------------ ------------ ------------
235+
# Self CPU time total: 712.486ms
236+
# Self XPU time total: 2.567ms
240237
#
241238

242-
243239
######################################################################
244240
# Note the occurrence of on-device kernels in the output (e.g. ``sgemm_32x32x32_NN``).
245241

0 commit comments

Comments
 (0)