You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/how-to/profile/mode.rst
+4-4Lines changed: 4 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -197,7 +197,7 @@ an Instinct MI210 vs an Instinct MI250.
197
197
``sysinfo.csv``, is created to reflect the target device settings. All
198
198
profiling output is stored in ``log.txt``. Roofline-specific benchmark
199
199
results are stored in ``roofline.csv`` and roofline plots are outputted into PDFs as
200
-
``empirRoof_gpu-0_[datatype1]_..._[datatypeN].pdf`` where datatypes requested through
200
+
``empirRoof_gpu-0_[datatype1]_..._[datatypeN].pdf`` where data types requested through
201
201
``--roofline-data-type`` option are listed in the file name.
202
202
203
203
.. code-block:: shell-session
@@ -477,11 +477,11 @@ Roofline options
477
477
running a roofline benchmark on your system.
478
478
479
479
``--roofline-data-type <datatype>``
480
-
Allows you to specify datatypes that you want plotted in the roofline PDF output(s). Selecting more than one datatype will overlay the results onto the same plot. Default: FP32
480
+
Allows you to specify data types that you want plotted in the roofline PDF output(s). Selecting more than one data type will overlay the results onto the same plot. Default: FP32
481
481
482
482
.. note::
483
483
484
-
For more information on datatypes supported based on the GPU architecture, see :doc:`../../conceptual/performance-model`
484
+
For more information on data types supported based on the GPU architecture, see :doc:`../../conceptual/performance-model`
485
485
486
486
To distinguish different kernels in your ``.pdf`` roofline plot use
487
487
``--kernel-names``. This will give each kernel a unique marker identifiable from
@@ -525,7 +525,7 @@ successfully.
525
525
526
526
.. note::
527
527
528
-
* ROCm Compute Profiler currently captures roofline profiling for all data types, but has the ability to reduce clutter in the PDF outputs by selecting datatype(s). Selecting multiple datatypes will overlay the results into the same PDF. If the user would like separate PDFs for each datatype off of the same workload run, the user can run the profiling command again with the single datatype as long as the roofline.csv still exists in the workload folder.
528
+
* ROCm Compute Profiler currently captures roofline profiling for all data types, and you can reduce the clutter in the PDF outputs by filtering the data type(s). Selecting multiple data types will overlay the results into the same PDF. To generate results in separate PDFs for each data type from the same workload run, you can re-run the profiling command with each data type as long as the ``roofline.csv`` file still exists in the workload folder.
529
529
* Roofline feature is currently not enabled on AMD Instinct MI350.
530
530
531
531
The following image is a sample ``empirRoof_gpu-0_FP32.pdf`` roofline
0 commit comments