 To best use profiling data, it's important to understand the role of various
-hardware blocks of AMD Instinct accelerators. This section describes each
-hardware block on the accelerator as interacted with by a software developer to
+hardware blocks of AMD Instinct accelerators. Refer to the following top-level GPU architecture diagrams to understand the hardware blocks of each architecture.
+
+.. tab-set::
+
+   .. tab-item:: CDNA
+
+      .. image:: ../data/conceptual/CDNA.png
+         :alt: CDNA top-level architecture diagram with zoomed view of Compute unit
+
+   .. tab-item:: CDNA2
+
+      .. image:: ../data/conceptual/CDNA2.png
+         :alt: CDNA2 top-level architecture diagram with zoomed view of Compute unit
+
+   .. tab-item:: CDNA3
+
+      .. image:: ../data/conceptual/CDNA3.png
+         :alt: CDNA3 top-level architecture diagram with zoomed view of Accelerator Complex Dies (XCDs)
+
+   .. tab-item:: CDNA4
+
+      .. image:: ../data/conceptual/CDNA4.png
+         :alt: CDNA4 top-level architecture diagram
+
+This section describes each hardware block on the accelerator as interacted with by a software developer to
 give a deeper understanding of the metrics reported by profiling data. Refer to
 :doc:`/tutorial/profiling-by-example` for more practical examples and details on how
 to use ROCm Compute Profiler to optimize your code.
@@ -24,15 +160,12 @@ to use ROCm Compute Profiler to optimize your code.
 
 .. note::
 
-   In this chapter, **MI2XX** refers to any of the CDNA2 architecture-based AMD
+   In this documentation, **MI2XX** refers to any of the CDNA2 architecture-based MI200 series accelerators such as AMD
    Instinct MI250X, MI250, and MI210 accelerators interchangeably in cases
-   where the exact product at hand is not relevant.
+   where the exact product at hand is not relevant. For product details, see `AMD Instinct GPUs <https://www.amd.com/en/products/accelerators/instinct.html>`_.
 
 For a comparison of AMD Instinct accelerator specifications, refer to
-:doc:`Hardware specifications <rocm:reference/gpu-arch-specs>`. For product
-details, see the :prod-page:`MI250X <mi200/mi250x>`,
-:prod-page:`MI250 <mi200/mi250>`, and :prod-page:`MI210 <mi200/mi210>`