You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
- We have :math:`13.2304\cdot10^10` instructions per second.
20
21
That are :math:`13.2304\cdot10^10 / 8 = 16.538\cdot10^9` instructions per ALU per second.
21
-
This aligns with a **throughput of :math:`\approx 4` instruction per cycle**, as it is known from benchmarks that the performance cores of the M4 chip have a clock speed of 4.4 GHz.
22
+
This aligns with a **throughput of** :math:`\approx4` **instruction per cycle**, as it is known from benchmarks that the performance cores of the M4 chip have a clock speed of 4.4 GHz.
23
+
22
24
25
+
**FMLA (vector) with arrangement specifier ``2S``**
- We have :math:`6.65221\cdot10^10` instructions per second.
29
31
That are :math:`6.65221\cdot10^10 / 8 = 8.31526\cdot10^9` instructions per ALU per second.
30
-
This aligns with a **throughput of :math:`\approx 2` instruction per cycle**, as it is known from benchmarks that the performance cores of the M4 chip have a clock speed of 4.4 GHz.
32
+
This aligns with a **throughput of** :math:`\approx2` **instruction per cycle**, as it is known from benchmarks that the performance cores of the M4 chip have a clock speed of 4.4 GHz.
- We have :math:`1.12728\cdot10^10` instructions per second.
38
41
That are :math:`1.12728\cdot10^10 / 8 = 1.4091\cdot10^9` instructions per ALU per second.
39
-
This aligns with a **throughput of :math:`\approx 1/3` instruction per cycle**, as it is known from benchmarks that the performance cores of the M4 chip have a clock speed of 4.4 GHz.
42
+
This aligns with a **throughput of**:math:`\approx1/3` **instruction per cycle**, as it is known from benchmarks that the performance cores of the M4 chip have a clock speed of 4.4 GHz.
40
43
41
44
42
45
1. Microbenchmark the execution latency of FMLA (vector) with arrangement specifier 4S. Consider the following two cases:
0 commit comments