File tree Expand file tree Collapse file tree 1 file changed +0
-11
lines changed
Expand file tree Collapse file tree 1 file changed +0
-11
lines changed Original file line number Diff line number Diff line change @@ -93,15 +93,4 @@ used to compute the entire matrix $C$.
9393Eigen is used to generate data and compute the GEMM result on the CPU.
9494In addition, error computing and time profiling code are implemented for
9595the GPU computing result. For details, see
96- [ first_attempt.cu] ( https://github.com/openmlsys/openmlsys-cuda/blob/main/first_attempt.cu ) .
97- After the program is compiled and executed, output results are as
98- follows:
9996
100- Average time: 48.961 ms
101- Max error: 0.000092
102-
103- The peak GPU throughput can be approximated by using the following
104- formula: 2 $\times$ Frequency $\times$ Number of single-precision
105- compute units. The number of single-precision compute units equals the
106- number of SMs in the GPU multiplied by the number of single-precision
107- compute units in each SM. The results are as follows:
You can’t perform that action at this time.
0 commit comments