You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: DirectProgramming/C++SYCL_FPGA/ReferenceDesigns/board_test/README.md
+41-51Lines changed: 41 additions & 51 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -264,7 +264,7 @@ The tests listed above check the following interfaces in a platform:
264
264
265
265
## Example Output
266
266
267
-
Running on FPGA device (Terasic’s DE10-Agilex Development Board). Performance results are based on testing as of August 30, 2023.
267
+
Running on FPGA device (Intel® FPGA SmartNIC N6001-PL). Performance results are based on testing as of May 10, 2024.
268
268
269
269
> **Note**: Refer to the [Performance Disclaimers](/DirectProgramming/C++SYCL_FPGA/README.md#performance-disclaimers) section for important performance information.
270
270
@@ -287,21 +287,21 @@ The tests are:
287
287
Note: Kernel Clock Frequency is run along with all tests except 1 (Host Speed and Host Read Write test)
288
288
289
289
Running all tests
290
-
Running on device: de10_agilex : Agilex Reference Platform (aclde10_agilex0)
290
+
Running on device: ofs_n6001 : Intel OFS Platform (ofs_ee00000)
Copy file name to clipboardExpand all lines: DirectProgramming/C++SYCL_FPGA/ReferenceDesigns/cholesky_inversion/README.md
+4-5Lines changed: 4 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -79,11 +79,10 @@ Performance results are based on testing as of April 26, 2022.
79
79
80
80
| Device | Throughput
81
81
|:--- |:---
82
-
| Terasic’s DE10-Agilex Development Board| 415k matrices/s for real matrices of size 32x32
82
+
| Intel® FPGA SmartNIC N6001-PL | 389k matrices/s for real matrices of size 32x32
83
83
84
84
## Key Implementation Details
85
85
86
-
In this reference design, the Cholesky decomposition algorithm is used to factor a real _n_ × _n_ matrix. The algorithm computes the vector dot product of two rows of the matrix. In our FPGA implementation, the dot product is computed in a loop over the row's _n_ elements. The loop is fully unrolled to maximize throughput. As a result, *n* real multiplication operations are performed in parallel on the FPGA, followed by sequential additions to compute the dot product result.
87
86
88
87
With this optimization, our FPGA implementation requires _n_ DSPs to compute the real floating point dot product. The input matrix is also replicated two times in order to be able to read two full rows per cycle. The matrix size is constrained by the total FPGA DSP and RAM resources available.
89
88
@@ -320,11 +319,11 @@ You can apply the Cholesky-based inversion to 8 matrices repeated a number of ti
320
319
## Example Output
321
320
322
321
```
323
-
Running on device: de10_agilex : Agilex Reference Platform (aclde10_agilex0)
322
+
Running on device: ofs_n6001 : Intel OFS Platform (ofs_ee00000)
324
323
Generating 8 random real matrices of size 32x32
325
324
Computing the Cholesky-based inversion of 8 matrices 819200 times
Copy file name to clipboardExpand all lines: DirectProgramming/C++SYCL_FPGA/ReferenceDesigns/crr/README.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -61,9 +61,9 @@ Performance results are based on testing as of August 30, 2023.
61
61
62
62
> **Note**: Refer to the [Performance Disclaimers](/DirectProgramming/C++SYCL_FPGA/README.md#performance-disclaimers) section for important performance information.
63
63
64
-
| Device | Throughput
65
-
|:--- |:---
66
-
| Terasic’s DE10-Agilex Development Board| 653 assets/s
Copy file name to clipboardExpand all lines: DirectProgramming/C++SYCL_FPGA/ReferenceDesigns/matmul/README.md
+6-6Lines changed: 6 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -60,7 +60,7 @@ Performance results are based on testing as of March 6, 2023.
60
60
61
61
| Device | Throughput
62
62
|:--- |:---
63
-
| Terasic’s DE10-Agilex Development Board| 144k matrices/s for single-precision floating-point matrices of size 64 * 64, computed using a systolic array of 8 * 8 PEs (64 DSPs)
63
+
| Intel® FPGA SmartNIC N6001-PL | 142k matrices/s for single-precision floating-point matrices of size 64 * 64, computed using a systolic array of 8 * 8 PEs (64 DSPs)
64
64
65
65
## Key Implementation Details
66
66
@@ -354,16 +354,16 @@ You can perform the multiplication of the set of matrices repeatedly. This step
354
354
355
355
## Example Output
356
356
357
-
Example output when running on **Terasic’s DE10-Agilex Development Board** for the multiplication of 8 matrices 819200 times (each matrix consisting of 64x64 single-precision floating point numbers, computed using a systolic array of 8x8 PEs).
357
+
Example output when running on **Intel® FPGA SmartNIC N6001-PL** for the multiplication of 8 matrices 819200 times (each matrix consisting of 64x64 single-precision floating point numbers, computed using a systolic array of 8x8 PEs).
358
358
359
359
```
360
-
Running on device: de10_agilex : Agilex Reference Platform (aclde10_agilex0)
360
+
Running on device: ofs_n6001 : Intel OFS Platform (ofs_ee00000)
361
361
Matrix A size: 64 x 64 (tile: 8 x 64)
362
362
Matrix B size: 64 x 64 (tile: 64 x 8)
363
363
Systolic array size: 8 x 8 PEs
364
364
Running matrix multiplication of 2 matrices 819200 times
365
-
Total duration: 11.3746 s
366
-
Throughput: 144.04k matrices/s
365
+
Total duration: 11.4577 s
366
+
Throughput: 142.995k matrices/s
367
367
368
368
PASSED
369
369
```
@@ -372,4 +372,4 @@ PASSED
372
372
373
373
Code samples are licensed under the MIT license. See [License.txt](/License.txt) for details.
374
374
375
-
Third party program Licenses can be found here: [third-party-programs.txt](/third-party-programs.txt).
375
+
Third party program Licenses can be found here: [third-party-programs.txt](/third-party-programs.txt).
0 commit comments