Skip to content

Commit f3a8d5f

Browse files
authored
FPGA: Update QoR numbers for the Convolution sample (#2348)
Update readme with latest QoR numbers
1 parent 8f9c38f commit f3a8d5f

File tree

1 file changed

+6
-5
lines changed
  • DirectProgramming/C++SYCL_FPGA/ReferenceDesigns/convolution2d

1 file changed

+6
-5
lines changed

DirectProgramming/C++SYCL_FPGA/ReferenceDesigns/convolution2d/README.md

Lines changed: 6 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -56,20 +56,21 @@ You can also find more information about [troubleshooting build errors](/DirectP
5656
5757
### Performance
5858

59-
Performance results are based on testing conducted with a pre-release version of oneAPI 2024.1, with released Intel® Quartus® Prime Pro Edition 23.3 software. Testing was conducted January 22, 2024. Area and f<sub>MAX</sub> estimates are averaged across 8 seeds.
59+
Performance results are based on testing conducted with a pre-release version of oneAPI 2024.2, with released Intel® Quartus® Prime Pro Edition 24.1 software. Testing was conducted May 25, 2024. Area and f<sub>MAX</sub> estimates are averaged across 8 seeds.
6060
* These area estimates are ONLY for the `Convolution2d` kernel, and do not include the `RGB2Grey` or `Grey2RGB` kernels. You can compile the design with only the `Convolution2d` kernel by compiling with the `-DTEST_CONV2D_ISOLATED=1` compiler flag, or by adding `#define TEST_CONV2D_ISOLATED 1` in `src/main.cpp`.
6161
* These estimates were achieved by setting a 600 MHz clock target for the `Agilex7` device. You can set the clock target by adding the `-Xsclock=600MHz` flag to CMakeLists.txt, or by passing it to the `cmake` command as shown in [Building the `convolution2d` Tutorial](#building-the-convolution2d-tutorial).
62+
* The reported fMAX is the 'restricted fMAX' as reported by Intel® Quartus® Prime.
6263

6364
> **Note**: Refer to the [Performance Disclaimers](/DirectProgramming/C++SYCL_FPGA/README.md#performance-disclaimers) section for important performance information.
6465
6566
#### Intel Agilex® 7 FPGA
6667

6768
| Parallel Pixels | Window Dimensions | Coefficient Type | Input Type | f<sub>MAX</sub> (MHz) | ALMs | DSP blocks | M20K Block RAM
6869
|--- |--- |--- |--- |--- |--- |--- |---
69-
| 1 | 3x3 | `float` | 10-bit Integer | 639.8 | 2742 | 9 | 19
70-
| 2 | 3x3 | `float` | 10-bit Integer | 639.8 | 4326 | 18 | 19
71-
| 4 | 3x3 | `float` | 10-bit Integer | 639.8 | 7341 | 36 | 18
72-
| 8 | 3x3 | `float` | 10-bit Integer | 639.8 | 13791 | 72 | 19
70+
| 1 | 3x3 | `float` | 10-bit Integer | 639.8 | 3026 | 9 | 19
71+
| 2 | 3x3 | `float` | 10-bit Integer | 639.8 | 4618 | 18 | 19
72+
| 4 | 3x3 | `float` | 10-bit Integer | 639.8 | 7677 | 36 | 18
73+
| 8 | 3x3 | `float` | 10-bit Integer | 639.8 | 14410 | 72 | 19
7374

7475
> **Note**: This design uses a relatively large number of ALM resources because of the floating-point conversions in `ConvolutionFunction()` in `src/convolution_kernel.hpp`. The coefficients for this design were specified as floating-point for maximal flexibility in coefficient values, but the enthusiastic user is encouraged to convert this function to fixed-point using the `ac_fixed` types, as described in [this sample](/DirectProgramming/C%2B%2BSYCL_FPGA/Tutorials/Features/ac_fixed).
7576

0 commit comments

Comments
 (0)