@@ -10,22 +10,62 @@ In this Example we highlight the Synchronous and Asynchronous data
1010transfer between SSD and FPGA and capture the performance numbers for
1111p2p transfer, Kernel and XDMA.
1212
13- Following is the real log reported while running the design on samsung
14- platform:
13+ By default synchronous flow is ran and following is the real log
14+ reported while running the design on samsung platform:
1515
1616::
1717
18- ./host copy_kernel.xclbin -w /dev/nvme0n1
18+ ./p2p_bandwidth -x copy_kernel.xclbin -i /dev/nvme0n1
1919 INFO: Successfully opened NVME SSD /dev/nvme0n1
2020 INFO: Preparing 1048576KB test data in 8 pipelines
2121 ############################################################
2222 Synchronous P2P
2323 ############################################################
2424 INFO: Kick off test
2525 HOST -> FPGA(host BO) -> FPGA(p2p BO) -> SSD
26- overall 556211ms 100.00% 1841 .03MB/s
27- p2p 359223ms 64 .58% 2850 .60MB/s
28- kernel 199578ms 35 .88% 5130 .83MB/s
29- XDMA 313203ms 56.31 % 3269 .45MB/s
26+ overall 556211ms 100.00% 1861 .03MB/s
27+ p2p 359223ms 57 .58% 3250 .60MB/s
28+ kernel 199578ms 207 .88% 871 .83MB/s
29+ XDMA 313203ms 99.11 % 1866 .45MB/s
3030 INFO: Evaluating test result
3131 INFO: Test passed
32+
33+ If we need to run the asynchronous flow, build the host with
34+ ENABLE_ASYNC_FLOW=yes :
35+
36+ .. code :: cpp
37+
38+ make host ENABLE_ASYNC_FLOW=yes
39+
40+ Following is the real log reported while running the design
41+ on samsung platform for asynchronous flow:
42+
43+ ::
44+
45+ ./p2p_bandwidth -x copy_kernel.xclbin -i /dev/nvme0n1
46+ INFO: Successfully opened NVME SSD /dev/nvme0n1
47+ INFO: Preparing 1048576KB test data in 8 pipelines
48+ ############################################################
49+ Synchronous P2P
50+ ############################################################
51+ INFO: Kick off test
52+ HOST -> FPGA(host BO) -> FPGA(p2p BO) -> SSD
53+ overall 551211ms 100.00% 1862.23MB/s
54+ p2p 357223ms 58.58% 3200.30MB/s
55+ kernel 192578ms 211.88% 879.89MB/s
56+ XDMA 315203ms 99.31% 1867.35MB/s
57+ INFO: Evaluating test result
58+ INFO: Test passed
59+ ############################################################
60+ Asynchronous P2P
61+ ############################################################
62+ INFO: Kick off test
63+ HOST -> FPGA(host BO) -> FPGA(p2p BO) -> SSD
64+ overall 556211ms 100.00% 1548.03MB/s
65+ p2p 359223ms 61.58% 3371.60MB/s
66+ kernel 199578ms 198.88% 697.83MB/s
67+ XDMA 313203ms 98.23% 1806.45MB/s
68+ INFO: Evaluating test result
69+ INFO: Test passed
70+
71+
0 commit comments