forked from linux-rdma/perftest
-
Notifications
You must be signed in to change notification settings - Fork 0
Rebase and merge min_bw from downstream #4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Open
liayan
wants to merge
200
commits into
master
Choose a base branch
from
ly/merge-min-bw-from-downstream
base: master
Could not load branches
Branch not found: {{ refName }}
Loading
Could not load tags
Nothing to show
Loading
Are you sure you want to change the base?
Some commits from the old base branch may be removed from the timeline,
and old review comments may become outdated.
Open
Changes from 1 commit
Commits
Show all changes
200 commits
Select commit
Hold shift + click to select a range
ba4580a
add printf
4b9c639
Revert "Perftest: replace rand() with getrandom() during MR buffer in…
HassanKhadour 8ff29c1
modify --source_ip to --bind_sounce_ip to fix init connection establi…
ff1a84a
Fix issue with PD deallocation.
pim-pesochek 9711d16
Merge pull request #210 from w180112/master
HassanKhadour 4ad453f
Merge pull request #213 from HassanKhadour/master
HassanKhadour 0fc987c
Merge pull request #215 from pim-pesochek/master
HassanKhadour f226ca2
Perftest: Align WQE length to MTU in case of shared queue
HassanKhadour 7d00c4b
Perftest: Version increase to 6.16
HassanKhadour 5a218cd
Add ipv6 address support for initial communication.
HassanKhadour abf6dd9
Perftest: Fix and optimize initial communication functions.
HassanKhadour 56a53ec
Merge pull request #218 from HassanKhadour/master
HassanKhadour dbf395e
Fix race in non-rdmacm ctx_close_connection()
rolandd d051db0
Add support for DMA-buffers in Neuron devices
YonatanNachum 9c4f8ed
Add missing HW accelerator flags to perftest's man
YonatanNachum 48c0974
Perftest: Fix limit_bw in ib_send_bw bidir traffic duration mode
HassanKhadour 47a7de5
Merge pull request #222 from YonatanNachum/neuron
HassanKhadour 1e211ce
Merge pull request #220 from rolandd/fix-shutdown
HassanKhadour c1f8d3b
Perftest: Version increase to 6.17
HassanKhadour e3f0700
Perftest: changing spec file version to 23.10.0.
HassanKhadour 69bc866
perftest: support set flow_label in GRH
changchengx 5856a7f
Merge pull request #224 from changchengx/flow_label
HassanKhadour 59fb505
Error out if CuDeviceGetByPCIBusId fails
jithinjosepkl 5b7407d
Perftest: Add missing newline characters for error messages
ddmatsu 0b5bfb6
Perftest: Print an error message to stderr
ddmatsu 8c0f1cc
Minor update to error log
jithinjosepkl c1a401f
Merge pull request #227 from jithinjosepkl/patch-1
HassanKhadour 8eef5cb
Update perftest_resources.c
ecjtusbs 4d05679
Merge pull request #228 from ddmatsu/newlines
HassanKhadour 5607b0b
Merge pull request #229 from ecjtusbs/master
HassanKhadour c09edf9
Change index to the right data type
HassanKhadour dffd1dd
perftest: Add minimum rx_depth size in case of UD qp type
HassanKhadour fd329b1
Add option for write-with-immediate verb for write_lat test
rauteric d977590
Add write-with-immediate option for write_bw test
rauteric d18f441
Fix wrong-result bug in write_bw-with-imm duration mode
rauteric dcfcaa0
Perftest: Version increase to 6.18
HassanKhadour 45eb075
Perftest: changing spec file version to 24.01.0.
HassanKhadour 3665d98
Merge pull request #230 from rauteric/lat_write_imm_simplified
HassanKhadour a43ab3c
feat: add more valid RATE_VALUES
FujiZ 5ff3afa
Perftest: fix completion count with special combination of tx_depth/c…
sshaulnv 0da9a02
Fix Neuron dmabuf uninitialized offset
mrgolin f9e82f7
Make output JSON standard
blochl 9afcc72
Merge pull request #239 from blochl/lb-dev2
HassanKhadour 22ac823
Merge pull request #237 from mrgolin/fix-neuron-offset
HassanKhadour aadec43
Merge pull request #235 from FujiZ/add-rate
HassanKhadour d34bf0f
Display a warning if BW peak measurement was disabled implicitly
blochl 7fa7374
Merge pull request #238 from blochl/lb-dev
HassanKhadour 5191f3e
perftest: Add Broadcom gen p7 adapter device ids
selvintxavier ecab99a
Merge pull request #242 from selvintxavier/genp7_devid
HassanKhadour 8f441d3
Perftest: define use_write_with_imm_flag outside HAVE_AES_XTS ifdef
HassanKhadour def27db
Perftest: Version increase to 6.19
HassanKhadour d414c55
Revert "perftest: Add minimum rx_depth size in case of UD qp type"
HassanKhadour 5039890
Add support for loongarch64.
cnmushiba afe39a9
Perftest: Version increase to 6.20
HassanKhadour 4a83eea
add support for ROCm6.0 API changes
edgargabriel ada1a29
add support for ROCm6.0 API changes (II)
edgargabriel 921730a
Both the server and client call rdma_disconnect() synchronously, When…
Anumula-Murali-Mohan-Reddy 248d1b7
Fix man page handling for out-of-tree builds
chuckcranor 7044f45
Merge pull request #244 from cnmushiba/la64-dev
HassanKhadour f6fdf37
Merge pull request #245 from edgargabriel/topic/rocm-6.0-support
HassanKhadour 05317fd
Merge pull request #246 from Anumula-Murali-Mohan-Reddy/master
HassanKhadour 6f30da3
Merge pull request #248 from chuckcranor/master
HassanKhadour f5b054f
Perftest: Version increase to 6.21
HassanKhadour c3ecbd8
Perftest: Add condition to force stop polling in write_with_imm
HassanKhadour 7e34f40
Perftest: changing spec file version to 24.04.0.
HassanKhadour 366b209
Fix type in print
cliffburdick fe1d7aa
Ignore generated config files from git
agirault 9ed5637
Fix out-of-tree man page build when top_srcdir is an absolute path
chuckcranor 592e09e
Fix Integer overflow issue
sshaulnv ed44d77
Support RDMA with Tegra integrated GPUs
agirault 1d202bb
Display proper unit for MiB/sec results
agirault 2cdf25b
Perftest: changing spec file version to 24.07.0.
HassanKhadour 8bbfe3c
Perftest: Version increase to 6.22
HassanKhadour 1a11412
Perftest: Fix indentation between results after the fix of the BW unit
HassanKhadour b61a9e7
Fix format of man pages
6ab70de
Fix syntax of script run_perftest_loopback
7c282ef
Implement get_cycles for hppa architecture
8867689
Perftest: Fix rx_depth check for XRC
cdc9e27
Perftest: Support selecting congestion control algorithms
2e3aa2f
perftest: Add minimum rx_depth size in case of SRQ and UD/UC qp type
sshaulnv febba14
Add support for Cambricon devices
463300b
Add 0xefa3 pciid to the database
mrgolin 2e88853
Perftest: add error message for DC runs with small queue depth
sshaulnv b6f957f
Perftest: Add support for TD lock-free mode
bbb237d
Perftest: Version increase to 6.23
sshaulnv ccae524
perftest: Set Ack timeout for rdma_cm connection id
8f87972
Perftest: changing spec file version to 24.10.0
sshaulnv d7989b2
Perftest: Fix TD lock-free mode not working for QP
279d92e
Write cuda device id to json file
shaulerez ad2a685
Add support for SRD unsolicited write w/ imm. receive
mrgolin fba7ce7
Perftest: Fix failure in creating cq when create cq ex is not support…
2fb0e05
Merge pull request #286 from hginjgerx/td
sshaulnv d2def67
Merge pull request #268 from mrgolin/unsolicited-write-recv
sshaulnv f136038
create_comm_struct: Copy in user_param->qp_timeout
b8aa202
Set qp_timeout for create_rdma_cm_connection path
5c29996
Merge pull request #288 from raphael-s-norwitz/fix-qp-timeout-for-cm
sshaulnv 91fadb5
Perftest: fix qp_timeout with rdma_cm and UD
sshaulnv 01be1f4
Support hl (#1)
sdashevsky-ai ecaccf0
Perftest: add DDP support
sshaulnv 1ed34b1
perftest: Turn on comp_mask create flag and add DDP indication
sshaulnv 60e87cc
Perftest: Add flag to disable DDP
sshaulnv b88cad2
Perftest: add a check for the max ooo_recv_wrs_caps
sshaulnv afc2277
Perftest: Version increase to 6.24
sshaulnv a765cb2
Perftest: changing spec file version to 25.01.0
sshaulnv f51c4f0
Perftest: change exit flow in run_infinitely mode
sshaulnv 9616ab9
Perftest: disable infinitely trap over client in SEND
sshaulnv a21bdbe
Merge pull request #293 from sshaulnv/master
sshaulnv 289c058
Perftest: Add connectionless server support for multicast traffic
sshaulnv 45f9f32
Perftest: Add support for CONNECTX9
sshaulnv d6e1c86
Use zero size receive buffers for write w/imm
3f012ee
Merge pull request #301 from mrgolin/write-recv-buf
sshaulnv aec645e
perftest: support set flow_label list val in GRH with RR method
changchengx d61f8fb
Merge pull request #279 from changchengx/flow_label
sshaulnv 1926a1f
fix core dump issue when disable HAVE_IBV_WR_API
tianx666 5ae53ba
dmabuf: Add data-direct option to dmabuf
ShacharKagan 937074b
debian: Rely on dh_shlibdeps for determining the dependencies
bdrung 384add6
debian: add missing build dependency on libibumad-dev and libpci-dev
bdrung 54d8147
debian: Bump debhelper to version 10
bdrung bba6e13
perftest: update payload_file_path & flow_label format man page
changchengx f5c3035
Enable dmabuf to ROCm
paklui 9c7235c
fix some build and linker issues
paklui 2d6ce96
Fixed issue with loss of one payload part due to skip of one address
43a5e4f
fix compilation warnings
paklui d125f1e
dmabuf: Update dmabuf flag for data-direct traffic
ShacharKagan 6730e97
Perftest: enable pcie mapping type only if supported by cuda
sshaulnv 6bd09ff
Merge pull request #304 from tianx666/master
sshaulnv bfd9b40
Merge branch 'linux-rdma:master' into master
sdashevsky-ai e97f4cf
Fixed CR styling notes
sdashevsky-ai 9f0b86c
Perftest: Fix for loop initial declarations
sshaulnv b58e46e
Add DMA-BUF option for ROCm in man page
paklui 70d3aa2
Merge pull request #309 from paklui/master-dmabuf-rocm
sshaulnv 2465f54
Perftest: Fix Cppcheck warnings in rocm_memory.c
sshaulnv 02bb98c
Fix size_t formatting
sdashevsky-ai 7131375
Merge pull request #307 from bdrung/debian-dependencies
sshaulnv 59e2c61
Merge pull request #292 from sdashevsky-ai/master
sshaulnv 27acd3a
Properly handle the error case when running data_direct with binaries…
drossetti 3a37aef
add runtime check for data direct support in the GPU driver
drossetti f339bdf
fix typo
drossetti 38acb9d
Perftest: Fix configure variable for data_direct
sshaulnv ec748de
fix autoconf issue with detection of CUDA_MEM_RANGE_FLAG_DMA_BUF_MAPP…
drossetti 629ae8e
fix another typo
drossetti 32e807a
improve error diagnostic
drossetti 84a5f73
Merge pull request #317 from drossetti/fixes
sshaulnv fea0734
Perftest: Clarify flag symmetry in app help
sshaulnv f9e4f50
Perftest: Add IPv6 support to bind_source_ip
sshaulnv 05df6d6
Perftest: Optimize cqe polling batch
sshaulnv f5adde7
Perftest: Fix TD lock-free mode not working for SRQ/XRC QP
81b94cf
Merge pull request #310 from changchengx/data_pattern
sshaulnv 1598b00
fix typo and align code format
changchengx 989d224
Perftest: remove is_contig_supported
changchengx 9a0e76a
rename var to align with semantic
changchengx d59fa4d
remove unnecessary parameter
changchengx 8428b92
Merge pull request #299 from changchengx/part_clean
sshaulnv 800ae2d
Perftest: random buffer initialization optimization
sshaulnv 008457c
Perftest: Do not align SRQ recv length to MTU for hns
5b86637
Perftest: Adding support for new CUDA memory types
059911f
Perftest: Add gpu_touch flag to test GPU buffer accesses interference
a4bd71e
Perftest: Add OpenCL memory type support
89aafcf
Perftest: Adding CUDA support for gpu_touch and lunching CUDA kernels
a46bfac
Perftest: Adding CCFLAGS and LDFLAGS to load libcudart
RoeyAzran1992 b8900f8
fixing code review comments
RoeyAzran1992 f5010dc
Bugfix - using the same stop flag for all running GPU kerenls (redund…
RoeyAzran1992 5588860
Perftest: optimize buffer init with 32-bit writes
sshaulnv c17219b
initializing stop touch GPU mem indicator only when touching flag is set
RoeyAzran1992 22e7c7f
code review - fixing printing type
RoeyAzran1992 c04922f
Merge pull request #319 from RoeyAzran1992/master
sshaulnv 045b162
Cuda: Use pcie mapping regardless of data direct
dkkranz 41d8dba
Perftest: Version increase to 6.25
sshaulnv 446dea7
Perftest: changing spec file version to 25.04.0
sshaulnv 9ab7ec0
Perftest: explicitly include standard C++ library
sshaulnv 6a3ddbe
Perftest: Make cudart linkage optional due to nvcc and GCC incompatib…
sshaulnv ff45fd4
Perftest: add cudart support info to README
sshaulnv d3330e5
Merge pull request #312 from TamaraBabayan/master
sshaulnv 766c13b
Perftest: Fix perform warm up process stuck
640b064
Merge pull request #323 from hginjgerx/td
sshaulnv 0b26bc3
Merge pull request #324 from hginjgerx/srq
sshaulnv abc99f2
Merge pull request #325 from dkkranz/use_pcie_mapping
sshaulnv 0bdfc10
Add support for DMA-buffers in Cambricon devices
0490645
Perftest supports MLU latency tests with read/send verbs only
9569a3f
Perftest: Add GitHub actions support
sshaulnv 64330bb
Merge pull request #329 from sshaulnv/master
sshaulnv f961e40
Perftest: Add null-mr support over server side
sshaulnv ae68798
Added TCU support
iyangsj 2af4110
add Yunsilicon dev types
tianx666 4d645d4
Merge pull request #320 from hc235280/support_ib_send_lat
sshaulnv ce4f20c
Merge pull request #330 from iyangsj/master
sshaulnv e503581
Merge branch 'master' into master
tianx666 9600494
Merge pull request #331 from tianx666/master
sshaulnv cd69ae0
Perftest: disable Scatter2CQE with gpudirect send_lat
sshaulnv 920087f
Perftest: fix OOO_RECV_WRS enablement flow
sshaulnv 5be2b4e
Perftest: Add print in case blueflame is not supported
maorgottlieb 4cef9b5
Perftest: Version increase to 6.26
sshaulnv 14ae7a0
Perftest: changing spec file version to 25.07.0
sshaulnv c543c84
Perftest: Dynamic CUDA linking
sshaulnv 89d0d54
feat(perftest): add --report-min-bw=X to measure the min bandwidth ov…
antgun42 c5c58ba
feat(perftest): report_min_bw_cycles must be uint64 to prevent wrappi…
antgun42 97510f1
fix(perftest): more simple and robust logic for measuring batch duration
antgun42 d21ca8d
feat(perftest): add report-min-bw to man page
antgun42 0adb524
feat(perftest): check dependencies for report-min-bw
antgun42 c7e9508
Merge branch 'master' into ly/merge-min-bw-from-downstream
liayan File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
You are viewing a condensed version of this merge commit. You can view the full changes here.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This seems redundant