Describe your performance question
As shown in the table, when running ./transfer_engine_bench on two H20 servers, the bandwidth initially increases and then decreases as the batch size increases.
The experimental details are as follows:
export MC_WORKERS_PER_CTX=1
export MC_MAX_WR=2048
./transfer_engine_bench --mode=target --metadata_server=etcd://xxx --local_server_name=abcd:12345 --device_name=mlx5_bond_1 --threads=8 --block_size=8192
./transfer_engine_bench --metadata_server=etcd://xxx --mode=initiator --segment_id=abcd:12345 --device_name=mlx5_bond_1 --threads=8 --batch_size=xx --block_size=8192
Before submitting a new issue...