Can two nodes connect to the same GPU using tensor-fusion? Has anyone tried running collective operations (for example, with nccl-tests)?