Hi! Thank you for your excellent works and codes, but recently I confronted some compiling problems in my server whose environment is cuda11.3 and arch_sm=86.
The issues are reported as below:
"ptxas /tmp/tmpxft_0006c5ba_00000000-6_block6x6_pcg_weber.ptx, line 4136; error : Instruction 'shfl' without '.sync' is not supported on .target sm_70 and higher from PTX ISA version 6.4"
wish to get reply ~