-
Notifications
You must be signed in to change notification settings - Fork 15k
Description
NVIDIA https://www.nvidia.com/en-us/autonomous-machines/embedded-systems/jetson-thor/ is GA now.
However, #124155 seems to have introduced SM101, which later was changed to SM110 (See cuda13 release notes here: https://docs.nvidia.com/cuda/cuda-toolkit-release-notes/index.html#id2)
Therefore, we need a follow-up PR to rename all the 101 101a etc to their new name.
This would impact downstream Triton -> PyTorch projects.
e.g. using
pytorch cuda 13 nightly: pip3 install --pre torch torchvision --index-url https://download.pytorch.org/whl/nightly/cu130
with the following triton wheel
https://download.pytorch.org/whl/nightly/pytorch_triton-3.4.0%2Bgitf7888497-cp310-cp310-linux_aarch64.whl
the following errors would be encountered:
`python test/inductor/test_control_flow.py CondTests.test_cond_mismatched_branch_output_size_device_cuda_dynamic_False
'sm_110a' is not a recognized processor for this target (ignoring processor)
'sm_110a' is not a recognized processor for this target (ignoring processor)
'sm_110a' is not a recognized processor for this target (ignoring processor)
'sm_110a' is not a recognized processor for this target (ignoring processor)
'sm_110a' is not a recognized processor for this target (ignoring processor)
'sm_110a' is not a recognized processor for this target (ignoring processor)
'sm_110a' is not a recognized processor for this target (ignoring processor)
'sm_110a' is not a recognized processor for this target (ignoring processor)
'sm_110a' is not a recognized processor for this target (ignoring processor)
'sm_110a' is not a recognized processor for this target (ignoring processor)
'sm_110a' is not a recognized processor for this target (ignoring processor)
'sm_110a' is not a recognized processor for this target (ignoring processor)
LLVM ERROR: Cannot select: intrinsic %llvm.nvvm.shfl.sync.bfly.i32
`
cc @ptrblck @tinglvv @eqy @Aidyn-A @atalman @malfet @davidberard98
cc @durga4github