Skip to content

[libtorch] [xnnpack] [qnnpack] [cpuinfo] #46649

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 124 commits into
base: master
Choose a base branch
from

Conversation

EvilMcStevil
Copy link
Contributor

@EvilMcStevil EvilMcStevil commented Jul 29, 2025

  • Changes comply with the maintainer guide.
  • SHA512s are updated for each updated download.
  • The "supports" clause reflects platforms that may be fixed by this new version.
  • Any fixed CI baseline entries are removed from that file.
  • Any patches that are no longer applied are deleted from the port's directory.
  • The version database is fixed by rerunning ./vcpkg x-add-version --all and committing the result.
  • Only one version is added to each modified port's versions file.

Steve and others added 25 commits August 7, 2025 11:28
[1441/1606] C:\PROGRA~1\NVIDIA~2\CUDA\v12.9\bin\nvcc.exe -forward-unknown-to-host-compiler -DABSL_CONSUME_DLL -DAT_PER_OPERATOR_HEADERS -DCUTLASS_ENABLE_CUBLAS=1 -DCUTLASS_ENABLE_CUDNN=1 -DEXPORT_AOTI_FUNCTIONS -DFMT_HEADER_ONLY=1 -DMINIZ_DISABLE_ZIP_READER_CRC32_CHECKS -DNOMINMAX -DONNXIFI_ENABLE_EXT=1 -DONNX_ML=1 -DONNX_NAMESPACE=onnx -DPROTOBUF_USE_DLLS -DTORCH_CUDA_BUILD_MAIN_LIB -DTORCH_CUDA_USE_NVTX3 -DUSE_CUDA -DUSE_EXTERNAL_MZCRC -DUSE_MEM_EFF_ATTENTION -DUSE_MIMALLOC -DWIN32_LEAN_AND_MEAN -D_CRT_SECURE_NO_DEPRECATE=1 -D_UCRT_LEGACY_INFINITY -Dtorch_cuda_EXPORTS -ID:\b\libtorch\x64-windows-release-rel\aten\src -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\aten\src -ID:\b\libtorch\x64-windows-release-rel -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\aten\src\THC -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\aten\src\ATen\cuda -ID:\b\libtorch\x64-windows-release-rel\caffe2\aten\src -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\aten\src\ATen\.. -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\c10\cuda\..\.. -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\c10\.. -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\torch\csrc\api -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\torch\csrc\api\include -isystem D:\installed\x64-windows-release\include -isystem D:\installed\x64-windows-release\include\eigen3 -isystem "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.9\include" -isystem D:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\cmake\..\third_party\cudnn_frontend\include -DLIBCUDACXX_ENABLE_SIMPLIFIED_COMPLEX_OPERATIONS -Xcompiler  /Zc:__cplusplus -Xcompiler /w -w -Xcompiler /FS -Xfatbin -compress-all -DONNX_NAMESPACE=onnx --use-local-env -gencode arch=compute_50,code=sm_50 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -gencode arch=compute_100,code=sm_100 -gencode arch=compute_100a,code=sm_100a -gencode arch=compute_101a,code=sm_101a -gencode arch=compute_120,code=sm_120 -gencode arch=compute_120a,code=sm_120a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --Werror cross-execution-space-call --no-host-device-move-forward --expt-relaxed-constexpr --expt-extended-lambda  -Xcompiler=/wd4819,/wd4503,/wd4190,/wd4244,/wd4251,/wd4275,/wd4522 -Wno-deprecated-gpu-targets --expt-extended-lambda -DCUB_WRAPPED_NAMESPACE=at_cuda_detail -DCUDA_HAS_FP16=1 -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -Xcompiler="-O2 -Ob2" -DNDEBUG -Xcompiler /MD -std=c++17 -Xcompiler=-MD -MD -MT caffe2\CMakeFiles\torch_cuda.dir\__\aten\src\ATen\native\cuda\SegmentReduce.cu.obj -MF caffe2\CMakeFiles\torch_cuda.dir\__\aten\src\ATen\native\cuda\SegmentReduce.cu.obj.d -x cu -c D:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\aten\src\ATen\native\cuda\SegmentReduce.cu -o caffe2\CMakeFiles\torch_cuda.dir\__\aten\src\ATen\native\cuda\SegmentReduce.cu.obj -Xcompiler=-Fdcaffe2\CMakeFiles\torch_cuda.dir\,-FS
FAILED: caffe2/CMakeFiles/torch_cuda.dir/__/aten/src/ATen/native/cuda/SegmentReduce.cu.obj
C:\PROGRA~1\NVIDIA~2\CUDA\v12.9\bin\nvcc.exe -forward-unknown-to-host-compiler -DABSL_CONSUME_DLL -DAT_PER_OPERATOR_HEADERS -DCUTLASS_ENABLE_CUBLAS=1 -DCUTLASS_ENABLE_CUDNN=1 -DEXPORT_AOTI_FUNCTIONS -DFMT_HEADER_ONLY=1 -DMINIZ_DISABLE_ZIP_READER_CRC32_CHECKS -DNOMINMAX -DONNXIFI_ENABLE_EXT=1 -DONNX_ML=1 -DONNX_NAMESPACE=onnx -DPROTOBUF_USE_DLLS -DTORCH_CUDA_BUILD_MAIN_LIB -DTORCH_CUDA_USE_NVTX3 -DUSE_CUDA -DUSE_EXTERNAL_MZCRC -DUSE_MEM_EFF_ATTENTION -DUSE_MIMALLOC -DWIN32_LEAN_AND_MEAN -D_CRT_SECURE_NO_DEPRECATE=1 -D_UCRT_LEGACY_INFINITY -Dtorch_cuda_EXPORTS -ID:\b\libtorch\x64-windows-release-rel\aten\src -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\aten\src -ID:\b\libtorch\x64-windows-release-rel -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\aten\src\THC -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\aten\src\ATen\cuda -ID:\b\libtorch\x64-windows-release-rel\caffe2\aten\src -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\aten\src\ATen\.. -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\c10\cuda\..\.. -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\c10\.. -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\torch\csrc\api -ID:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\torch\csrc\api\include -isystem D:\installed\x64-windows-release\include -isystem D:\installed\x64-windows-release\include\eigen3 -isystem "C:\Program Files\NVIDIA GPU Computing Toolkit\CUDA\v12.9\include" -isystem D:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\cmake\..\third_party\cudnn_frontend\include -DLIBCUDACXX_ENABLE_SIMPLIFIED_COMPLEX_OPERATIONS -Xcompiler  /Zc:__cplusplus -Xcompiler /w -w -Xcompiler /FS -Xfatbin -compress-all -DONNX_NAMESPACE=onnx --use-local-env -gencode arch=compute_50,code=sm_50 -gencode arch=compute_80,code=sm_80 -gencode arch=compute_86,code=sm_86 -gencode arch=compute_89,code=sm_89 -gencode arch=compute_90,code=sm_90 -gencode arch=compute_90a,code=sm_90a -gencode arch=compute_100,code=sm_100 -gencode arch=compute_100a,code=sm_100a -gencode arch=compute_101a,code=sm_101a -gencode arch=compute_120,code=sm_120 -gencode arch=compute_120a,code=sm_120a -Xcudafe --diag_suppress=cc_clobber_ignored,--diag_suppress=field_without_dll_interface,--diag_suppress=base_class_has_different_dll_interface,--diag_suppress=dll_interface_conflict_none_assumed,--diag_suppress=dll_interface_conflict_dllexport_assumed,--diag_suppress=bad_friend_decl --Werror cross-execution-space-call --no-host-device-move-forward --expt-relaxed-constexpr --expt-extended-lambda  -Xcompiler=/wd4819,/wd4503,/wd4190,/wd4244,/wd4251,/wd4275,/wd4522 -Wno-deprecated-gpu-targets --expt-extended-lambda -DCUB_WRAPPED_NAMESPACE=at_cuda_detail -DCUDA_HAS_FP16=1 -D__CUDA_NO_HALF_OPERATORS__ -D__CUDA_NO_HALF_CONVERSIONS__ -D__CUDA_NO_HALF2_OPERATORS__ -D__CUDA_NO_BFLOAT16_CONVERSIONS__ -Xcompiler="-O2 -Ob2" -DNDEBUG -Xcompiler /MD -std=c++17 -Xcompiler=-MD -MD -MT caffe2\CMakeFiles\torch_cuda.dir\__\aten\src\ATen\native\cuda\SegmentReduce.cu.obj -MF caffe2\CMakeFiles\torch_cuda.dir\__\aten\src\ATen\native\cuda\SegmentReduce.cu.obj.d -x cu -c D:\b\libtorch\src\v2.7.1-2650bb1bd5.clean\aten\src\ATen\native\cuda\SegmentReduce.cu -o caffe2\CMakeFiles\torch_cuda.dir\__\aten\src\ATen\native\cuda\SegmentReduce.cu.obj -Xcompiler=-Fdcaffe2\CMakeFiles\torch_cuda.dir\,-FS
LLVM ERROR: out of memory
@EvilMcStevil EvilMcStevil marked this pull request as ready for review August 10, 2025 01:28
@EvilMcStevil
Copy link
Contributor Author

Also credit to @luncliff as i shamelessly stole the OSX fixes from #46700

luncliff and others added 2 commits August 10, 2025 14:15
# Conflicts:
#	ports/libtorch/fix-cmake.patch
#	ports/libtorch/portfile.cmake
#	versions/l-/libtorch.json
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
depends:vm-update PR contains changes to the VM provisioning scripts
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants