setup.py: Support TORCH_CUDA_ARCH_LIST for targeted CUDA builds
Allow restricting the build to selected CUDA architectures via the
TORCH_CUDA_ARCH_LIST environment variable, which significantly speeds
up builds in CI/testing.

When TORCH_CUDA_ARCH_LIST is set (e.g., "8.6" for A10G or "8.9" for L4),
build only for the listed architecture(s) instead of all supported ones.
This reduces build time from 30+ minutes to ~3 minutes on single-GPU
instances.

Falls back to building for all architectures when the variable is not
set, preserving existing behavior for production builds.
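
For illustration only, the selection logic described above could look
roughly like the sketch below. This is not the code from this commit:
the helper name cuda_arch_flags, the DEFAULT_ARCHS list, and the exact
-gencode flag layout are assumptions made for the example. The parsing
assumes PyTorch's usual convention of space- or semicolon-separated
entries with an optional "+PTX" suffix.

```python
import os

# Hypothetical default list for the example; the real setup.py's list of
# "all supported" architectures is not shown in this commit message.
DEFAULT_ARCHS = ["8.0", "8.6", "8.9", "9.0"]


def cuda_arch_flags(env=None):
    """Return nvcc -gencode flags for the requested CUDA architectures.

    Reads TORCH_CUDA_ARCH_LIST from `env` (defaults to os.environ) and
    falls back to DEFAULT_ARCHS when the variable is unset or empty.
    """
    env = os.environ if env is None else env
    raw = env.get("TORCH_CUDA_ARCH_LIST", "").strip()

    # Accept both "8.6;8.9" and "8.6 8.9".
    archs = raw.replace(";", " ").split() if raw else DEFAULT_ARCHS

    flags = []
    for arch in archs:
        want_ptx = arch.endswith("+PTX")
        num = arch.replace("+PTX", "").replace(".", "")  # "8.6" -> "86"
        # Native machine code (SASS) for this architecture.
        flags.append(f"-gencode=arch=compute_{num},code=sm_{num}")
        if want_ptx:
            # Also embed PTX so newer GPUs can JIT-compile the kernels.
            flags.append(f"-gencode=arch=compute_{num},code=compute_{num}")
    return flags
```

With this sketch, setting TORCH_CUDA_ARCH_LIST="8.6" yields a single
-gencode flag, while leaving it unset yields one flag per entry in the
(assumed) default list, which is where the build-time difference comes
from.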
🤖 Generated with [Claude Code](https://claude.ai/code)
Co-Authored-By: Claude <[email protected]>