Update on "make aoti_torch_empty_strided support creating incontiguous tensor"

Gasoonjia · Gasoonjia · commit 7baa09f83418 · 2025-10-20T21:43:01.000-07:00
This diff modifies the `aoti_torch_empty_strided` function to support the creation of incontiguous tensors. To achieve it, this diff: 1. update the way to calculate the memory size by using both tensor size and the stride 2. skip stride check in ETensor by adding and checking cmake macro `USE_CUDA_BACKEND` when building with CUDA backend support. we will soon bring the ETensor check back for every backend after migrating to use slimtensor. Differential Revision: [D84938258](https://our.internmc.facebook.com/intern/diff/D84938258/) [ghstack-poisoned]
diff --git a/backends/cuda/runtime/shims/memory.cpp b/backends/cuda/runtime/shims/memory.cpp
@@ -234,10 +234,9 @@ AOTITorchError aoti_torch_empty_strided(
     }
     // For each dimension, add stride[i] * (size[i] - 1)
     // This gives us the maximum offset in that dimension
-    int64_t stride_i = (strides_ptr != nullptr) ? strides_ptr[i] : 0;
+    int64_t stride_i = (strides_ptr != nullptr) ? strides_ptr[i] : 1;
     if (strides_ptr == nullptr) {
       // Calculate contiguous stride if not provided
-      stride_i = 1;
       for (int64_t j = i + 1; j < ndim; j++) {
         stride_i *= sizes_ptr[j];
       }

Original file line number	Diff line number	Diff line change
`@@ -234,10 +234,9 @@ AOTITorchError aoti_torch_empty_strided(`
`234`	`234`	`}`
`235`	`235`	`// For each dimension, add stride[i] * (size[i] - 1)`
`236`	`236`	`// This gives us the maximum offset in that dimension`
`237`		`- int64_t stride_i = (strides_ptr != nullptr) ? strides_ptr[i] : 0;`
	`237`	`+ int64_t stride_i = (strides_ptr != nullptr) ? strides_ptr[i] : 1;`
`238`	`238`	`if (strides_ptr == nullptr) {`
`239`	`239`	`// Calculate contiguous stride if not provided`
`240`		`- stride_i = 1;`
`241`	`240`	`for (int64_t j = i + 1; j < ndim; j++) {`
`242`	`241`	`stride_i *= sizes_ptr[j];`
`243`	`242`	`}`