Skip to content

Commit c1167f8

Browse files
committed
Update on "add module level benchmark for gemma3 model"
This diff adds a module-level benchmark for the GEMMA3 model. Also introduce mutlmodal_benchmark.cpp to replace original voxtral_runner.cpp for benchmarking both gemma3 and voxtral model in module level. Differential Revision: [D84958564](https://our.internmc.facebook.com/intern/diff/D84958564/) [ghstack-poisoned]
2 parents 99340a5 + b6eae81 commit c1167f8

File tree

1 file changed

+1
-2
lines changed

1 file changed

+1
-2
lines changed

backends/cuda/runtime/shims/memory.cpp

Lines changed: 1 addition & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -234,10 +234,9 @@ AOTITorchError aoti_torch_empty_strided(
234234
}
235235
// For each dimension, add stride[i] * (size[i] - 1)
236236
// This gives us the maximum offset in that dimension
237-
int64_t stride_i = (strides_ptr != nullptr) ? strides_ptr[i] : 0;
237+
int64_t stride_i = (strides_ptr != nullptr) ? strides_ptr[i] : 1;
238238
if (strides_ptr == nullptr) {
239239
// Calculate contiguous stride if not provided
240-
stride_i = 1;
241240
for (int64_t j = i + 1; j < ndim; j++) {
242241
stride_i *= sizes_ptr[j];
243242
}

0 commit comments

Comments
 (0)