Fix incorrect memory allocation by aligning element computation with tensor extents #5

accable · 2025-07-02T12:41:27Z

This PR fixes a segmentation fault that occurs during tensor memory allocation for large extents.

The issue stems from using the global extent array when computing elementsA, elementsB, and elementsC. This happens to work for the original toy example {6, 6, 6, 4, 4, 4}, but it breaks when the dimensions and mode mappings differ, as in configurations such as {1, 8, 512, 8, 512}.

The fix replaces uses of extent[...] with the corresponding per-tensor extents (extentA, extentB, and extentC), which are already computed correctly from the mode permutations. This ensures that the allocated memory matches how the tensor descriptors are defined.
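To illustrate the idea, here is a minimal host-only sketch of computing element counts from per-tensor extents. It is not the code from this PR: the helper names `tensorExtents` and `elementCount`, and the representation of modes as indices into the global extent array, are assumptions made for the example.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical helper: collect the extents of one tensor by following its
// mode list. Each mode is assumed to be an index into the global `extent` array.
std::vector<int64_t> tensorExtents(const std::vector<int64_t>& extent,
                                   const std::vector<int>& modes) {
    std::vector<int64_t> out;
    out.reserve(modes.size());
    for (int m : modes) {
        // Guard against a mode that does not exist in the global array.
        assert(m >= 0 && static_cast<std::size_t>(m) < extent.size());
        out.push_back(extent[m]);
    }
    return out;
}

// Number of elements to allocate: the product of the tensor's own extents,
// not of some positional slice of the global array.
std::size_t elementCount(const std::vector<int64_t>& tensorExtent) {
    std::size_t n = 1;
    for (int64_t e : tensorExtent) {
        n *= static_cast<std::size_t>(e);
    }
    return n;
}
```

With the global extents {1, 8, 512, 8, 512} and an assumed mode list {1, 2, 3, 4} for one tensor, `elementCount(tensorExtents(...))` gives 8 * 512 * 8 * 512 = 16777216 elements, whereas multiplying the first three entries of the global array would give only 4096. An allocation sized from the wrong product would be too small for the tensor its descriptor describes.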

Tested on NVIDIA Grace with this repo.

Fixed segfault by reusing defined extents

a72b41b
