Limit tensor block numel to triton's requirements #485

oulgen · 2025-08-11T22:05:01Z

Stacked PRs:

->Limit tensor block numel to triton's requirements #485

Limit tensor block numel to triton's requirements

Fixes #456

Fixes #456 stack-info: PR: #485, branch: oulgen/stack/63

jansel

See below, this will over-count block sizes from inner loops.

jansel · 2025-08-12T01:44:35Z

helion/autotuner/block_id_sequence.py

+        if name == "block_sizes" and math.prod(values) > 1048576:
+            raise InvalidConfig(
+                "Triton does not allow for tensor numel greater than 1048576"
+            )


Not all block sizes should be included in this count, only the ones in the top level loop. We might need to tag some block sizes as coming from the grid and only count those.

Limit tensor block numel to triton's requirements

9bd1e7f

Fixes #456 stack-info: PR: #485, branch: oulgen/stack/63

oulgen force-pushed the oulgen/stack/63 branch from be7d80a to 9bd1e7f Compare August 11, 2025 22:05

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Aug 11, 2025

oulgen requested a review from jansel August 11, 2025 22:07

jansel requested changes Aug 12, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Limit tensor block numel to triton's requirements #485

Limit tensor block numel to triton's requirements #485

Uh oh!

oulgen commented Aug 11, 2025 •

edited

Loading

Uh oh!

jansel left a comment •

edited

Loading

Uh oh!

jansel Aug 12, 2025

Uh oh!

Uh oh!

Limit tensor block numel to triton's requirements #485

Are you sure you want to change the base?

Limit tensor block numel to triton's requirements #485

Uh oh!

Conversation

oulgen commented Aug 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!