[BUG FIX] Refactor block min/max calculations #223

LoserCheems · 2026-01-18T11:55:11Z

Summary

Refactor block min/max calculations to utilize Triton's minimum and maximum functions.

Root Cause

The original implementation called Python's built-in min and max inside the Triton block min/max functions, which may not be optimal for performance in a Triton context.

Changes

Replaced Python's min and max with Triton's tl.minimum and tl.maximum in the block min/max calculation functions.
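As a rough illustration of the arithmetic such functions perform, here is a plain-Python reference model of a block min/max calculation. It is a sketch under assumptions, not the code from this PR: the name block_n_min_max, its parameters, and the bottom-right-aligned causal convention are hypothetical choices made for the example. In plain Python the built-in min and max are the right tools; inside a @triton.jit function the same two clamps would be written with tl.minimum and tl.maximum, which is the substitution this PR describes.

```python
from typing import Optional, Tuple


def ceil_div(a: int, b: int) -> int:
    """Integer division rounded up (valid for positive b)."""
    return -(-a // b)


def block_n_min_max(
    m_block: int,
    seqlen_q: int,
    seqlen_k: int,
    block_m: int,
    block_n: int,
    causal: bool = False,
    window_left: Optional[int] = None,
    window_right: Optional[int] = None,
) -> Tuple[int, int]:
    """Return the half-open range [n_min, n_max) of key blocks that the
    query block `m_block` may need to visit.

    Hypothetical reference model: names and conventions are assumptions.
    """
    n_min = 0
    n_max = ceil_div(seqlen_k, block_n)
    # Assumed convention: the causal diagonal is aligned to the bottom-right
    # corner, so query row i lines up with key column i + offset.
    offset = seqlen_k - seqlen_q
    if causal:
        # Causal masking behaves like a right window of size zero.
        window_right = 0
    if window_right is not None:
        # Exclusive end of the query rows covered by this block.
        row_end = (m_block + 1) * block_m
        # In a Triton kernel this clamp would use tl.minimum.
        n_max = min(n_max, ceil_div(row_end + offset + window_right, block_n))
    if window_left is not None:
        row_start = m_block * block_m
        # In a Triton kernel this clamp would use tl.maximum.
        n_min = max(n_min, (row_start + offset - window_left) // block_n)
    # The range is empty whenever n_min >= n_max.
    return n_min, n_max
```

A reference like this can be compared against kernel results over a sweep of sequence lengths and block sizes, for example block_n_min_max(1, 256, 256, 64, 64, causal=True).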

Reproduction

The issue can be reproduced by running block min/max calculations in the Triton environment with varying input sizes.

Tests

Validated changes by running existing tests that cover block min/max calculations.

Compatibility

No backward compatibility issues identified.

Checklist

Linked issue provided
Adds or updates tests
Updates docs if needed
No perf regressions

Copilot

Pull request overview

This PR refactors block min/max calculations in Triton kernels to use Triton's native tl.minimum and tl.maximum functions instead of Python's built-in min and max functions, improving performance within the Triton compilation context.

Changes:

Replaced all instances of Python's min/max with tl.minimum/tl.maximum in block calculation functions
Applied changes consistently across all four Triton JIT-compiled functions in the file

Refactor block min/max calculations to use triton's minimum and maximum functions

9618d01

Copilot AI review requested due to automatic review settings January 18, 2026 11:55

github-actions bot assigned Evanwu1125, ftgreat, SNHuan, Thanksyy, wubingheng111 and zacliu2023 Jan 18, 2026

Copilot started reviewing on behalf of LoserCheems January 18, 2026 11:55

Copilot AI reviewed Jan 18, 2026

Adds combined seqlen, causal, local masking

b16ccad

Improves masking flexibility by supporting seqlen, causal, and local windows while guarding unsupported configurations for swapped dimensions and packed GQA.
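To make the combination of the three mask types concrete, the following plain-Python sketch checks whether a single query/key pair is visible. It is only an illustration under assumptions: the function names, the swap_ab and pack_gqa flags, and the bottom-right-aligned causal convention are hypothetical and are not taken from this commit.

```python
from typing import Optional


def check_supported(swap_ab: bool = False, pack_gqa: bool = False) -> None:
    """Reject configurations this sketch does not model.

    Both flag names are hypothetical stand-ins for swapped dimensions
    and packed GQA.
    """
    if swap_ab or pack_gqa:
        raise NotImplementedError(
            "swapped dimensions and packed GQA are not handled in this sketch"
        )


def is_visible(
    q_idx: int,
    k_idx: int,
    seqlen_q: int,
    seqlen_k: int,
    causal: bool = False,
    window_left: Optional[int] = None,
    window_right: Optional[int] = None,
) -> bool:
    """Return True if query position q_idx may attend to key position k_idx."""
    # Sequence-length mask: positions past the real lengths are padding.
    if q_idx >= seqlen_q or k_idx >= seqlen_k:
        return False
    # Assumed convention: align the diagonal to the bottom-right corner.
    shifted = q_idx + seqlen_k - seqlen_q
    # Causal mask: no attention to keys right of the diagonal.
    if causal and k_idx > shifted:
        return False
    # Local window: limit how far right and left of the diagonal a key may be.
    if window_right is not None and k_idx > shifted + window_right:
        return False
    if window_left is not None and k_idx < shifted - window_left:
        return False
    return True
```

Evaluating the three conditions in one predicate keeps each mask independent, so any subset of them can be enabled for a given call.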

LoserCheems merged commit 7c65745 into main Jan 19, 2026
