Commit 4e505b2

Supports flexible mask and bias head dimensions
Adds support for mask and bias tensors whose head dimension is 1, num_heads_k, or num_heads, instead of only num_heads_k. Enables more flexible attention patterns by allowing masks and biases to be broadcast across different head configurations. Updates parameter passing to track separate head counts for the mask and the bias, and adds validation checks for both. Temporarily disables the variable-length attention variants to focus on core functionality improvements.
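
The head-dimension rule described above can be illustrated with a minimal sketch. The diff itself is not shown here, so this is not the commit's code: `mask_head_index` is a hypothetical helper, and it assumes the usual grouped-query convention in which consecutive query heads share one key/value head.

```python
def mask_head_index(query_head, num_heads, num_heads_k, num_heads_mask):
    """Pick the mask/bias head slice used by a given query head.

    Hypothetical helper for illustration only; the names mirror the commit
    message, not the actual kernel parameters.
    """
    if num_heads % num_heads_k != 0:
        raise ValueError("num_heads must be divisible by num_heads_k")
    if num_heads_mask not in (1, num_heads_k, num_heads):
        raise ValueError("mask/bias head count must be 1, num_heads_k, or num_heads")
    if not 0 <= query_head < num_heads:
        raise IndexError("query_head out of range")

    if num_heads_mask == num_heads:
        # One slice per query head: no broadcasting needed.
        return query_head
    if num_heads_mask == num_heads_k:
        # One slice per key/value head, shared by its group of query heads
        # (assumed grouping: consecutive query heads map to the same kv head).
        return query_head // (num_heads // num_heads_k)
    # A single slice broadcast across every head.
    return 0
```

With 8 query heads and 2 key/value heads, query head 5 would read slice 0, 1, or 5 depending on whether the mask has 1, 2, or 8 heads; any other head count is rejected by the validation check.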
1 parent 3bc89c8 commit 4e505b2

1 file changed: +542 -460 lines changed

0 commit comments
