Commit 2632da4
Enable nvfp4 output for trtllm-gen keepsMmaAb kernel
- Update cubin artifact path/checksum to new build with nvfp4 output support
- Fix kernel selection: remove E2M1 output dtype condition from mixed-precision path,
allowing nvfp4 output to use GQA generation kernel selection heuristics
- Always invoke selectTileSizeQForGqaGeneration (not just for maxSeqLenQ > 1)
- Add mUsesSharedPagedKvIdx field to KernelParams for vLLM/FlashInfer paged KV index
- Remove speculative-decode skip for nvfp4 output in tests
- Expand test coverage: head_dim [64, 128, 256], additional batch configs
AI-assisted
Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
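
The two kernel-selection bullets above can be illustrated with a small Python model. This is only a sketch of the control-flow change as the commit message describes it, not the actual C++ in `include/flashinfer/trtllm/fmha`: the `Config` type, both gate functions, and the dtype strings are hypothetical, and treating the two gates as independent is an assumption.

```python
from dataclasses import dataclass

E2M1 = "e2m1"  # nvfp4 element type named in the commit message


@dataclass(frozen=True)
class Config:
    """Hypothetical stand-in for the parameters the selector inspects."""
    dtype_out: str
    max_seq_len_q: int


def may_use_gqa_heuristics(cfg: Config, *, exclude_e2m1_output: bool) -> bool:
    # Assumed gate: before the commit, E2M1 output was excluded from the
    # GQA generation kernel selection heuristics.
    return not (exclude_e2m1_output and cfg.dtype_out == E2M1)


def invokes_tile_size_q(cfg: Config, *, only_when_multi_token: bool) -> bool:
    # Assumed gate: before the commit, the tile-size-Q selection ran only
    # when max_seq_len_q > 1; afterwards it runs unconditionally.
    return cfg.max_seq_len_q > 1 if only_when_multi_token else True


def selects_tile_size_before(cfg: Config) -> bool:
    return (may_use_gqa_heuristics(cfg, exclude_e2m1_output=True)
            and invokes_tile_size_q(cfg, only_when_multi_token=True))


def selects_tile_size_after(cfg: Config) -> bool:
    return (may_use_gqa_heuristics(cfg, exclude_e2m1_output=False)
            and invokes_tile_size_q(cfg, only_when_multi_token=False))


if __name__ == "__main__":
    for cfg in (Config(E2M1, 1), Config(E2M1, 4), Config("bf16", 1), Config("bf16", 4)):
        print(cfg, selects_tile_size_before(cfg), selects_tile_size_after(cfg))
```

Under these assumptions, the only configuration whose behaviour is unchanged is a non-nvfp4 output with `max_seq_len_q > 1`. Every other combination newly reaches the tile-size selection, which may be why the tests were widened in the same commit.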
1 parent 1ddef01 commit 2632da4
2 files changed, +24 -35 lines changed
- include/flashinfer/trtllm/fmha
- tests/attention

Diff content was not preserved; only the hunk positions survive (original file line numbers):
- First file (+3 -6): line 750 changed; lines 776-780 replaced by 2 lines
- Second file (+21 -29): lines 849-857 removed; line 1184 changed; line 1635 changed; lines 1637-1653 replaced by 19 lines; line 1675 removed