-
Notifications
You must be signed in to change notification settings - Fork 367
Labels
enhancementNew feature or requestNew feature or request
Description
Proposal
Create a new class that hacks Triton autotune to enable reading kernel autotune configuration files from local or remote disks.
Rationale
This can alleviate:
- potential bwd precision issues on H20
- compilation failures with certain configurations in some Triton versions
- cross-task precision alignment
- faster kernel launch speed
At present, we do not plan to consider automated configuration generation or cross-shape speed optimization; in other words, users will need to manually tune and generate configurations, and fla will provide a script to achieve similar tuning.
Nathancgy
Sub-issues
Metadata
Metadata
Assignees
Labels
enhancementNew feature or requestNew feature or request