CUDA: use mma PTX instructions for FlashAttention #10389

Triggered via pull request February 2, 2025 15:27
Status Success
Total duration 25m 39s

server.yml

on: pull_request
Matrix: server