[CUDA] Support FP8 (E4M3) KV Cache for Group Query Attention #27321
+1,140
−422
Microsoft GitHub Policy Service / license/cla
succeeded
Feb 14, 2026 in 0s
All CLA requirements met.
This check verifies that the author has agreed to a CLA with Microsoft.
Loading