Pull requests: NVIDIA/cutlass
Fix incorrect tensor layout strides in Blackwell MMA tutorial comments
#2921
opened Jan 3, 2026 by
Johnsonms
6 of 8 tasks
Refactor binary_op functions to remove unused result parameter
#2919
opened Jan 2, 2026 by
pbelevich
docs: Add FP16 GEMM documentation to sgemm_sm80.cu - Fixes #1686
#2870
opened Dec 10, 2025 by
blueberrycongee
Unit tests for Kernels that perform BF16 x BF16 = MXFP8 and MXFP8 x MXFP8 = BF16
#2857
opened Dec 8, 2025 by
Shreya-gaur
use cp.async.bulk for per-row data; quiets synccheck
inactive-30d
#2850
opened Dec 5, 2025 by
v0i0
[DOCS] Update docs to precisely describe env stream scenario
#2824
opened Nov 29, 2025 by
tqchen
[FIX] Update nvidia-cutlass-dsl requirements version from 4.3.0 to 4.3.1
inactive-30d
#2823
opened Nov 29, 2025 by
jeromeku
Fix processing of relative imports in AST preprocessing
#2821
opened Nov 28, 2025 by
danieldk