[None][feat] Add Mamba2 MTP SSM cache CUDA kernel for tree-based speculative decoding #12537

Merged
JadoTu merged 4 commits into NVIDIA:main from JadoTu:mamba2_tree_based_mtp_CUDA_kernel
Apr 1, 2026

Commits

Commits on Mar 25, 2026

Commits on Mar 30, 2026

Commits on Mar 31, 2026