Hybrid Mamba-2 + Transformer 2.94B LLM (Nemotron-H style) — Korean 3B model pretrained from scratch on 7× NVIDIA B200 GPUs with SFT + DPO alignment
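The SFT + DPO alignment mentioned in this description reduces to a simple preference loss over chosen/rejected response pairs. Below is a minimal PyTorch sketch of that loss; the function name and tensor layout are assumptions for illustration, not this repository's code.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Direct Preference Optimization loss from per-sequence log-probabilities.

    Each argument is a 1-D tensor of summed token log-probs for the chosen /
    rejected responses under the trained policy and the frozen reference model.
    beta controls how far the policy is allowed to drift from the reference.
    """
    chosen_rewards = beta * (policy_chosen_logps - ref_chosen_logps)
    rejected_rewards = beta * (policy_rejected_logps - ref_rejected_logps)
    # -log sigmoid(margin): pushes the chosen response above the rejected one
    return -F.logsigmoid(chosen_rewards - rejected_rewards).mean()
```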
Rust-native MoE inference runtime with custom CUDA kernels for Blackwell GPUs. Includes DFlash speculative decoding, multi-tier Engram memory, and entropy-adaptive routing. Targets Qwen3.5-35B-A3B on a single RTX 5060 Ti 16GB.
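The runtime itself is Rust with custom CUDA kernels; the sketch below uses Python only to illustrate one plausible reading of "entropy-adaptive routing" in an MoE layer, namely widening top-k when the router distribution is high-entropy. The function name, thresholds, and the interpretation are all hypothetical, not the repository's actual algorithm.

```python
import torch
import torch.nn.functional as F

def route_tokens(router_logits, k_min=1, k_max=4, entropy_threshold=1.0):
    """Hypothetical entropy-adaptive top-k routing.

    router_logits: (tokens, num_experts). Tokens whose routing distribution is
    high-entropy (the router is unsure) are sent to k_max experts; confident
    tokens use only k_min experts.
    """
    probs = F.softmax(router_logits, dim=-1)
    entropy = -(probs * probs.clamp_min(1e-9).log()).sum(dim=-1)      # (tokens,)
    k = (entropy > entropy_threshold).long() * (k_max - k_min) + k_min  # per-token k
    topk_probs, topk_idx = probs.topk(k_max, dim=-1)
    # mask out experts beyond each token's chosen k, then renormalize the weights
    keep = torch.arange(k_max, device=probs.device) < k.unsqueeze(-1)
    weights = (topk_probs * keep) / (topk_probs * keep).sum(-1, keepdim=True)
    return topk_idx, weights
```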
Hybrid SSM-Attention language model on Apple Silicon with MLX — interleaving Mamba-2 and Transformer for efficient inference
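The interleaving pattern in hybrid SSM-attention stacks (Nemotron-H style: mostly state-space blocks with a few attention layers) can be sketched in a few lines. The example below uses plain PyTorch rather than MLX, and a toy diagonal linear recurrence in place of a real Mamba-2 mixer, purely to show the layer layout; the class names and pattern string are illustrative assumptions.

```python
import torch
import torch.nn as nn

class SSMBlock(nn.Module):
    """Toy diagonal linear-recurrence mixer standing in for a real Mamba-2 block."""
    def __init__(self, dim):
        super().__init__()
        self.in_proj, self.out_proj = nn.Linear(dim, dim), nn.Linear(dim, dim)
        self.log_decay = nn.Parameter(torch.zeros(dim))

    def forward(self, x):                          # x: (batch, seq, dim)
        u, a = self.in_proj(x), torch.sigmoid(self.log_decay)
        h, ys = torch.zeros_like(u[:, 0]), []
        for t in range(u.shape[1]):                # O(1) state per token, no KV cache
            h = a * h + (1 - a) * u[:, t]
            ys.append(h)
        return x + self.out_proj(torch.stack(ys, dim=1))

class AttnBlock(nn.Module):
    def __init__(self, dim, heads=8):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, x):
        return x + self.attn(x, x, x, need_weights=False)[0]

def build_hybrid(dim=512, pattern="MMMAMMMA"):
    """'M' -> SSM block, 'A' -> attention block; only a few layers use attention."""
    return nn.Sequential(*[SSMBlock(dim) if c == "M" else AttnBlock(dim)
                           for c in pattern])

# usage: out = build_hybrid()(torch.randn(2, 16, 512))
```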
llama.cpp fork with additional SOTA quants and improved performance
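The quantization schemes in llama.cpp are blockwise: each small block of weights shares a scale. The sketch below shows the basic idea with a simplified symmetric 4-bit quantizer; it is not any specific llama.cpp format (Q4_0, Q4_K, etc. pack bits and store scales differently), and it assumes the weight count is a multiple of the block size.

```python
import numpy as np

def quantize_4bit_blocks(weights, block_size=32):
    """Simplified symmetric 4-bit blockwise quantization (illustrative only)."""
    w = np.asarray(weights, dtype=np.float32).reshape(-1, block_size)
    scale = np.abs(w).max(axis=1, keepdims=True) / 7.0         # one scale per block
    scale[scale == 0] = 1.0                                    # avoid divide-by-zero
    q = np.clip(np.round(w / scale), -8, 7).astype(np.int8)    # 4-bit range [-8, 7]
    return q, scale

def dequantize(q, scale):
    return (q.astype(np.float32) * scale).reshape(-1)
```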