Skip to content

(WIP) DeepseekV3 (and Multi-Head Latent Attention)#2012

Closed
ysjprojects wants to merge 25 commits intoLightning-AI:mainfrom
ysjprojects:pr-feature-mla
Closed

(WIP) DeepseekV3 (and Multi-Head Latent Attention)#2012
ysjprojects wants to merge 25 commits intoLightning-AI:mainfrom
ysjprojects:pr-feature-mla

Commits

Commits on Feb 24, 2025

Commits on Feb 25, 2025

Commits on Mar 11, 2025

Commits on Mar 12, 2025

Commits on Apr 3, 2025

Commits on Apr 7, 2025

Commits on Apr 13, 2025

Commits on Apr 23, 2025

Commits on May 15, 2025

Commits on May 16, 2025