feat: reduce nb experts per token in moe architectures #450
+193
−1
GitHub Advanced Security / CodeQL
succeeded
Jan 19, 2026 in 20s
No new alerts in code changed by this pull request
Loading