v3.0.11
What's Changed
- feat: add cute hgemv implementation by @kitecats in #331
- Update README.md by @DefTruth in #333
- feat: add a cute bank-conflict-free vectorized mat transpose implementation by @kitecats in #334
- bugfix: fix layernorm & rmsnorm f16 overflow by @hebangwen in #335
- bugfix: fix a compilation error by @lixiaoquan in #336
New Contributors
- @hebangwen made their first contribution in #335
- @lixiaoquan made their first contribution in #336
Full Changelog: v3.0.10...v3.0.11