Skip to content

v2.2.5

Latest

Choose a tag to compare

@kzjeef kzjeef released this 29 Jul 15:50
· 2 commits to main since this release

What's Changed

  • Support Single Node P-D Disaggregate function for CUDA.
  • Support Qwen3 Model, Currently only for Dense, MoE Models support is WIP.
  • Support EP in MOE OP, currently only support BF16 and FP16.
  • CPU Support is not work in this release, Use a v2.1.x version for CPU support.

In Detail

New Contributors

Full Changelog: v2.1.0...v3.0.0-rc1

Existing Issue

  • CPU Support is not work in this release, Use a v2.1.x version for CPU support.