1 parent 439995b, commit 2ab327e
_posts/2025-09-11-qwen3-next.md
@@ -66,7 +66,7 @@ Another innovation in Qwen3-Next is **multi-token prediction**, which boosts bot
Our Qwen3-Next integration is just the beginning. On the roadmap:
* Further kernel optimizations for GatedDeltaNet layers.
-* Better memory management and prefix caching for hybrid models.
+* Better memory management, plus the support of automatic prefix caching and P/D disaggregation for hybrid models.
* Continuous throughput and CPU overhead reductions.
## **Acknowledgements**