Slow prompt processing? #13180
Unanswered
pt13762104
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I get 100t/s on Qwen3-30B-A3B on prompt processing, while with Qwen2.5 3B I get 3x that. Is this because due to a slow implementation, or I am just expecting too much?
Beta Was this translation helpful? Give feedback.
All reactions