Slow prompt processing? #13180

pt13762104 · 2025-04-29T11:30:24Z

I get 100 t/s on prompt processing with Qwen3-30B-A3B, while with Qwen2.5 3B I get 3x that. Is this due to a slow implementation, or am I just expecting too much?

Replies: 0 comments
