Qwen3 demo batch size support for non-mirage baseline#403
Qwen3 demo batch size support for non-mirage baseline#403dcw02 wants to merge 4 commits intomirage-project:mpkfrom
Conversation
|
I haven't looked into it, but the mirage path of the Qwen3 demo no longer generates/outputs tokens since commit 22a0bdf |
|
Hi @dcw02 can I know the context to reproduce the no generation problem? Thanks |
@NorthmanPKU Here is a repro script using Modal: The output should look something like: I think only |
|
@NorthmanPKU I fixed the no generation problem in #412 |
|
Where is the CUDA version code with mirage? How should this support the base size parameter? Or is it already supported, or is there a related PR? |
Description of changes:
This PR adds batch size > 1 support to the non-mirage baseline in the Qwen3 demo.
Related Issues:
Linked Issues: