Open
Labels
area/bench, area/test-and-release, good first issue, help wanted, priority/P1
Description
Is your feature request related to a problem? Please describe.
Since the router is meant to support Mixture-of-Models, we need integration test suites to verify that functionality.
Now that #228 lets us use real yet lightweight models as the LLM server, we can extend that server to host multiple specialized models.
Describe the solution you'd like
- Fine-tune lightweight models, such as Qwen3 0.6B or Gemma 270m, for different specializations.
- Run the entire model_eval and config generation workflow.
- Run benchmarks to evaluate the router's performance and accuracy.
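
As a starting point for the accuracy part of the benchmark, here is a minimal sketch of how routing accuracy could be scored against labeled prompts. Everything in it is an assumption for illustration: `routing_accuracy`, `keyword_route`, the model names `math-model` and `general-model`, and the sample prompts are hypothetical and are not taken from the router's codebase. In a real test, `route` would wrap a request to the router and return the name of the model it selected.

```python
from collections import defaultdict
from typing import Callable, Dict, Iterable, Tuple


def routing_accuracy(
    route: Callable[[str], str],
    labeled_prompts: Iterable[Tuple[str, str]],
) -> Dict[str, float]:
    """Return per-model and overall routing accuracy.

    route: maps a prompt to the name of the model that was selected.
    labeled_prompts: (prompt, expected_model) pairs.
    """
    hits = defaultdict(int)
    totals = defaultdict(int)
    for prompt, expected in labeled_prompts:
        totals[expected] += 1
        if route(prompt) == expected:
            hits[expected] += 1

    n = sum(totals.values())
    if n == 0:
        raise ValueError("no labeled prompts supplied")

    # Accuracy per expected model, plus an "overall" entry.
    report = {name: hits[name] / totals[name] for name in totals}
    report["overall"] = sum(hits.values()) / n
    return report


# Hypothetical stand-in for the real router: digits go to the math model.
def keyword_route(prompt: str) -> str:
    if any(ch.isdigit() for ch in prompt):
        return "math-model"
    return "general-model"


# Hypothetical labeled prompts; a real suite would load a larger dataset.
SAMPLE = [
    ("What is 12 * 7?", "math-model"),
    ("Integrate x squared", "math-model"),
    ("Write a haiku about rain", "general-model"),
    ("Summarize this paragraph", "general-model"),
]

report = routing_accuracy(keyword_route, SAMPLE)
print(report)
```

Reporting accuracy per expected model as well as overall makes it easier to see which specialization the router misroutes, which a single aggregate number would hide. The sketch assumes no model is literally named `overall`.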
yuluo-yx