Options for using multiple models #200

BradKML · 2025-08-12T02:26:21Z

BradKML
Aug 12, 2025

Other than agentics and mathematics, usually FOSS is better. Ideally OpenEvolve for proof-of-concept work should heavily bias Qwen3-32B or Qwen3-30B-A3B, and only use larger Qwen coder for heavyweight development SomeOddCodeGuy/WilmerAI#26 (reply in thread)

This line of thought makes me think of a few things:

How do we find better reinforcement learning algorithms? What comes after "Dr. GRPO" and RLVR? Implicit rewards? Maybe self-play? Open-ended ("free range") research optimization #197 What about "Intuitor"? https://github.com/sunblaze-ucb/Intuitor
How do we transfer the skills of one extremely unhostable model into open weight industrial models and then to something more consumer-friendly? (yes I am essentially asking for a multi-tiered distillation solution)
Since RouteLLM only interpolates between 2 models, and WilmerAI only routes through task type, what are the possible ways of handling both at the same time, and using more than 2 models per task type? https://github.com/lm-sys/RouteLLM

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Options for using multiple models #200

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Options for using multiple models #200

Uh oh!

Uh oh!

BradKML Aug 12, 2025

Replies: 0 comments

BradKML
Aug 12, 2025