[v0.1] Docs: Model performance evaluation guide #53

@rootfs

Description

Acceptance

  • Documents the automated workflow to evaluate models (on benchmarks including, but not limited to, MMLU-Pro), generate a performance-based routing config, and update categories[].model_scores.
  • Includes an example evaluation -> config pipeline.
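
As a starting point for the example pipeline, here is a minimal sketch of the evaluation -> config step. It assumes evaluation results arrive as `(model, category, correct)` records and that each `categories[]` entry carries a `name` plus a `model_scores` list of `{model, score}` items. Only `categories[].model_scores` comes from this issue; the function name `build_model_scores`, the record layout, and the `name`, `model`, and `score` keys are hypothetical and should be aligned with the real config schema.

```python
from collections import defaultdict


def build_model_scores(results):
    """Aggregate per-question results into a routing-config fragment.

    `results` is an iterable of (model, category, correct) tuples.
    This record layout is an assumption, not the project's format.
    """
    # (category, model) -> [number correct, number attempted]
    tally = defaultdict(lambda: [0, 0])
    for model, category, correct in results:
        entry = tally[(category, model)]
        entry[0] += int(bool(correct))
        entry[1] += 1

    # Group per-model accuracy under each category.
    by_category = defaultdict(list)
    for (category, model), (n_correct, n_total) in tally.items():
        by_category[category].append(
            {"model": model, "score": round(n_correct / n_total, 4)}
        )

    # Best-scoring model first; ties broken by model name for stable output.
    categories = []
    for category in sorted(by_category):
        scores = sorted(
            by_category[category], key=lambda s: (-s["score"], s["model"])
        )
        categories.append({"name": category, "model_scores": scores})
    return {"categories": categories}


# Hypothetical sample data; real runs would read benchmark output instead.
sample_results = [
    ("model-a", "math", True),
    ("model-a", "math", False),
    ("model-b", "math", True),
    ("model-b", "math", True),
    ("model-a", "law", True),
]
config = build_model_scores(sample_results)
```

The resulting dictionary could then be serialized and merged into the router config. Sorting by descending score keeps the preferred model first in each category, which is one possible convention rather than a requirement stated here.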
