Skip to content

[v0.1]Bench: Router benchmark CLI #40

@rootfs

Description

@rootfs

Acceptance

Command run_bench.sh with:

  • Per-category metrics: accuracy, response time, token counts (prompt/completion/total)
  • Per-model metrics: success rate, error distribution, latency distribution
  • Export to CSV/JSON for analysis

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions