Skip to content

Tool for analyzing and visualizing differences between two Aider benchmark runs through an interactive dashboard.

License

Notifications You must be signed in to change notification settings

CedarVerse/benchmark-delta-analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

BDA - Benchmark Delta Analysis Tool

A Python tool for analyzing and visualizing differences between two Aider benchmark runs through an interactive dashboard.

PyPI version Python Versions Code style: black License

Features

  • Compares two benchmark run folders (raw or tar-bz2 archives)
  • Interactive web dashboard built with Dash
  • Detailed test-by-test comparison analysis
  • Visualizes performance changes and differences

Dashboard Features

  • Test result comparison
  • Performance metrics visualization
  • Interactive data filtering and sorting
  • Detailed test case analysis

img.png

img_1.png

img_2.png

img_3.png

Installation

pipx install cedarverse-bda

Usage

# Info on a single benchmark run
bda --info <run-path>

# Delta Analysis
bda <run1-path> [run2-path]

# Interactive dashboard
bda --dashboard <run1-path> [run2-path]

Development

# Install development dependencies
pip install -e .[dev]

About

Tool for analyzing and visualizing differences between two Aider benchmark runs through an interactive dashboard.

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published