A CLI tool for parsing and comparing GitHub Actions performance metrics from tt-forge and tt-xla repositories.
- Report Mode: Analyze single workflow run with simulation and device performance metrics
- Compare Mode: Compare two runs with delta calculations and regression detection
- Rich Output: Color-coded console tables with performance metrics
pip install -e .- Python 3.8+
- GitHub CLI (
gh) installed and authenticated
gh-perf-report report <run_id> --repo <tt-xla|tt-forge>Example:
gh-perf-report report 20770713621 --repo tt-forgegh-perf-report compare <baseline_run_id> <current_run_id> --baseline-repo <tt-xla|tt-forge>Example:
gh-perf-report compare 20770713621 20780226487 --baseline-repo tt-xlagh-perf-report list-jobs <run_id> --repo <tt-xla|tt-forge>--owner: Repository owner (default: tenstorrent)--no-cache: Disable caching--workers: Number of parallel workers (default: 5)--current-repo: Current repository for comparison (defaults to baseline-repo)
- Extracts Simulation Metrics: Parses "Sample per second" from Step 10 ("Run Perf Benchmark") logs
- Extracts Device Performance: Downloads and parses device-perf CSV artifacts, calculates sum of filtered operation durations
- Reports Failures: Identifies failed tests and reports which step failed with error messages
- Compares Runs: Calculates deltas and percentage changes, highlights regressions and improvements
Shows table with:
- Job name
- Status (success/failure)
- Samples per second
- Device perf (milliseconds)
- Error messages for failed jobs
Shows table with:
- Job name
- Status changes
- Samples/sec delta and percentage
- Device perf delta and percentage
- Result (regression/improvement/neutral)
MIT