@codegen-sh codegen-sh bot commented Jul 6, 2025

Summary

This PR implements auto-generated small benchmarks that run on every merge into the grainchain repo, as requested. The system now supports a dual benchmark architecture:

🏃‍♂️ Small Benchmarks (New - Auto on Merge)

  • Trigger: Automatically run on every PR merge to main
  • Iterations: 10 per provider (fast execution ~2-3 minutes)
  • Providers: local, e2b
  • Purpose: Quick regression detection and merge validation
  • Configuration: benchmarks/configs/merge-small.json
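
The contents of merge-small.json are not shown in this description. As a rough sketch only, assuming the config exposes two hypothetical keys, `providers` and `iterations`, a loader that validates the small-benchmark settings could look like this. The key names and the `load_benchmark_config` helper are illustrative assumptions, not the repo's actual schema or API.

```python
import json

# Hypothetical shape of benchmarks/configs/merge-small.json.
# The key names are assumptions made for this sketch.
EXAMPLE_CONFIG = """
{
  "providers": ["local", "e2b"],
  "iterations": 10
}
"""


def load_benchmark_config(text: str) -> dict:
    """Parse a benchmark config and check the fields this sketch assumes."""
    config = json.loads(text)
    providers = config.get("providers")
    iterations = config.get("iterations")
    if not isinstance(providers, list) or not providers:
        raise ValueError("config must list at least one provider")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(iterations, bool) or not isinstance(iterations, int) or iterations < 1:
        raise ValueError("iterations must be a positive integer")
    return config


config = load_benchmark_config(EXAMPLE_CONFIG)
# Total benchmark runs for one merge: providers x iterations.
total_runs = len(config["providers"]) * config["iterations"]
print(total_runs)
```

With two providers at 10 iterations each, a merge triggers 20 benchmark runs in total, which is what keeps the small suite short compared with the four-provider comprehensive run.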

🔬 Comprehensive Benchmarks (Existing - Manual/Scheduled)

  • Trigger: Manual execution or daily scheduled runs
  • Iterations: 3+ per provider (thorough analysis)
  • Purpose: Detailed performance analysis and provider comparison
  • Providers: local, e2b, daytona, morph

Changes Made

New Files

  • .github/workflows/merge-benchmark.yml - New workflow that triggers on PR merges
  • benchmarks/configs/merge-small.json - Lightweight config with 10 iterations

Modified Files

  • .github/workflows/benchmark.yml - Workflow renamed to "Comprehensive Benchmarks" for clarity
  • README.md - Updated documentation to explain the dual benchmark system

Key Features

✅ Merge-triggered: Runs automatically when PRs are merged to main
✅ Fast execution: 10 iterations per provider for quick feedback
✅ PR comments: Automatically posts benchmark results as PR comments
✅ Artifact storage: Results saved as GitHub Actions artifacts
✅ Git integration: Results committed back to repo for tracking
✅ Manual testing: Supports workflow_dispatch for manual testing
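
To illustrate the PR-comment feature, here is a minimal sketch of how per-provider timings might be rendered as a Markdown table for posting. The `format_pr_comment` function and the shape of the results mapping are assumptions for illustration; the workflow's actual comment format is not shown in this description.

```python
from statistics import mean


def format_pr_comment(results: dict[str, list[float]]) -> str:
    """Render per-provider timings (in seconds) as a Markdown table.

    `results` maps a provider name to its list of run durations.
    Each provider must have at least one timing, since mean() of an
    empty list raises StatisticsError.
    """
    lines = [
        "| Provider | Runs | Mean (s) |",
        "| --- | --- | --- |",
    ]
    # Sort provider names so the table order is stable between runs.
    for provider in sorted(results):
        timings = results[provider]
        lines.append(f"| {provider} | {len(timings)} | {mean(timings):.2f} |")
    return "\n".join(lines)


# Placeholder timings for illustration only; these are not measured results.
sample = {"local": [1.0, 2.0, 3.0], "e2b": [2.0, 4.0]}
comment = format_pr_comment(sample)
print(comment)
```

Keeping the formatting in a pure function like this makes it easy to check the comment body without calling the GitHub API.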

Testing

The workflow can be tested manually with the "Run workflow" button in GitHub Actions; otherwise it triggers automatically on the next PR merge.

Compliance

  • ✅ Follows repository rules (no direct push to main)
  • ✅ Uses existing benchmark infrastructure
  • ✅ Maintains backward compatibility
  • ✅ Respects pre-commit hooks and formatting


- Create merge-benchmark.yml workflow that triggers on PR merges
- Run small benchmarks with 10 iterations per provider
- Add merge-small.json config for lightweight benchmark settings
- Update comprehensive benchmark workflow naming for clarity
- Document dual benchmark system in README
- Distinguish between small (merge-triggered) and comprehensive (manual/scheduled) benchmarks

@codegen-sh codegen-sh bot force-pushed the codegen-bot/setup-merge-benchmarks-1751837819 branch from 89ee9ff to fb31df7 on July 6, 2025 21:47