
Conversation

@spencer-tb spencer-tb commented Nov 14, 2025

🗒️ Description

This PR adds a CLI tool benchmark_parser to automatically scan benchmark tests and generate a configuration file .fixed_opcode_counts.json for the --fixed-opcode-count feature from #1747.

Key Changes

  • New CLI tool: uv run benchmark_parser
    • Uses Python AST to scan tests/benchmark/ for tests with @pytest.mark.repricing marker
    • Extracts opcode patterns from @pytest.mark.parametrize decorators
    • Generates .fixed_opcode_counts.json at repo root with opcode counts mapping
    • Supports --check mode for CI validation: uv run benchmark_parser --check
  • Config file format: .fixed_opcode_counts.json
    • Gitignored (user-local configuration)
    • All patterns default to [1] (1K opcodes)
    • Users can customize counts per pattern (e.g. [1, 10, 100] for 1K, 10K, and 100K opcodes) by manually editing the file
    • Custom counts are preserved when re-running the parser.
  • Help text improvements:
    • Added benchmark options to fill --fill-help and execute remote --execute-remote-help
    • Simplified help text with examples for --gas-benchmark-values and --fixed-opcode-count
  • Test updates:
    • Renamed op parameters to opcode in test_arithmetic.py for consistency
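The AST-based scan described above can be sketched roughly as follows. This is an illustrative, self-contained example only: the names `find_repricing_tests` and `_marker_name` are hypothetical and not the PR's actual API.

```python
# Hypothetical sketch of the AST scan the PR describes: find test functions
# carrying @pytest.mark.repricing. Names here are illustrative only.
import ast


def _marker_name(dec: ast.expr) -> str:
    """Return the mark name for a pytest.mark.* decorator, '' otherwise."""
    target = dec.func if isinstance(dec, ast.Call) else dec
    if (
        isinstance(target, ast.Attribute)
        and isinstance(target.value, ast.Attribute)
        and isinstance(target.value.value, ast.Name)
        and target.value.value.id == "pytest"
        and target.value.attr == "mark"
    ):
        return target.attr
    return ""


def find_repricing_tests(source: str) -> list[str]:
    """Names of test functions marked @pytest.mark.repricing."""
    found = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.FunctionDef) and node.name.startswith("test_"):
            if any(_marker_name(d) == "repricing" for d in node.decorator_list):
                found.append(node.name)
    return found


src = '''
import pytest

@pytest.mark.repricing
@pytest.mark.parametrize("opcode", [1, 2])
def test_add(opcode):
    pass

def test_other():
    pass
'''
print(find_repricing_tests(src))  # ['test_add']
```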

Usage

  1. Generate/update config (first time or after benchmark test changes): uv run benchmark_parser
  2. Customize counts by editing .fixed_opcode_counts.json:
{
  "scenario_configs": {
    "test_codecopy.*": [
      1
    ],
    ...
  }
}
  3. Run with configured opcode counts:
# Fill fixtures (useful for fast one-shot checks when the count is 1)
uv run fill --fixed-opcode-count --fork Prague -m repricing tests/benchmark

# Execute on remote RPC
uv run execute remote --fixed-opcode-count --fork Prague -m repricing tests/benchmark --rpc-seed-key <key> --rpc-endpoint <url> --chain-id <id>

Fill works correctly, and I tested execute remote on Hoodi with the 1K opcode count set; the latest txs: https://hoodi.etherscan.io/address/0x83fd666bfb2b345f932c3e4e04b6d85e5ed3568d

Future Items

  • Add CI for fill/execute with --fixed-opcode-count after generating the config file.
  • Verify --fixed-opcode-count with debug_traceTransaction using execute hive.
  • Add documentation & framework tests.

🔗 Related Issues or PRs

#1747

✅ Checklist

  • All: Ran fast tox checks to avoid unnecessary CI fails, see also Code Standards and Enabling Pre-commit Checks:
    uvx tox -e static
  • All: PR title adheres to the repo standard - it will be used as the squash commit message and should start with type(scope):.
  • All: Considered adding an entry to CHANGELOG.md.
  • All: Considered updating the online docs in the ./docs/ directory.
  • All: Set appropriate labels for the changes (only maintainers can apply labels).

@spencer-tb spencer-tb added C-enhance Category: an improvement or new feature A-test-benchmark Area: Tests Benchmarks—Performance measurement (eg. `tests/benchmark/*`, `p/t/s/e/benchmark/*`) P-high labels Nov 15, 2025

@LouisTsai-Csie LouisTsai-Csie left a comment


Thanks a lot for this! I left some suggestions, but I’m happy to discuss further. I’ll share this with Kamil to confirm it aligns with their needs.

Comment on lines 165 to 184
if has_repricing:
    if fixed_opcode_counts:
        opcode_counts = [
            int(x.strip()) for x in fixed_opcode_counts.split(",")
opcode_counts_to_use = None

This contains a few logic issues, but they will be resolved in my upcoming PR.

@spencer-tb spencer-tb force-pushed the enhance/benchmarking/fixed-opcode-count-config branch from 7ca99dc to 7c68d19 on December 4, 2025 16:34
codecov bot commented Dec 4, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 87.31%. Comparing base (2a6f9ee) to head (4427257).
⚠️ Report is 1 commit behind head on forks/osaka.

Additional details and impacted files
@@             Coverage Diff              @@
##           forks/osaka    #1790   +/-   ##
============================================
  Coverage        87.31%   87.31%           
============================================
  Files              541      541           
  Lines            32832    32832           
  Branches          3015     3015           
============================================
  Hits             28668    28668           
  Misses            3557     3557           
  Partials           607      607           
Flag Coverage Δ
unittests 87.31% <ø> (ø)

Flags with carried forward coverage won't be shown.



@LouisTsai-Csie LouisTsai-Csie left a comment


Added some suggestions for the parser! Thanks

Comment on lines 9 to 11
uv run python parser.py --check # Check for new/missing entries (CI)
uv run python parser.py --dry-run # Show config without changes
uv run python parser.py --update # Update fixed_opcode_counts.py

I'm not sure, but this doesn't seem to work for me. I still need to specify the path, like:

uv run tests/benchmark/configs/parser.py --check

I also wonder whether we should place this script under packages/testing/src/execution_testing/cli/, since most related scripts, such as extract_config and diff_opcode_count, are located there. wdyt?

Comment on lines 410 to 414
parser.add_argument(
    "--no-filter",
    action="store_true",
    help="Include all benchmark tests, not just those with code_generator",
)

My personal view: we could remove this and related logic, wdyt?

Looking into the fixed-opcode-count logic, only tests that are (1) BenchmarkTestFiller and (2) use a code_generator can support this mode.

This constraint was added in PR #1810 with a comment.

Comment on lines 37 to 49
if not node.name.startswith("test_"):
    self.generic_visit(node)
    return

# Check if function has benchmark_test parameter
if not self._has_benchmark_test_param(node):
    self.generic_visit(node)
    return

# Optional: Filter for code generator usage
if self.filter_code_gen and not self._uses_code_generator(node):
    self.generic_visit(node)
    return

I’m not sure whether we need self.generic_visit(node) in each of the if–else branches. My understanding is that once a condition matches, it will recursively visit the nested function anyway.

This seems relevant only for cases like:

def test_function1(...):

    def test_function2(...):
        ...

If so, maybe we could remove the logic here:

Suggested change

Before:

if not node.name.startswith("test_"):
    self.generic_visit(node)
    return
# Check if function has benchmark_test parameter
if not self._has_benchmark_test_param(node):
    self.generic_visit(node)
    return
# Optional: Filter for code generator usage
if self.filter_code_gen and not self._uses_code_generator(node):
    self.generic_visit(node)
    return

After:

if not node.name.startswith("test_"):
    return
# Check if function has benchmark_test parameter
if not self._has_benchmark_test_param(node):
    return
# Optional: Filter for code generator usage
if self.filter_code_gen and not self._uses_code_generator(node):
    return
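The behaviour in question is easy to check with a minimal visitor (illustrative only; `Collector` and `SRC` are hypothetical names): a `visit_FunctionDef` that returns without calling `generic_visit` does not descend into nested function definitions, which is the only situation the early `generic_visit` calls cover.

```python
# Minimal check of the generic_visit question above (illustrative only):
# without calling generic_visit inside visit_FunctionDef, nested defs
# are never visited.
import ast

SRC = """
def test_outer():
    def test_inner():
        pass
"""


class Collector(ast.NodeVisitor):
    def __init__(self, recurse: bool):
        self.recurse = recurse
        self.seen: list[str] = []

    def visit_FunctionDef(self, node: ast.FunctionDef) -> None:
        self.seen.append(node.name)
        if self.recurse:
            self.generic_visit(node)  # descend into the function body


shallow = Collector(recurse=False)
shallow.visit(ast.parse(SRC))
print(shallow.seen)  # ['test_outer']

deep = Collector(recurse=True)
deep.visit(ast.parse(SRC))
print(deep.seen)  # ['test_outer', 'test_inner']
```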

Comment on lines 100 to 102
# Check if "opcode" or "op" is in parameter names
if "opcode" not in param_str and param_str not in ("op", "op,"):
    continue

I would suggest using only opcode as the keyword, instead of three combinations. This needs detailed documentation!

This would require us to refactor the parametrization for the following tests:

  • test_arithmetic.py: test_mod (op), test_mod_arithmetic (op),

Comment on lines 138 to 157
# Case 1: Op.ADD
if isinstance(node, ast.Attribute):
    return node.attr

# Case 2: pytest.param(Op.ADD, ...) or pytest.param((Op.ADD, x), ...)
if isinstance(node, ast.Call):
    if len(node.args) > 0:
        first_arg = node.args[0]
        if isinstance(first_arg, ast.Attribute):
            return first_arg.attr
        elif isinstance(first_arg, ast.Tuple) and first_arg.elts:
            first_elem = first_arg.elts[0]
            if isinstance(first_elem, ast.Attribute):
                return first_elem.attr

# Case 3: Tuple like (Op.ADD, args)
if isinstance(node, ast.Tuple) and node.elts:
    first_elem = node.elts[0]
    if isinstance(first_elem, ast.Attribute):
        return first_elem.attr

More explanation for the logic here! Suggestion:

Case 1: Direct opcode reference
Example: @pytest.mark.parametrize("opcode", [Op.ADD, Op.MUL])
Result: ADD, MUL

Case 2a: Pytest param with tuple
Example: @pytest.mark.parametrize("opcode,args", [pytest.param((Op.ADD, 123))])
Result: ADD

Case 2b: Plain tuples
Example: @pytest.mark.parametrize("opcode,args", [(Op.ADD, 123), (Op.MUL, 456)])
Result: ADD, MUL

And here we assume the parametrized name is always "opcode"; this should be documented somewhere.


IIUC, this would not be recognized

@pytest.mark.parametrize("opcode_args,opcode", [pytest.param((123, Op.ADD))])

Comment on lines 205 to 221
categories: dict[str, list[str]] = {
    "ACCOUNT QUERY OPERATIONS": [],
    "ARITHMETIC OPERATIONS": [],
    "BITWISE OPERATIONS": [],
    "BLOCK CONTEXT OPERATIONS": [],
    "CALL CONTEXT OPERATIONS": [],
    "COMPARISON OPERATIONS": [],
    "CONTROL FLOW OPERATIONS": [],
    "HASHING OPERATIONS": [],
    "LOGGING OPERATIONS": [],
    "MEMORY OPERATIONS": [],
    "STACK OPERATIONS": [],
    "STORAGE OPERATIONS": [],
    "SYSTEM OPERATIONS": [],
    "TRANSACTION CONTEXT OPERATIONS": [],
    "OTHER": [],
}

The current keyword-based solution requires manually updating the mapping, but I notice a pattern here:

"ACCOUNT QUERY OPERATIONS": [
      "selfbalance",
      "codesize",
      "ext_account",
      "BALANCE",
      "EXTCODE",
  ],

The key (ACCOUNT QUERY OPERATIONS) here could be derived from the file name:
test_account_query_operations.py. So when we loop over the benchmark tests, we could also record each test's parent file (in the scan_benchmark_tests function).

And then update the categorize_patterns function like this:

    ...

    categories: dict[str, list[str]] = {}

    for pattern in config.keys():
        if pattern not in pattern_sources:
            # Fallback for patterns without source info
            category = "OTHER"
        else:
            source_file = pattern_sources[pattern]
            # Extract test file name: test_arithmetic.py -> arithmetic
            file_name = source_file.stem  # Gets filename without extension
            if file_name.startswith("test_"):
                category_name = file_name[5:]  # Remove "test_" prefix
                # arithmetic -> ARITHMETIC OPERATIONS
                category = (
                    f"{category_name.upper().replace('_', ' ')} OPERATIONS"
                )
            else:
                category = "OTHER"

        if category not in categories:
            categories[category] = []
        categories[category].append(pattern)

    # Sort patterns within each category and sort categories alphabetically
    return {k: sorted(v) for k, v in sorted(categories.items())}
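The filename-to-category derivation suggested above can be checked in isolation (illustrative sketch; `category_for` is a hypothetical name wrapping the same stem-based logic):

```python
# Standalone check of the filename-to-category derivation suggested above
# (illustrative only): test_arithmetic.py -> "ARITHMETIC OPERATIONS".
from pathlib import Path


def category_for(source_file: Path) -> str:
    file_name = source_file.stem  # filename without extension
    if file_name.startswith("test_"):
        name = file_name[len("test_"):]  # remove "test_" prefix
        return f"{name.upper().replace('_', ' ')} OPERATIONS"
    return "OTHER"


print(category_for(Path("tests/benchmark/test_arithmetic.py")))  # ARITHMETIC OPERATIONS
print(category_for(Path("tests/benchmark/conftest.py")))         # OTHER
```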


Based on this approach, we could remove category_keywords

@spencer-tb spencer-tb force-pushed the enhance/benchmarking/fixed-opcode-count-config branch 2 times, most recently from 5506d25 to 14b6b64 on December 5, 2025 19:15
@spencer-tb spencer-tb force-pushed the enhance/benchmarking/fixed-opcode-count-config branch from 14b6b64 to 697a7d0 on December 5, 2025 19:18
@spencer-tb spencer-tb marked this pull request as ready for review December 5, 2025 19:33
