feat: add parameter sweep command for variant generation and ranking by claytonlin1110 · Pull Request #118 · llmsresearch/paperbanana

claytonlin1110 · 2026-03-25T03:53:19Z

Summary

Closes #119

add new paperbanana sweep CLI command to run a cartesian sweep across providers, models, iterations, and optimization/auto-refine modes
add paperbanana/core/sweep.py with structured variant planning, CSV axis parsing, ranking, and summary helpers
persist sweep outputs under sweep_<id>/variant_<id>/ and write sweep_report.json with per-variant status, runtime, and ranked results
include dry-run mode to preview planned variants without API calls
add tests for sweep core helpers and CLI dry-run / validation behavior
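
For readers unfamiliar with the approach, the variant planning described above can be sketched as follows. This is a minimal illustration, not the code in paperbanana/core/sweep.py: the names `parse_csv_axis`, `Variant`, and `plan_variants`, the `variant_NNN` ID format, the "on"/"off" flag values, and the provider and model names are all hypothetical placeholders.

```python
import itertools
from dataclasses import dataclass


def parse_csv_axis(raw: str) -> list[str]:
    """Split a comma-separated flag value into trimmed, de-duplicated items."""
    seen: dict[str, None] = {}
    for item in raw.split(","):
        item = item.strip()
        if item:
            seen.setdefault(item, None)  # dict keeps first-seen order
    return list(seen)


@dataclass(frozen=True)
class Variant:
    # Hypothetical shape of one planned run; the real fields may differ.
    variant_id: str
    provider: str
    model: str
    iterations: int
    optimize: bool


def plan_variants(providers: str, models: str, iterations: str, optimize: str) -> list[Variant]:
    """Build the cartesian product of all axes without calling any API."""
    axes = itertools.product(
        parse_csv_axis(providers),
        parse_csv_axis(models),
        [int(n) for n in parse_csv_axis(iterations)],
        [flag.lower() == "on" for flag in parse_csv_axis(optimize)],
    )
    return [
        Variant(f"variant_{i:03d}", p, m, n, o)
        for i, (p, m, n, o) in enumerate(axes, start=1)
    ]


# A dry run only needs the plan: 2 providers x 1 model x 2 iteration counts x 2 modes.
plan = plan_variants("provider-a,provider-b", "model-a", "1,3", "on,off")
for variant in plan:
    print(variant)
```

Because planning is separate from execution, a dry-run mode can print the plan and exit before any API quota is spent.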

Motivation

One diagram depends on many settings (providers, models, iterations, optimize/auto-refine). Changing flags and comparing runs by hand is slow, and the results are easy to lose track of. A sweep runs those combinations in one go, writes a single ranked report, and supports dry-run so you can plan before spending API quota.

claytonlin1110 · 2026-03-25T03:55:25Z

@dippatel1994 Would you please review this?

claytonlin1110 · 2026-03-25T04:43:37Z

@dippatel1994 please review

claytonlin1110 · 2026-03-30T04:40:06Z

@dippatel1994 Any update about this feature?

dippatel1994

CI passes, nice work. A few things to fix:

Settings constructed per-variant inside the loop — load_dotenv() should be called once before the loop, and Settings should be built once then copied with model_copy(update=overrides) per variant. Currently re-parses YAML on every iteration.
Missing --pdf-pages option — Every other command that accepts --input with PDF support also exposes --pdf-pages. The sweep command calls load_methodology_source(input_path) without it.
No non-dry-run test — Only --dry-run and validation are tested. Add a test that mocks the pipeline and verifies the sweep report structure (status, ranked results, timing). test_ablate_retrieval_writes_report is a good template.
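
The first point (build the settings once, then copy per variant) can be illustrated with a small sketch. It assumes nothing about the real Settings class: a frozen dataclass with `dataclasses.replace` stands in for pydantic's `model_copy(update=overrides)`, and `load_settings` is a hypothetical stand-in for the expensive step of reading .env and parsing YAML.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class Settings:
    # Hypothetical fields; the real Settings model has its own schema.
    provider: str = "provider-a"
    model: str = "model-a"
    iterations: int = 1


def load_settings() -> Settings:
    """Stand-in for the expensive part (reading .env and parsing YAML)."""
    load_settings.calls += 1
    return Settings()


load_settings.calls = 0  # counter to show how often the expensive step runs

overrides_per_variant = [
    {"model": "model-b"},
    {"iterations": 3},
    {"provider": "provider-b", "iterations": 2},
]

# Build the base settings once, before the loop ...
base = load_settings()

# ... then derive a cheap per-variant copy with only the overrides applied.
variant_settings = [replace(base, **overrides) for overrides in overrides_per_variant]

print(load_settings.calls)
for settings in variant_settings:
    print(settings)
```

The base object is never mutated, so each variant starts from the same configuration regardless of the order in which variants run.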

Non-blocking: The quality proxy formula (100 - 12.5 * suggestions) is undocumented and fragile — consider at minimum documenting it in --help. Also missing --budget and --auto-download-data flags for parity with generate/batch.
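
To make the quoted formula concrete, here is a minimal sketch of scoring and ranking with it. Only the expression 100 - 12.5 * suggestions comes from the review comment; the function name, the result dictionaries, and the choice to exclude failed variants from the ranking are assumptions for illustration.

```python
def quality_proxy(suggestions: int) -> float:
    """Score from the review comment: 100 minus 12.5 per suggestion."""
    return 100 - 12.5 * suggestions


# Hypothetical per-variant results; the real report schema may differ.
results = [
    {"variant": "variant_001", "status": "ok", "suggestions": 2},
    {"variant": "variant_002", "status": "ok", "suggestions": 0},
    {"variant": "variant_003", "status": "failed", "suggestions": None},
    {"variant": "variant_004", "status": "ok", "suggestions": 9},
]

# Rank only the variants that completed, best score first.
ranked = sorted(
    (r for r in results if r["status"] == "ok"),
    key=lambda r: quality_proxy(r["suggestions"]),
    reverse=True,
)

for r in ranked:
    print(r["variant"], quality_proxy(r["suggestions"]))
```

As written, the score reaches 0 at eight suggestions and goes negative beyond that, which may be part of why the reviewer calls the formula fragile. Whether the actual implementation clamps the value is not stated in this thread.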

claytonlin1110 · 2026-04-02T20:22:31Z

Thank you for your feedback, @dippatel1994.
I just updated the PR.

dippatel1994

All 3 points addressed. Settings built once with model_copy per variant, --pdf-pages added, non-dry-run test added. CI green. LGTM.

claytonlin1110 · 2026-04-02T21:09:55Z

Thanks, @dippatel1994.
Is this ready to be merged?

feat: add parameter sweep command for variant generation and ranking

d385cb8

claytonlin1110 added 2 commits March 24, 2026 22:55

fix: lint

85a5817

fix: lint

465f2ab

Merge branch 'main' into feat/cli-parameter-sweep

1da07bb

dippatel1994 requested changes Apr 2, 2026

View reviewed changes

fix: update

3a570d3

claytonlin1110 requested a review from dippatel1994 April 2, 2026 20:21

dippatel1994 approved these changes Apr 2, 2026

View reviewed changes

claytonlin1110 wants to merge 5 commits into llmsresearch:main from claytonlin1110:feat/cli-parameter-sweep
