
Conversation


@aseembits93 aseembits93 commented Dec 23, 2025

PR Type

Enhancement


Description

  • Add '--no-gen-test' CLI flag

  • Disable test generation when flag set

  • Honor flag in optimization test pipeline


Diagram Walkthrough

flowchart LR
  CLI["CLI args parser"] -- "--no-gen-test flag" --> Args["Runtime args"]
  Args["Runtime args"] -- "no_gen_test=True" --> Gen["Test generation"]
  Gen["Test generation"] -- "skip creating tests" --> Pipeline["Optimization pipeline"]

File Walkthrough

Relevant files
Enhancement
cli.py
Introduce '--no-gen-test' CLI argument                                     

codeflash/cli_cmds/cli.py

  • Add new CLI argument --no-gen-test.
  • Provide help text describing behavior.
+3/-0     
function_optimizer.py
Conditionally disable test generation in optimizer             

codeflash/optimization/function_optimizer.py

  • Respect args.no_gen_test to skip generated tests.
  • Return empty GeneratedTestsList when flag enabled.
+4/-1     

@CLAassistant

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.


Codeflash Bot does not appear to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account.
You have signed the CLA already but the status is still pending? Let us recheck it.

@github-actions

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 2 🔵🔵⚪⚪⚪
🧪 No relevant tests
🔒 No security concerns identified
⚡ Recommended focus areas for review

Logic Consistency

When --no-gen-test is set, count_tests still includes concolic tests in the logged count, but the returned GeneratedTestsList is empty. Verify downstream consumers don’t rely on count_tests to match the length of generated_tests, and whether concolic tests should also be skipped under this flag.

count_tests = len(tests)
if concolic_test_str:
    count_tests += 1

logger.info(f"!lsp|Generated '{count_tests}' tests for '{self.function_to_optimize.function_name}'")

if self.args.no_gen_test:
    generated_tests = GeneratedTestsList(generated_tests=[])
else:
    generated_tests = GeneratedTestsList(generated_tests=tests)
return Success((count_tests, generated_tests, function_to_concolic_tests, concolic_test_str))
Help Text Clarity

The flag name is --no-gen-test but the help mentions “Do not generate tests”. Consider clarifying whether concolic tests are also disabled to align user expectations with behavior in the optimizer.

parser.add_argument(
    "--no-gen-test", action="store_true", help="Do not generate tests, use only existing tests for optimization."
)

@github-actions

PR Code Suggestions ✨

Explore these optional code suggestions:

Category: General
General
Make logging reflect skip

Adjust the log message when --no-gen-test is used to avoid misleading output.
Explicitly state that generation was skipped so operators don't misinterpret that
tests were produced.

codeflash/optimization/function_optimizer.py [1345-1346]

-logger.info(f"!lsp|Generated '{count_tests}' tests for '{self.function_to_optimize.function_name}'")
+if self.args.no_gen_test:
+    logger.info(f"!lsp|Skipped test generation for '{self.function_to_optimize.function_name}' due to --no-gen-test")
+else:
+    logger.info(f"!lsp|Generated '{count_tests}' tests for '{self.function_to_optimize.function_name}'")
Suggestion importance[1-10]: 6


Why: Updating the log when --no-gen-test is active improves accuracy and avoids confusion; it’s straightforward and contextually correct with minimal risk.

Impact: Low
Add backward-compatible plural flag

Normalize the flag name to plural to match phrasing elsewhere and reduce confusion.
Keep a backward-compatible alias so existing scripts don't break.

codeflash/cli_cmds/cli.py [83-85]

 parser.add_argument(
-    "--no-gen-test", action="store_true", help="Do not generate tests, use only existing tests for optimization."
+    "--no-gen-tests",
+    "--no-gen-test",
+    dest="no_gen_test",
+    action="store_true",
+    help="Do not generate tests; use only existing tests for optimization."
 )
Suggestion importance[1-10]: 5


Why: Adding a plural alias enhances UX while preserving dest for compatibility; it’s a minor, correct improvement with modest impact.

Impact: Low

Category: Possible issue
Keep counts consistent with outputs

Preserve count_tests consistency with generated_tests. If --no-gen-test is set, also
set count_tests based on the empty list so downstream logic doesn't assume tests
exist. This avoids mismatches between the reported number of tests and the actual
generated list.

codeflash/optimization/function_optimizer.py [1347-1351]

 if self.args.no_gen_test:
     generated_tests = GeneratedTestsList(generated_tests=[])
+    count_tests = 0
+    concolic_test_str = None
+    function_to_concolic_tests = {}
 else:
     generated_tests = GeneratedTestsList(generated_tests=tests)
Suggestion importance[1-10]: 3


Why: Adjusting count_tests for --no-gen-test avoids a potential mismatch, but zeroing counts and nulling concolic_test_str/function_to_concolic_tests may be incorrect if concolic results were still computed. Partial correctness and uncertain impact lower the score.

Impact: Low

@claude

claude bot commented Dec 23, 2025

Code Review for PR #988: No Generated Tests Argument

Thanks for adding this feature! I've reviewed the changes and have some feedback on code quality, potential bugs, and improvements.


🐛 Critical Bug: Logic Issue in generate_tests

Location: codeflash/optimization/function_optimizer.py:1347-1350

Problem: This implementation has a significant logic flaw. The function still performs all the expensive test generation work (lines 1299-1340) including:

  • Submitting test generation tasks to executor
  • Generating concolic tests
  • Waiting for all futures to complete
  • Processing and validating results

Then it simply discards the results at the end if --no-gen-test is set. This wastes computational resources and time.

Impact:

  • Users expecting performance benefits from --no-gen-test will be disappointed
  • The flag name suggests tests won't be generated, but they actually are - just not used
  • Misleading behavior that doesn't match user expectations

Recommended Fix:
Add an early return at the beginning of the generate_tests method to skip test generation entirely when the flag is set.
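The early-return idea can be sketched as follows. This is a minimal illustration, not the real optimizer code: `GeneratedTestsList`, `Success`, and the `generate_fn` callback here are simplified stand-ins for the actual codeflash types and the expensive generation work.

```python
# Hedged sketch: check the flag before doing any generation work.
# All names below are simplified stand-ins for the real codeflash types.
from dataclasses import dataclass, field


@dataclass
class GeneratedTestsList:
    generated_tests: list = field(default_factory=list)


@dataclass
class Success:
    value: tuple


def generate_tests(args, generate_fn):
    # Early return: when the flag is set, none of the expensive
    # generation work below ever runs.
    if args.get("no_gen_test"):
        return Success((0, GeneratedTestsList(), {}, None))
    tests = generate_fn()  # stands in for the AI-backed generation calls
    return Success((len(tests), GeneratedTestsList(tests), {}, None))


result = generate_tests({"no_gen_test": True}, lambda: ["t1", "t2"])
print(len(result.value[1].generated_tests))  # 0 — nothing was generated
```

With the check at the top, the executor submission, concolic generation, and future-waiting are never reached when the flag is set.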


⚠️ Potential Issue: Downstream Impact

Location: codeflash/optimization/function_optimizer.py:391-410

The code that processes generated_tests results (lines 391-410) iterates over the generated tests. When generated_tests is empty, this loop won't execute, which means:

  • No test files will be written
  • No TestFile objects will be added to self.test_files
  • The optimization will proceed with only existing tests
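The empty-list behavior described above can be illustrated with a trivial sketch (`test_files` here is a hypothetical stand-in for `self.test_files`, not the real attribute handling):

```python
# Hedged illustration: iterating an empty generated-tests list means the
# write loop body never executes, so no test files are produced and the
# run falls back to existing tests only.
generated_tests = []  # empty when --no-gen-test is set
test_files = []       # hypothetical stand-in for self.test_files

for test in generated_tests:  # zero iterations on an empty list
    test_files.append(test)

if not test_files:
    print("No generated tests; proceeding with existing tests only.")
```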

Verification Needed:

  • Does the optimization pipeline gracefully handle the case where no generated tests exist?
  • Are there any assertions or validations downstream that expect at least some generated tests?
  • Should there be a log message indicating the optimizer is proceeding with existing tests only?

📝 Code Quality Suggestions

  1. Add Logging for Transparency - Users should be informed when test generation is skipped
  2. CLI Help Text Enhancement - Could be more descriptive about performance benefits
  3. Consider Validation - Should there be validation that existing tests are sufficient when --no-gen-test is used?

🧪 Test Coverage

Missing: No tests were added for this new functionality.

Recommended Tests:

  1. Unit test for the CLI argument parsing
  2. Integration test verifying that when --no-gen-test is set test generation is actually skipped
  3. Performance test confirming the flag improves execution time
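A unit test for the CLI argument could look like the sketch below. It builds a local parser with the same flag definition rather than importing codeflash's real `parse_args` (whose full argument set isn't shown in this PR), so treat `build_parser` as a hypothetical helper:

```python
# Hedged sketch of a unit test for the new flag, using a locally built
# argparse parser that mirrors the flag definition from this PR.
import argparse


def build_parser():
    parser = argparse.ArgumentParser()
    parser.add_argument(
        "--no-gen-test",
        action="store_true",
        help="Do not generate tests, use only existing tests for optimization.",
    )
    return parser


def test_no_gen_test_flag():
    parser = build_parser()
    assert parser.parse_args([]).no_gen_test is False
    assert parser.parse_args(["--no-gen-test"]).no_gen_test is True


test_no_gen_test_flag()
```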

Performance Considerations

Current Implementation: ❌ Poor - Does all the work then discards results
After Fix: ✅ Good - Skips expensive operations entirely

Expected performance improvement with proper implementation:

  • Avoids AI service calls for test generation
  • Saves thread pool executor time
  • Reduces overall optimization time when users have existing tests

What's Good

  • Clear flag naming (--no-gen-test)
  • Consistent with existing patterns (similar to --no-pr)
  • Simple boolean flag approach
  • Good PR description with diagram

📋 Summary

Must Fix:

  • Logic bug: Move the flag check to the beginning of generate_tests() to actually skip generation

Should Consider:

  • Add logging when skipping test generation
  • Verify downstream handling of empty generated tests
  • Add test coverage
  • Enhance help text

Let me know if you'd like help implementing any of these suggestions!

"--no-pr", action="store_true", help="Do not create a PR for the optimization, only update the code locally."
)
parser.add_argument(
"--no-gen-test", action="store_true", help="Do not generate tests, use only existing tests for optimization."
Contributor


--no-gen-tests

logger.info(f"!lsp|Generated '{count_tests}' tests for '{self.function_to_optimize.function_name}'")

generated_tests = GeneratedTestsList(generated_tests=tests)
if self.args.no_gen_test:
Contributor


don't do it this way! don't generate the tests in the first place

Contributor Author


gotcha, was done in a hurry, i've fixed it, will add/modify an end-to-end ci to ensure it works

@claude

claude bot commented Dec 23, 2025

Pull Request Review: No Generated Tests Argument

Thank you for this PR! The feature to disable test generation is useful for optimization workflows. However, I've identified several issues that need to be addressed.


Critical Bug: Undefined Variable Error

Location: codeflash/optimization/function_optimizer.py:1298-1320

The code has a latent NameError hazard when --no-gen-tests is enabled. When self.args.no_gen_tests is True, the variable future_tests is never defined (line 1300), yet it is referenced in lines 1314 and 1320. Both references happen to sit inside if not self.args.no_gen_tests: blocks, so no error occurs today, but the structure is fragile: any future reference to future_tests outside those guards would crash.

Fix: Initialize future_tests before the conditional:
future_tests = []
if not self.args.no_gen_tests:
    future_tests = self.submit_test_generation_tasks(...)


Logic Issue: Empty Tests List Not Handled Properly

Location: codeflash/optimization/function_optimizer.py:1340-1342

When --no-gen-tests is set, the warning and failure return are skipped, which means the tests list remains empty, count_tests will be 0, and the function returns Success with an empty GeneratedTestsList.

Recommendation: Add a comment explaining that an empty tests list is valid when --no-gen-tests is set.


Code Quality Issues

  1. Code Duplication: The condition if not self.args.no_gen_tests: is repeated 3 times. Consider refactoring with a boolean variable.

  2. Missing Documentation: The generate_tests() method docstring doesn't mention the behavior when test generation is disabled.
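The de-duplication idea can be sketched as below: compute the condition once and reuse it. The class and method bodies here are illustrative stand-ins, not the real `FunctionOptimizer` code.

```python
# Hedged sketch of the refactor: a single should_generate boolean instead
# of repeating "if not self.args.no_gen_tests:" three times.
class Optimizer:
    def __init__(self, no_gen_tests):
        self.no_gen_tests = no_gen_tests

    def generate_tests(self):
        should_generate = not self.no_gen_tests  # single source of truth
        future_tests = []
        if should_generate:
            # stands in for submitting generation tasks to the executor
            future_tests = ["task-1", "task-2"]
        tests = list(future_tests) if should_generate else []
        return tests


print(Optimizer(no_gen_tests=True).generate_tests())   # []
print(Optimizer(no_gen_tests=False).generate_tests())  # ['task-1', 'task-2']
```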


Test Coverage Concerns

Critical: No tests found for this new feature. The PR should include:

  • Unit tests for the CLI argument parsing
  • Integration tests verifying test generation is skipped when flag is set
  • Tests confirming concolic tests are still generated
  • Tests verifying the optimization pipeline continues to work correctly

Performance Considerations

Positive: This change should improve performance when users don't need generated tests, as it skips expensive AI test generation calls.


Summary of Required Changes

Must Fix:

  1. Initialize future_tests before the conditional
  2. Add unit/integration tests for the new feature

Should Consider:

  1. Add clarifying comments about empty tests list being valid
  2. Update method docstring
  3. Consider refactoring repeated conditionals

Overall Assessment

The core concept is solid and useful, but the implementation has a critical scoping issue that needs to be fixed. With the suggested changes and proper test coverage, this will be a valuable addition to the codebase.
