
Conversation

@dasarchan
Contributor

Adds a call to cfapi that pushes hashes of the function's code context, in order to check whether the function has already been optimized.

One thing to note here is that this checking logic happens early, during function discovery.
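The idea can be sketched as follows (the function name and hashing scheme here are illustrative assumptions, not the actual cfapi contract):

```python
import hashlib


def code_context_hash(code_context: str) -> str:
    """Hash a function's code context so the backend can spot repeats."""
    return hashlib.sha256(code_context.encode("utf-8")).hexdigest()


# During discovery, each candidate function's hash would be collected and
# sent to the backend in one request; the response says which hashes were
# already optimized, and those functions are skipped.
contexts = {"my_func": "def my_func(x):\n    return x + 1\n"}
hashes = {name: code_context_hash(src) for name, src in contexts.items()}
```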

@dasarchan dasarchan requested a review from misrasaurabh1 June 3, 2025 18:03
@dasarchan dasarchan self-assigned this Jun 3, 2025


def check_optimization_status(
functions_by_file: dict[Path, list[FunctionToOptimize]],
Contributor

Q: What is the scenario where the function is already code that has been optimized by CF?

Contributor Author

Right now it would actually try to reoptimize it - @misrasaurabh1, what's the desired behavior here?

codeflash-ai bot added a commit that referenced this pull request Jun 5, 2025
…275 (`dont-optimize-repeatedly-gh-actions`)

Here is the optimized version of your program, focusing on speeding up the slow path in `make_cfapi_request`, which is dominated by `json.dumps(payload, indent=None, default=pydantic_encoder)` and the use of `requests.post(..., data=json_payload, ...)`. 

Key optimizations:

- **Use `requests.post(..., json=payload, ...)`:** This lets `requests` do the JSON serialization more efficiently (internally uses `json.dumps`). Furthermore, `requests` will add the `Content-Type: application/json` header if you use the `json` argument.
- **Only use the custom encoder if really needed:** Only pass `default=pydantic_encoder` if the payload contains objects requiring it. If not, the standard encoder is much faster. You can try a direct serialization and fall back if a `TypeError` is raised.
- **Avoid repeated `.upper()`** inside the POST/GET dispatch by normalizing early.
- **Avoid unnecessary string interpolation.**
- **Avoid updating headers dict when not needed.**
- **Other micro-optimizations:** Use local variables, merge dicts once, etc.
All comments are preserved; they were modified or added only where the code changed.



**Explanation of biggest win:**  
The largest bottleneck was in JSON encoding and in manually setting the content-type header. Now, `requests.post(..., json=payload)` is used for the fastest path in the vast majority of requests, only falling back to a slower path if necessary. This should substantially speed up both serialization and POST.

This approach is backward-compatible and will produce exactly the same results as before.
@codeflash-ai
Contributor

codeflash-ai bot commented Jun 5, 2025

⚡️ Codeflash found optimizations for this PR

📄 44% (0.44x) speedup for is_function_being_optimized_again in codeflash/api/cfapi.py

⏱️ Runtime: 2.79 milliseconds → 1.94 milliseconds (best of 74 runs)

I created a new dependent PR with the suggested changes. Please review:

If you approve, it will be merged into this PR (branch dont-optimize-repeatedly-gh-actions).

codeflash-ai bot added a commit that referenced this pull request Jun 5, 2025
…in PR #275 (`dont-optimize-repeatedly-gh-actions`)

Here is an optimized version of your code, targeting the areas highlighted as slowest in your line profiling.

### Key Optimizations

1. **Read Only Necessary Lines:**
   - When `starting_line` and `ending_line` are provided, instead of reading the entire file and calling `.splitlines()`, read only the lines needed. This drastically lowers memory use and speeds up file operations for large files.
   - Uses `itertools.islice` to efficiently pluck only relevant lines.

2. **String Manipulation Reduction:**
   - Reduce the number of intermediate string allocations by reusing objects as much as possible and joining lines only once.
   - Avoid calling `strip()` unless absolutely necessary (likely only for code content).

3. **Variable Lookup:**
   - Minimize attribute lookups that are inside loops.

The function semantics are preserved exactly. All comments are retained, and improved where the code was changed, for better understanding.



### Rationale

- The main bottleneck is reading full files and splitting them when only a small region is needed. By slicing only the relevant lines from the file, the function becomes much faster for large files or high call counts.
- All behaviors, including fallback and hash calculation, are unchanged.
- Import of `islice` is local and lightweight.

**This should significantly improve both runtime and memory usage of `get_code_context_hash`.**
@codeflash-ai
Contributor

codeflash-ai bot commented Jun 5, 2025

⚡️ Codeflash found optimizations for this PR

📄 15% (0.15x) speedup for FunctionToOptimize.get_code_context_hash in codeflash/discovery/functions_to_optimize.py

⏱️ Runtime: 3.67 milliseconds → 3.20 milliseconds (best of 72 runs)

I created a new dependent PR with the suggested changes. Please review:

If you approve, it will be merged into this PR (branch dont-optimize-repeatedly-gh-actions).

@openhands-ai

openhands-ai bot commented Jun 5, 2025

Looks like there are a few issues preventing this PR from being merged!

  • GitHub Actions are failing:
    • Lint
    • Mypy Type Checking for CLI
    • end-to-end-test
    • end-to-end-test

If you'd like me to help, just leave a comment, like

@OpenHands please fix the failing actions on PR #275

Feel free to include any additional details that might help me get this PR into a better state.


@misrasaurabh1 misrasaurabh1 requested a review from a team June 8, 2025 08:36
  )
  for test_index, (test_path, test_perf_path) in enumerate(
-     zip(generated_test_paths, generated_perf_test_paths)
+     zip(generated_test_paths, generated_perf_test_paths, strict=False)
Contributor

Revert this for 3.9: the `strict` keyword argument to `zip()` was only added in Python 3.10.
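A minimal illustration of the compatibility concern (the `strict` keyword to `zip()` is a Python 3.10 addition from PEP 618; passing it on 3.9 fails at call time):

```python
import sys

# Works on every supported Python version.
pairs = list(zip([1, 2], ["a", "b"]))

if sys.version_info >= (3, 10):
    # strict=True makes zip raise ValueError on a length mismatch; on 3.9,
    # passing strict= at all raises TypeError, hence the revert request.
    pairs_checked = list(zip([1, 2], ["a", "b"], strict=True))
```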

@misrasaurabh1 misrasaurabh1 enabled auto-merge June 9, 2025 06:18
@misrasaurabh1 misrasaurabh1 merged commit a2e78e1 into main Jun 9, 2025
16 checks passed
