[LSP] validating python and git environment #670

mohammedahmed18 · 2025-08-19T19:40:23Z

PR Type

Enhancement, Bug fix

Description

Introduce LSP project validation feature
Extract is_valid_pyproject_toml for pyproject checks
Enhance git utils: staging, whitespace ignore, remote validation
Bypass function optimization under LSP context

Diagram Walkthrough

flowchart LR
  A["LSP client"] -- "validateProject request" --> B["validate_project"]
  B -- "invalid pyproject" --> C["error response"]
  B -- "valid config" --> D["process_pyproject_config"]
  D --> E["repo commit checks"]
  E -- "success" --> F["init optimizer args"]

File Walkthrough

Relevant files

Enhancement

cli.py `Skip optimize-all under LSP` codeflash/cli_cmds/cli.py import `is_LSP_enabled` helper early return when LSP is enabled disable optimize-all argument under LSP	+4/-0
cmd_init.py `Validate pyproject and check git remote` codeflash/cli_cmds/cmd_init.py add `is_valid_pyproject_toml` config validator refactor `should_modify_pyproject_toml` to use validator skip GitHub app install if remote missing	+23/-9
functions_to_optimize.py `Disable optimization check under LSP` codeflash/discovery/functions_to_optimize.py import `is_LSP_enabled` helper disable previous optimization check under LSP	+4/-0
beta.py `Add validateProject and refactor optimizer init` codeflash/lsp/beta.py add `validateProject` LSP feature for project checks refactor optimizer init to separate API key validation ensure tests directory exists in worktree update API key handlers to new initializer	+39/-4
server.py `Remove config processing in server init` codeflash/lsp/server.py remove `process_pyproject_config` from server init avoid config parsing during startup	+1/-2

Bug fix

git_utils.py `Fix worktree commit and whitespace apply` codeflash/code_utils/git_utils.py stage all changes before worktree commit apply patches with `--whitespace=nowarn`	+3/-2

github-actions · 2025-08-19T19:42:23Z

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 3 🔵🔵🔵⚪⚪
🧪 No relevant tests
🔒 No security concerns identified
⚡ Recommended focus areas for review Missing Function Call The condition uses `is_LSP_enabled` instead of calling `is_LSP_enabled()`, causing the branch to always be truthy and skipping the intended optimization logic. if is_LSP_enabled: return False Missing Config Processing `prepare_optimizer_arguments` no longer invokes `process_pyproject_config`, so `self.args` may lack resolved paths (module_root, tests_root) from the pyproject.toml, potentially breaking optimizer setup. from codeflash.cli_cmds.cli import parse_args args = parse_args() args.config_file = config_file args.no_pr = True # LSP server should not create PRs args.worktree = True self.args = args Commit Behavior Change Replacing `git commit -am` with separate `add` and `commit` calls may stage and commit untracked files now, altering the previous behavior of committing only tracked changes. repository.git.add(".") repository.git.commit("-m", commit_message, "--no-verify")

github-actions · 2025-08-19T19:43:50Z

PR Code Suggestions ✨

Explore these optional code suggestions:

Category	Suggestion	Impact
Possible issue	Invoke LSP check properly Invoke `is_LSP_enabled()` instead of checking the function object so the LSP mode is detected correctly. codeflash/discovery/functions_to_optimize.py [474-475] -if is_LSP_enabled: +if is_LSP_enabled(): return False Suggestion importance[1-10]: 9 __ Why: The code checks the function object `is_LSP_enabled` instead of calling it, so it always skips optimization; invoking `is_LSP_enabled()` fixes this critical logic error.	High
	Return config alongside confirmation Return both the confirmation result and the `config` so the function matches its declared `(bool, dict[str, Any] \| None)` return type. codeflash/cli_cmds/cmd_init.py [187-188] return Confirm.ask( "✅ A valid Codeflash config already exists in this project. Do you want to re-configure it?", +), config Suggestion importance[1-10]: 8 __ Why: The function signature declares returning a `(bool, dict)` but only a `bool` is returned, leading to inconsistent return types; adding `, config` corrects this functional bug.	Medium
	Import Any for annotations Add a missing import for `Any` so the type annotation does not raise a NameError at runtime. codeflash/cli_cmds/cmd_init.py [158] +from typing import Any + def is_valid_pyproject_toml(pyproject_toml_path: Path) -> dict[str, Any] \| None: Suggestion importance[1-10]: 4 __ Why: The new function uses `Any` in its return type but `Any` is not imported, causing a NameError; adding the import fixes this minor but necessary type annotation issue.	Low

…codeflash into vsc/environment-validation

Saga4 · 2025-08-20T23:08:53Z

codeflash/discovery/functions_to_optimize.py

        Tuple of (filtered_functions_dict, remaining_count)

    """
+    if is_LSP_enabled():


Q: could this raise questions on underterministic nature of optimizations if the function has is same.

Saga4 · 2025-08-20T23:18:35Z

codeflash/code_utils/git_utils.py

    uni_diff_text = repository.git.diff(None, "HEAD", ignore_blank_lines=True, ignore_space_at_eol=True)

+    # HACK: remove binary files from the diff, find a better way  # noqa: FIX004
+    uni_diff_text = re.sub(


nit: rather than regex we can us git diff exclude and list all the exclude file types. or we can alter the .gitignore or gitattributes file too.

or we can also go with --diff-filter=AMT

Also I am thinking either we can just change the file type to text rather than excluding them?

yeah exclude sounds like a good option

Also I am thinking either we can just change the file type to text rather than excluding them?

actually this shouldn't happen in the first place, git can't apply patches with binary diffs, so this would be the user mistake if he forgot to ignore them

…c/environment-validation`) The optimization applies **LRU caching to API calls** that were being repeatedly executed in loops, eliminating redundant network requests. **Key Changes:** - **Wrapped `is_github_app_installed_on_repo` with LRU cache**: Added `@lru_cache(maxsize=128)` to a new `_cached_is_github_app_installed_on_repo` function that handles the actual API request - **Cache key includes all parameters**: Caches based on `(owner, repo, suppress_errors)` tuple to ensure correct behavior across different call contexts **Why This Provides Massive Speedup:** - **Eliminates redundant API calls**: The original code made multiple identical API requests in the retry loop within `install_github_app` - each network request took ~100ms based on profiling data - **Network I/O is the bottleneck**: Line profiler shows 99%+ of execution time was spent in `make_cfapi_request` calls (10+ seconds total) - **Cache hits are microsecond-fast**: Subsequent calls with same parameters return cached results instead of making new HTTP requests **Test Case Performance:** - **Scenarios with repeated checks benefit most**: Tests involving retry loops see 8000000%+ speedups (from ~900ms to ~10μs) - **Single API call scenarios**: Still see significant gains (6-7% faster) due to reduced function call overhead - **Large-scale scenarios**: Tests with many remotes see dramatic improvements when the same repo is checked multiple times This optimization is particularly effective for CLI workflows where the same repository's app installation status is checked multiple times during user interaction flows.

codeflash-ai · 2025-08-21T11:33:02Z

⚡️ Codeflash found optimizations for this PR

📄 235,820% (2,358.20x) speedup for `install_github_app` in `codeflash/cli_cmds/cmd_init.py`

⏱️ Runtime : 10.0 seconds → 4.26 milliseconds (best of 148 runs)

I created a new dependent PR with the suggested changes. Please review:

⚡️ Speed up function install_github_app by 235,820% in PR #670 (vsc/environment-validation) #674

If you approve, it will be merged into this PR (branch vsc/environment-validation).

…ent-validation`) The optimization replaces the expensive repeated `import` statement with a cached import pattern using a global variable and helper function. **Key optimization:** - **Cached Import Pattern**: The original code executes `from codeflash.optimization.optimizer import Optimizer` on every function call (line showing 4.6% of total time in profiler). The optimized version introduces `_get_optimizer()` which imports and caches the `Optimizer` class only once in the global `_cached_optimizer` variable. **Why this works:** Python imports are not free - they involve module lookup, loading, and namespace operations. While Python's import system caches modules internally, the `from ... import ...` statement still has overhead for symbol resolution on each execution. By caching the imported class reference, we eliminate this repeated work. **Performance impact:** The line profiler shows the import line went from 4.6% of execution time (12.3ms) to 3.8% (11.0ms) in the optimized version, contributing to the overall 40% speedup. This optimization is particularly effective for: - **High-frequency calls**: The test results show consistent 36-43% improvements across all test cases - **Large-scale operations**: The 1000-iteration tests maintain the same 40% improvement, demonstrating the optimization scales well - **Any scenario where `check_api_key` is called repeatedly**: Since LSP servers typically handle many requests, this caching prevents redundant import overhead The optimization maintains identical functionality while reducing per-call overhead through one-time import caching.

codeflash-ai · 2025-08-21T11:43:24Z

⚡️ Codeflash found optimizations for this PR

📄 40% (0.40x) speedup for `check_api_key` in `codeflash/lsp/beta.py`

⏱️ Runtime : 6.60 milliseconds → 4.70 milliseconds (best of 59 runs)

I created a new dependent PR with the suggested changes. Please review:

⚡️ Speed up function check_api_key by 40% in PR #670 (vsc/environment-validation) #675

If you approve, it will be merged into this PR (branch vsc/environment-validation).

mohammedahmed18 added 2 commits August 19, 2025 22:02

validating python and git environment with the lsp

0cd4d5e

revert

27745fb

mohammedahmed18 requested review from KRRT7 and Saga4 August 19, 2025 19:40

github-actions bot added the Review effort 3/5 label Aug 19, 2025

mohammedahmed18 and others added 7 commits August 20, 2025 11:05

fix: processing args in lsp

d7d1722

Merge branch 'main' into vsc/environment-validation

d598283

typo

53660de

Merge branch 'vsc/environment-validation' of github.com:codeflash-ai/…

7d3de7c

…codeflash into vsc/environment-validation

lsp: handle when the optimized function is not found

4ad1e99

refactoring

0dcfc15

Merge branch 'main' into vsc/environment-validation

9b69e7d

KRRT7 previously approved these changes Aug 20, 2025

View reviewed changes

Saga4 reviewed Aug 20, 2025

View reviewed changes

native exclude for binary files in the temp diff

47a5100

mohammedahmed18 dismissed KRRT7’s stale review via 47a5100 August 21, 2025 11:13

codeflash-ai bot mentioned this pull request Aug 21, 2025

⚡️ Speed up function install_github_app by 235,820% in PR #670 (vsc/environment-validation) #674

Closed

codeflash-ai bot mentioned this pull request Aug 21, 2025

⚡️ Speed up function check_api_key by 40% in PR #670 (vsc/environment-validation) #675

Closed

comment

a0e2edb

Saga4 previously approved these changes Aug 21, 2025

View reviewed changes

mohammedahmed18 dismissed Saga4’s stale review via a0e2edb August 21, 2025 12:34

Saga4 approved these changes Aug 21, 2025

View reviewed changes

mohammedahmed18 merged commit aaafe47 into main Aug 21, 2025
19 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[LSP] validating python and git environment #670

[LSP] validating python and git environment #670

Uh oh!

mohammedahmed18 commented Aug 19, 2025 •

edited by github-actions bot

Loading

Uh oh!

github-actions bot commented Aug 19, 2025

Uh oh!

github-actions bot commented Aug 19, 2025

Uh oh!

Saga4 Aug 20, 2025

Uh oh!

Saga4 Aug 20, 2025

Uh oh!

Saga4 Aug 20, 2025

Uh oh!

Saga4 Aug 20, 2025

Uh oh!

mohammedahmed18 Aug 21, 2025

Uh oh!

codeflash-ai bot commented Aug 21, 2025

⚡️ Speed up function `install_github_app` by 235,820% in PR #670 (`vsc/environment-validation`) #674

Uh oh!

codeflash-ai bot commented Aug 21, 2025

⚡️ Speed up function `check_api_key` by 40% in PR #670 (`vsc/environment-validation`) #675

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

[LSP] validating python and git environment #670

[LSP] validating python and git environment #670

Uh oh!

Conversation

mohammedahmed18 commented Aug 19, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Type

Description

Diagram Walkthrough

File Walkthrough

Uh oh!

github-actions bot commented Aug 19, 2025

PR Reviewer Guide 🔍

Uh oh!

github-actions bot commented Aug 19, 2025

PR Code Suggestions ✨

Uh oh!

Saga4 Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

Saga4 Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

Saga4 Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

Saga4 Aug 20, 2025

Choose a reason for hiding this comment

Uh oh!

mohammedahmed18 Aug 21, 2025

Choose a reason for hiding this comment

Uh oh!

codeflash-ai bot commented Aug 21, 2025

⚡️ Codeflash found optimizations for this PR

📄 235,820% (2,358.20x) speedup for install_github_app in codeflash/cli_cmds/cmd_init.py

I created a new dependent PR with the suggested changes. Please review:

⚡️ Speed up function install_github_app by 235,820% in PR #670 (vsc/environment-validation) #674

Uh oh!

codeflash-ai bot commented Aug 21, 2025

⚡️ Codeflash found optimizations for this PR

📄 40% (0.40x) speedup for check_api_key in codeflash/lsp/beta.py

I created a new dependent PR with the suggested changes. Please review:

⚡️ Speed up function check_api_key by 40% in PR #670 (vsc/environment-validation) #675

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mohammedahmed18 commented Aug 19, 2025 •

edited by github-actions bot

Loading

📄 235,820% (2,358.20x) speedup for `install_github_app` in `codeflash/cli_cmds/cmd_init.py`

⚡️ Speed up function `install_github_app` by 235,820% in PR #670 (`vsc/environment-validation`) #674

📄 40% (0.40x) speedup for `check_api_key` in `codeflash/lsp/beta.py`

⚡️ Speed up function `check_api_key` by 40% in PR #670 (`vsc/environment-validation`) #675