Integrate harness.py functionality into codegen-on-oss package #14

codegen-sh · 2025-05-01T18:49:45Z

This PR integrates the functionality from src/codegen/extensions/swebench/harness.py into the codegen-on-oss package to enhance its codebase analysis and context saving capabilities.

Changes Made

1. Created New Modules

CodebaseAnalysisHarness (codegen_on_oss/analysis/harness_integration.py)
- Integrates the core functionality from harness.py
- Provides comprehensive codebase analysis
- Supports diff generation and file tracking
- Includes context saving and retrieval capabilities
CodebaseContextSnapshot (codegen_on_oss/snapshot/context_snapshot.py)
- Allows saving and restoring codebase state
- Integrates with S3-compatible storage via BucketStore
- Preserves analysis results and context
CodeContextRetrievalServer (codegen_on_oss/context_server/server.py)
- Implements a FastAPI server for accessing functionality
- Provides endpoints for analysis, context management, and agent execution
- Serves as the backend for code context retrieval

2. Updated Package Structure

Added new dependencies to pyproject.toml
Updated __init__.py files to expose the new modules
Added a new CLI command serve to start the CodeContextRetrievalServer
Created example scripts demonstrating usage

3. Documentation

Updated the README with information about the new features
Added docstrings to all new modules and functions
Created example scripts showing how to use the new functionality

Key Features

Comprehensive Codebase Analysis: Detailed metrics about files, classes, functions, and their relationships
Context Saving and Retrieval: Save and restore codebase state for later use
Agent Integration: Run AI agents with saved context for code analysis and modification
API Access: Access all functionality through a REST API

How to Use

You can now use the enhanced codegen-on-oss package as a backend CodeContextRetrievalServer by:

Starting the server:

cgparse serve --host 0.0.0.0 --port 8000

Making API requests to analyze codebases, save context, and run agents

Or using the modules directly in your code:

from codegen_on_oss.analysis import CodebaseAnalysisHarness
from codegen_on_oss.snapshot import CodebaseContextSnapshot

# Create a harness and analyze a codebase
harness = CodebaseAnalysisHarness.from_repo("owner/repo")
results = harness.analyze_codebase()

# Save the state for later
snapshot = CodebaseContextSnapshot(harness)
snapshot_id = snapshot.create_snapshot()

💻 View my work • About Codegen

Summary by Sourcery

Integrate harness.py functionality into the codegen-on-oss package, enhancing its codebase analysis and context management capabilities

New Features:

Comprehensive codebase analysis with CodebaseAnalysisHarness
Context saving and retrieval via CodebaseContextSnapshot
REST API server for accessing codebase analysis and context functionality

Enhancements:

Modular design for codebase analysis
Flexible snapshot management with local and remote storage
Improved agent integration and execution

Build:

Added new dependencies in pyproject.toml
Updated package structure with new modules

CI:

Added new CLI command 'serve' to start the CodeContextRetrievalServer

Documentation:

Updated README with new features and usage instructions
Added docstrings to new modules
Created example script demonstrating usage

Summary by CodeRabbit

New Features
- Introduced a REST API server for codebase analysis, context snapshot management, and agent execution, accessible via a new CLI command.
- Added advanced codebase analysis tools and snapshot functionality, supporting both local and remote storage.
- Provided a sample script to demonstrate repository analysis and snapshot creation.
Documentation
- Expanded and restructured the README into a comprehensive user guide with installation, usage examples, and architectural details.
Chores
- Added FastAPI, Pydantic, and Uvicorn as new dependencies.

Zeeeepa · 2025-05-01T18:49:48Z

@CodiumAI-Agent /review

korbit-ai · 2025-05-01T18:49:49Z

By default, I don't review pull requests opened by bots. If you would like me to review this pull request anyway, you can request a review via the /korbit-review command in a comment.

sourcery-ai · 2025-05-01T18:49:49Z

Reviewer's Guide

This pull request integrates harness.py functionality into the codegen-on-oss package by introducing new modules for codebase analysis (CodebaseAnalysisHarness), context snapshotting (CodebaseContextSnapshot with local/remote storage), and a FastAPI server (CodeContextRetrievalServer) to expose these features via a REST API and a new serve CLI command.

File-Level Changes

Change	Details	Files
Integrated harness logic into a new analysis module.	Created `CodebaseAnalysisHarness` class. Implemented methods for codebase analysis, diff generation, and agent execution.	`codegen-on-oss/codegen_on_oss/analysis/harness_integration.py` `codegen-on-oss/codegen_on_oss/analysis/__init__.py`
Implemented context snapshotting for saving/loading state.	Created `CodebaseContextSnapshot` class. Added methods for creating, loading, and saving snapshots locally and remotely (via `BucketStore`). Defined snapshot-related error classes.	`codegen-on-oss/codegen_on_oss/snapshot/context_snapshot.py` `codegen-on-oss/codegen_on_oss/snapshot/__init__.py`
Added a FastAPI server to expose functionality via a REST API.	Created `CodeContextRetrievalServer` using FastAPI. Defined Pydantic models for API requests/responses. Added API endpoints for analysis, snapshot management, and agent execution.	`codegen-on-oss/codegen_on_oss/context_server/server.py` `codegen-on-oss/codegen_on_oss/context_server/__init__.py`
Added a new CLI command to start the API server.	Added `serve` command using `click`. Included options for host, port, reload, and log level. Integrated `uvicorn` to run the FastAPI application.	`codegen-on-oss/codegen_on_oss/cli.py`
Updated package dependencies and structure.	Added `fastapi`, `pydantic`, and `uvicorn` to dependencies. Updated `__init__.py` files to expose new modules.	`codegen-on-oss/pyproject.toml` `codegen-on-oss/codegen_on_oss/analysis/__init__.py` `codegen-on-oss/codegen_on_oss/snapshot/__init__.py` `codegen-on-oss/codegen_on_oss/context_server/__init__.py`
Refactored existing analysis and snapshot related code.	Updated imports and type hints. Refactored `lint_for_dev_import_violations` in `pr_tasks.py`. Improved error handling (e.g., finding git executable). Minor code style changes (e.g., `dict()` to `{}`).	`codegen-on-oss/codegen_on_oss/analysis/codebase_context.py` `codegen-on-oss/codegen_on_oss/snapshot/pr_tasks.py` `codegen-on-oss/codegen_on_oss/snapshot/event_handlers.py` `codegen-on-oss/codegen_on_oss/analysis/analysis.py` `codegen-on-oss/codegen_on_oss/analysis/analysis_import.py` `codegen-on-oss/codegen_on_oss/snapshot/helpers.py` `codegen-on-oss/codegen_on_oss/analysis/document_functions.py` `codegen-on-oss/codegen_on_oss/analysis/symbolattr.py` `codegen-on-oss/codegen_on_oss/snapshot/pr_review.py` `codegen-on-oss/codegen_on_oss/analysis/module_dependencies.py`
Updated documentation and added examples.	Significantly restructured and updated README with new features, API details, and usage. Added a new example script demonstrating harness and snapshot usage. Made minor formatting changes to the analysis view mockup.	`codegen-on-oss/README.md` `codegen-on-oss/scripts/example_usage.py` `ANALYSIS_VIEW_MOCKUP.md`

Tips and commands

Interacting with Sourcery

Trigger a new review: Comment @sourcery-ai review on the pull request.
Continue discussions: Reply directly to Sourcery's review comments.
Generate a GitHub issue from a review comment: Ask Sourcery to create an
issue from a review comment by replying to it. You can also reply to a
review comment with @sourcery-ai issue to create an issue from it.
Generate a pull request title: Write @sourcery-ai anywhere in the pull
request title to generate a title at any time. You can also comment
@sourcery-ai title on the pull request to (re-)generate the title at any time.
Generate a pull request summary: Write @sourcery-ai summary anywhere in
the pull request body to generate a PR summary at any time exactly where you
want it. You can also comment @sourcery-ai summary on the pull request to
(re-)generate the summary at any time.
Generate reviewer's guide: Comment @sourcery-ai guide on the pull
request to (re-)generate the reviewer's guide at any time.
Resolve all Sourcery comments: Comment @sourcery-ai resolve on the
pull request to resolve all Sourcery comments. Useful if you've already
addressed all the comments and don't want to see them anymore.
Dismiss all Sourcery reviews: Comment @sourcery-ai dismiss on the pull
request to dismiss all existing Sourcery reviews. Especially useful if you
want to start fresh with a new review - don't forget to comment
@sourcery-ai review to trigger a new review!

Customizing Your Experience

Access your dashboard to:

Enable or disable review features such as the Sourcery-generated pull request
summary, the reviewer's guide, and others.
Change the review language.
Add, remove or edit custom review instructions.
Adjust other review settings.

Getting Help

Contact our support team for questions or feedback.
Visit our documentation for detailed guides and information.
Keep in touch with the Sourcery team by following us on X/Twitter, LinkedIn or GitHub.

Zeeeepa · 2025-05-01T18:49:51Z

@sourcery-ai review

coderabbitai · 2025-05-01T18:49:52Z

Walkthrough

This update introduces a comprehensive codebase analysis and context management system for the Codegen on OSS package. It adds new modules for codebase analysis, snapshot management, and a FastAPI-based REST API server, along with corresponding CLI and example usage scripts. The README is rewritten to serve as a detailed user guide, and new dependencies for FastAPI, Pydantic, and Uvicorn are declared. The core functionalities now include repository analysis, agent execution, snapshot creation and retrieval, and RESTful API endpoints for these operations, with local and remote (bucket store) support for snapshot storage.

Changes

File(s)	Change Summary
`codegen-on-oss/README.md`	Rewritten and expanded README: detailed overview, installation, usage, architectural insights, and removal of old log/metrics examples.
`codegen-on-oss/codegen_on_oss/analysis/__init__.py` `codegen-on-oss/codegen_on_oss/analysis/harness_integration.py`	New analysis package: module docstring, exports `CodebaseAnalysisHarness` via `__all__`; `harness_integration.py` defines `CodebaseAnalysisHarness` class for codebase analysis, diffing, agent runs, and result saving.
`codegen-on-oss/codegen_on_oss/cli.py`	Adds new `serve` CLI command to launch REST API server with host, port, reload, and log-level options.
`codegen-on-oss/codegen_on_oss/context_server/__init__.py` `codegen-on-oss/codegen_on_oss/context_server/server.py`	New context server package: exports FastAPI `app` and `create_app`. Implements REST API for repository analysis, snapshot management, agent execution, and snapshot listing/loading, with Pydantic models and error handling.
`codegen-on-oss/codegen_on_oss/snapshot/__init__.py` `codegen-on-oss/codegen_on_oss/snapshot/context_snapshot.py`	New snapshot package: exports `CodebaseContextSnapshot`. Implements snapshot creation, saving, loading, and integration with local and remote storage, with logging and error handling.
`codegen-on-oss/pyproject.toml`	Adds dependencies: `fastapi`, `pydantic`, `uvicorn`.
`codegen-on-oss/scripts/example_usage.py`	New example script: demonstrates repository analysis and snapshot creation via CLI, using `CodebaseAnalysisHarness` and `CodebaseContextSnapshot`.

Sequence Diagram(s)

sequenceDiagram
    participant User
    participant CLI
    participant FastAPI_Server
    participant Harness
    participant Snapshot

    User->>CLI: Run "serve" or example script
    CLI->>FastAPI_Server: Start server (if "serve")
    User->>FastAPI_Server: POST /analyze/repository (repo info)
    FastAPI_Server->>Harness: Create & analyze codebase
    Harness-->>FastAPI_Server: Analysis results
    FastAPI_Server-->>User: Return analysis results

    User->>FastAPI_Server: POST /snapshot/create (harness key)
    FastAPI_Server->>Snapshot: Create snapshot from harness
    Snapshot-->>FastAPI_Server: Snapshot ID
    FastAPI_Server-->>User: Return snapshot ID

    User->>FastAPI_Server: POST /agent/run (prompt, harness key)
    FastAPI_Server->>Harness: Run agent on codebase
    Harness-->>FastAPI_Server: Agent result, diff, edited files
    FastAPI_Server-->>User: Return agent run result

Poem

🐇
A harness for code, a snapshot to keep,
FastAPI wakes from its restful sleep.
Analyze, serve, and let agents run,
Snapshots and diffs—oh, what fun!
With new docs and scripts, dependencies in tow,
Codegen on OSS is ready to go!
— A rabbit with a server to show

✨ Finishing Touches

📝 Generate Docstrings

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Generate unit testing code for this file.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
- @coderabbitai generate unit testing code for this file.
- @coderabbitai modularize this function.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read src/utils.ts and generate unit testing code.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
- @coderabbitai help me debug CodeRabbit configuration file.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

@coderabbitai pause to pause the reviews on a PR.
@coderabbitai resume to resume the paused reviews.
@coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
@coderabbitai full review to do a full review from scratch and review all the files again.
@coderabbitai summary to regenerate the summary of the PR.
@coderabbitai generate docstrings to generate docstrings for this PR.
@coderabbitai generate sequence diagram to generate a sequence diagram of the changes in this PR.
@coderabbitai resolve resolve all the CodeRabbit review comments.
@coderabbitai configuration to show the current CodeRabbit configuration for the repository.
@coderabbitai help to get help.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

Zeeeepa · 2025-05-01T18:49:54Z

@codecov-ai-reviewer review

Zeeeepa · 2025-05-01T18:49:57Z

@coderabbitai review

codecov-ai · 2025-05-01T18:49:58Z

On it! We are reviewing the PR and will provide feedback shortly.

Zeeeepa · 2025-05-01T18:50:00Z

/gemini review

Hey! Sure - on it... 🧐

💻 View my work

codegen-sh · 2025-05-01T18:50:02Z

Hey! 👋 I see one of the checks failed. I am on it! 🫡

💻 View my work

coderabbitai · 2025-05-01T18:50:02Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

Zeeeepa · 2025-05-01T18:50:03Z

/review

Zeeeepa · 2025-05-01T18:50:06Z

/improve

Zeeeepa · 2025-05-01T18:50:09Z

/korbit-review

Zeeeepa · 2025-05-01T18:50:15Z

@codegen Implement and upgrade this PR with above Considerations and suggestions from @CodiumAI-Agent, @sourcery-ai, @codecov-ai-reviewer, @coderabbitai, /gemini, /korbit-review

gemini-code-assist

Code Review

This PR introduces significant enhancements to the codegen-on-oss package, integrating codebase analysis, context management, and agent execution capabilities. The changes are well-structured and documented, providing a solid foundation for future development. However, there are a few areas that could benefit from further refinement to ensure robustness and maintainability.

Summary of Findings

Error Handling in API Endpoints: The API endpoints in context_server/server.py use a generic except Exception as e block. While this catches all errors, it can mask specific issues and make debugging more difficult. Consider using more specific exception handling to provide better error messages and improve the server's reliability.
Snapshot Storage Consistency: The CodebaseContextSnapshot class attempts to load snapshots from both remote (BucketStore) and local storage. If a snapshot exists in both locations, the code prioritizes loading from the BucketStore. Ensure that the snapshot data is consistent across both storage locations to prevent unexpected behavior.
File Extension Handling: The _get_file_statistics method in harness_integration.py calculates file statistics based on file extensions. Ensure that this method handles edge cases, such as files without extensions or files with multiple extensions, to provide accurate file statistics.

Merge Readiness

The pull request introduces valuable new features and is generally well-implemented. However, addressing the error handling and snapshot storage consistency issues is crucial for ensuring the stability and reliability of the codebase. I recommend addressing the high severity issues before merging. I am unable to directly approve this pull request, and users should have others review and approve this code before merging.

gemini-code-assist · 2025-05-01T18:50:17Z

codegen-on-oss/codegen_on_oss/context_server/server.py

+    except Exception as e:
+        logger.error(f"Repository analysis failed: {e}")
+        raise HTTPException(status_code=500, detail=str(e))


Using a broad except Exception as e can mask underlying issues. It's better to catch specific exceptions to handle them appropriately and provide more informative error messages. For example, you could catch RepoNotFoundError, AnalysisError, etc.

What do you think about refining this to catch specific exceptions?

except RepoNotFoundError as e: logger.error(f"Repository not found: {e}") raise HTTPException(status_code=404, detail=str(e)) except AnalysisError as e: logger.error(f"Repository analysis failed: {e}") raise HTTPException(status_code=500, detail=str(e)) except Exception as e: logger.error(f"Repository analysis failed: {e}") raise HTTPException(status_code=500, detail=str(e))

gemini-code-assist · 2025-05-01T18:50:17Z

codegen-on-oss/codegen_on_oss/context_server/server.py

+                    logger.warning(f"Failed to load snapshot {key}: {e}")
+        except Exception as e:
+            logger.warning(f"Failed to list snapshots from bucket store: {e}")


Similar to the previous comment, using a broad except Exception as e can hide potential issues. Consider catching specific exceptions like BucketStoreError, SnapshotNotFoundError, etc., to provide better error handling and logging.

What do you think about refining this to catch specific exceptions?

except BucketStoreError as e: logger.warning(f"Failed to list snapshots from bucket store: {e}") except Exception as e: logger.warning(f"Failed to list snapshots from bucket store: {e}")

gemini-code-assist · 2025-05-01T18:50:17Z

codegen-on-oss/codegen_on_oss/analysis/harness_integration.py

+            ext = Path(file.path).suffix
+            if ext in file_extensions:
+                file_extensions[ext] += 1
+            else:
+                file_extensions[ext] = 1


Consider handling cases where a file might not have an extension. Currently, Path(file.path).suffix will return an empty string if there is no extension, which is fine, but it might be useful to log these instances, or handle them differently.

What do you think about adding a check for an empty extension, and perhaps logging it, or assigning a default extension like .unknown?

Suggested change

ext = Path(file.path).suffix

if ext in file_extensions:

file_extensions[ext] += 1

else:

file_extensions[ext] = 1

ext = Path(file.path).suffix

if not ext:

logger.warning(f"File {file.path} has no extension.")

ext = ".unknown" # Or handle it as you see fit

if ext in file_extensions:

gemini-code-assist · 2025-05-01T18:50:17Z

codegen-on-oss/codegen_on_oss/snapshot/context_snapshot.py

+        if self.bucket_store:
+            try:
+                self.snapshot_data = self._load_remote(snapshot_id)
+                logger.info(f"Loaded snapshot {snapshot_id} from remote storage")
+                return self.snapshot_data
+            except Exception as e:
+                logger.warning(f"Failed to load snapshot from remote: {e}")
+


If a snapshot exists in both the bucket store and local storage, the bucket store version will always be loaded. This might be fine, but it's worth considering whether you want to add a check to see if the local version is newer, or give the user an option to prefer local snapshots.

What do you think about adding a check for a newer local snapshot, or a user preference?

if self.bucket_store: try: self.snapshot_data = self._load_remote(snapshot_id) logger.info(f"Loaded snapshot {snapshot_id} from remote storage") return self.snapshot_data except Exception as e: logger.warning(f"Failed to load snapshot from remote: {e}") # Check if local snapshot is newer try: local_data = self._load_local(snapshot_id) if self.bucket_store and Path(f"./snapshots/snapshot_{snapshot_id}.json").stat().st_mtime > datetime.fromisoformat(self.snapshot_data['created_at']).timestamp(): logger.info(f"Local snapshot {snapshot_id} is newer, loading from local storage") return local_data except: pass

coderabbitai · 2025-05-01T18:50:22Z

You are on the CodeRabbit Free Plan. In order to use the Chat feature, please upgrade to CodeRabbit Pro.

codiumai-pr-agent-free · 2025-05-01T18:50:22Z

PR Reviewer Guide 🔍

(Review updated until commit `18a0737`)

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 4 🔵🔵🔵🔵⚪
🧪 No relevant tests
🔒 Security concerns Sensitive information exposure: In `codegen-on-oss/codegen_on_oss/context_server/server.py`, the server exposes repository analysis results through API endpoints without authentication. This could potentially expose sensitive code information or analysis results to unauthorized users. The server should implement proper authentication and authorization mechanisms to protect this data.
⚡ Recommended focus areas for review Error Handling The run_agent method catches exceptions but then immediately re-raises them, making the try-except block ineffective. Consider either adding meaningful error handling or removing the try-except block. try: result = agent.run(prompt=prompt) # Get the diff between the current state and the original commit model_patch = self.get_diff(base=self.base_commit) edited_files = self.files_in_patch(model_patch) return { "result": result, "model_patch": model_patch, "edited_files": edited_files, } except Exception as agent_error: logger.error(f"Agent run failed with error: {agent_error}") raise Resource Management The server stores active harnesses in a global dictionary without any cleanup mechanism. This could lead to memory leaks if many repositories are analyzed but never removed from memory. active_harnesses: dict[str, CodebaseAnalysisHarness] = {} bucket_store: BucketStore \| None = None Redundant Method The _save_remote method simply calls save_to_remote, adding an unnecessary layer of indirection that could be confusing for maintainers. def _save_remote(self) -> str: """ Save the snapshot to remote storage. Returns: The key where the snapshot was saved. Raises: NoBucketStoreError: If no bucket store is configured. """ return self.save_to_remote()

qodo-code-review · 2025-05-01T18:50:22Z

Qodo Merge was enabled for this repository. To continue using it, please link your Git account with your Qodo account here.

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 4 🔵🔵🔵🔵⚪
🧪 No relevant tests
🔒 Security concerns Sensitive information exposure: The server implementation in `context_server/server.py` doesn't validate or sanitize repository URLs or commit hashes before processing them, which could potentially lead to command injection if these values are passed directly to git commands. Additionally, there's no authentication mechanism for the API endpoints, which could allow unauthorized access to codebase analysis and agent execution functionality.
⚡ Recommended focus areas for review Error Handling The server implementation has basic error handling but could benefit from more specific exception types and better error messages, especially in the snapshot loading and agent execution paths. try: snapshot = CodebaseContextSnapshot(snapshot_id=snapshot_id, bucket_store=bucket_store) data = snapshot.load_snapshot() return data except Exception as e: logger.error(f"Failed to load snapshot {snapshot_id}: {e}") raise HTTPException(status_code=404, detail=f"Snapshot {snapshot_id} not found") Exception Handling The run_agent method catches all exceptions generically, which might mask specific errors. Consider catching more specific exceptions and providing more detailed error information. except Exception as agent_error: logger.error(f"Agent run failed with error: {agent_error}") raise Race Condition The create_snapshot method doesn't handle potential race conditions when multiple instances try to create snapshots with the same ID simultaneously, especially with remote storage. def create_snapshot(self, local_path: Optional[Union[str, Path]] = None) -> str: """ Create a snapshot of the current codebase state and analysis results. Args: local_path: Optional local path to save the snapshot Returns: The snapshot ID """ if not self.harness: raise ValueError("No harness provided for snapshot creation") # Ensure we have analysis results if not self.harness.analysis_results: logger.info("No analysis results found, running analysis...") self.harness.analyze_codebase() # Create snapshot data self.snapshot_data = { "snapshot_id": self.snapshot_id, "created_at": datetime.now().isoformat(), "repo_info": { "repo_name": self.harness.codebase.repo_name, "commit": self.harness.base_commit, }, "analysis_results": self.harness.analysis_results, } # Save locally if path provided if local_path: self._save_local(Path(local_path)) # Save to bucket store if available if self.bucket_store: self._save_remote() logger.info(f"Created snapshot with ID: {self.snapshot_id}") return self.snapshot_id

gemini-code-assist · 2025-05-01T18:50:26Z

It looks like you've requested a lot of reviews from different bots! I'm just chiming in to say that I'm here to help if you have any specific questions about my previous review or need further assistance with the code. Let me know how I can be of service!

codegen-sh · 2025-05-01T18:50:46Z

I'll review PR #14 "Integrate harness.py functionality into codegen-on-oss package" right away! I'll examine the changes and provide feedback shortly.

💻 View my work

qodo-code-review · 2025-05-01T18:50:50Z

PR Code Suggestions ✨

Latest suggestions up to 18a0737

Category	Suggestion	Impact
Possible issue	Add data validation The code doesn't handle the case where a key might not contain valid snapshot data. Add validation to check if required fields exist in the data before creating a SnapshotInfo object. codegen-on-oss/codegen_on_oss/context_server/server.py [166-181] # List snapshots from bucket store if available if bucket_store: try: keys = bucket_store.list_keys(prefix="snapshots/") for key in keys: try: data = bucket_store.get_json(key) - snapshots.append( - SnapshotInfo( - snapshot_id=data["snapshot_id"], - created_at=data["created_at"], - repo_info=data["repo_info"], + if all(field in data for field in ["snapshot_id", "created_at", "repo_info"]): + snapshots.append( + SnapshotInfo( + snapshot_id=data["snapshot_id"], + created_at=data["created_at"], + repo_info=data["repo_info"], + ) ) - ) + else: + logger.warning(f"Snapshot {key} missing required fields") except Exception as e: logger.warning(f"Failed to load snapshot {key}: {e}") Apply this suggestion Suggestion importance[1-10]: 7 __ Why: The suggestion correctly points out that the code lacks validation for the structure of the JSON data loaded from the bucket store. Adding a check for required keys (`snapshot_id`, `created_at`, `repo_info`) before creating the `SnapshotInfo` object prevents potential `KeyError` exceptions and improves the robustness of the snapshot loading process.	Medium
	Add error handling The `get_changed_files` method might fail if the commit SHAs are invalid or if there are network issues. Add error handling to gracefully handle potential exceptions when retrieving changed files. codegen-on-oss/codegen_on_oss/snapshot/pr_tasks.py [48-52] # Get the files changed in the PR -changed_files = codebase.get_changed_files( - base_commit=event.pull_request.base.sha, - head_commit=event.pull_request.head.sha, -) +try: + changed_files = codebase.get_changed_files( + base_commit=event.pull_request.base.sha, + head_commit=event.pull_request.head.sha, + ) +except Exception as e: + logger.error(f"Failed to get changed files: {e}") + return { + "status": "error", + "message": f"Failed to retrieve changed files: {str(e)}", + } Apply this suggestion Suggestion importance[1-10]: 7 __ Why: The suggestion correctly identifies that `codebase.get_changed_files` might fail and proposes adding a `try-except` block. This improves the robustness of the function by handling potential errors during file retrieval.	Medium
	Handle repository access failures The harness creation might fail if the repository doesn't exist or is inaccessible. Add error handling to catch potential exceptions during repository analysis initialization. codegen-on-oss/scripts/example_usage.py [27-31] # Create the harness -harness = CodebaseAnalysisHarness.from_repo( - repo_full_name=repo_name, - commit=commit, -) +try: + harness = CodebaseAnalysisHarness.from_repo( + repo_full_name=repo_name, + commit=commit, + ) +except Exception as e: + logger.error(f"Failed to create analysis harness for {repo_name}: {e}") + raise RuntimeError(f"Repository analysis failed: {str(e)}") Apply this suggestion Suggestion importance[1-10]: 7 __ Why: The suggestion correctly points out that creating the `CodebaseAnalysisHarness` via `from_repo` could fail due to repository access issues. Adding error handling improves the script's reliability.	Medium
	Handle JSON serialization errors The JSON serialization might fail if the results contain objects that aren't JSON serializable. Add error handling to catch potential serialization exceptions when saving analysis results. codegen-on-oss/scripts/example_usage.py [36-41] # Save the results output_dir.mkdir(parents=True, exist_ok=True) output_path = output_dir / f"{repo_name.replace('/', '_')}_analysis.json" -with open(output_path, "w") as f: - json.dump(results, f, indent=2) +try: + with open(output_path, "w") as f: + json.dump(results, f, indent=2) +except TypeError as e: + logger.error(f"Failed to serialize analysis results: {e}") + # Save a simplified version or handle the error appropriately + with open(output_path, "w") as f: + json.dump({"error": "Failed to serialize full results", "message": str(e)}, f) Apply this suggestion Suggestion importance[1-10]: 7 __ Why: The suggestion correctly identifies a potential `TypeError` if the `results` object contains non-JSON-serializable data. Adding a `try-except` block for `json.dump` makes the result saving process more robust.	Medium
	Fix type union syntax The Union operator `\|` for types was introduced in Python 3.10. For better compatibility with older Python versions, use `Union` from the typing module instead. codegen-on-oss/codegen_on_oss/analysis/analysis.py [142-143] -elif isinstance(statement, ForLoopStatement \| WhileStatement): +elif isinstance(statement, (ForLoopStatement, WhileStatement)): complexity += 1 Apply this suggestion Suggestion importance[1-10]: 4 __ Why: The suggestion correctly identifies that the `\|` operator for type unions is a Python 3.10+ feature. Replacing it with a tuple `(ForLoopStatement, WhileStatement)` ensures compatibility with older Python versions. This is a minor improvement related to code style and compatibility.	Low
More

Previous suggestions

Suggestions up to commit 18a0737

Category	Suggestion	Impact
Possible issue	Add error handling Add error handling around the harness creation to gracefully handle repository access or parsing failures. This prevents the script from crashing unexpectedly when analyzing repositories. codegen-on-oss/scripts/example_usage.py [27-31] # Create the harness -harness = CodebaseAnalysisHarness.from_repo( - repo_full_name=repo_name, - commit=commit, -) +try: + harness = CodebaseAnalysisHarness.from_repo( + repo_full_name=repo_name, + commit=commit, + ) +except Exception as e: + logger.error(f"Failed to create harness for {repo_name}: {e}") + return None, None Suggestion importance[1-10]: 7 __ Why: The suggestion correctly identifies that `CodebaseAnalysisHarness.from_repo` could raise exceptions (e.g., invalid repository, network issues) and proposes adding a `try...except` block. This improves the robustness of the example script by handling potential errors gracefully.	Medium
	Fix patch parsing The patch parsing logic is incomplete. It only captures file paths from the header lines but doesn't handle cases where the path doesn't follow the standard "a/" or "b/" prefix pattern, which can happen in some git diff formats. codegen-on-oss/codegen_on_oss/analysis/harness_integration.py [138-154] def files_in_patch(self, patch: str) -> list[str]: """ Extract the list of modified files from a unified diff patch string. Args: patch: A unified diff patch string Returns: A list of modified file paths """ files: set[str] = set() for line in patch.split("\n"): - if line.startswith("--- a/") or line.startswith("+++ b/"): - fname = line.split("/", 1)[1] - files.add(fname) + if line.startswith("--- ") or line.startswith("+++ "): + path = line[4:] + # Handle standard a/ and b/ prefixes + if path.startswith("a/") or path.startswith("b/"): + path = path[2:] + # Skip /dev/null which appears in new file diffs + if path != "/dev/null": + files.add(path) return list(files) Suggestion importance[1-10]: 6 __ Why: The suggestion improves the robustness of the `files_in_patch` method by handling diff lines that might not strictly follow the `--- a/` or `+++ b/` format and correctly ignoring `/dev/null`. This makes the parsing logic more resilient to variations in diff output.	Low
	Fix Union type syntax The Union type syntax `ForLoopStatement \| WhileStatement` requires Python 3.10+. For better compatibility, use the traditional Union syntax from the typing module. codegen-on-oss/codegen_on_oss/analysis/analysis.py [142-143] -elif isinstance(statement, ForLoopStatement \| WhileStatement): +elif isinstance(statement, (ForLoopStatement, WhileStatement)): complexity += 1 Suggestion importance[1-10]: 4 __ Why: The suggestion correctly points out that the `\|` syntax for union types requires Python 3.10+. However, the PR explicitly changed the code to use this syntax, replacing the older tuple-based approach. Reverting this change might contradict the PR's intent if the project targets Python 3.10+.	Low
General	Improve error handling The exception handling is too broad. If a snapshot is not found, a 404 is appropriate, but other exceptions should return a 500 status code to properly indicate server errors. codegen-on-oss/codegen_on_oss/context_server/server.py [207-223] @app.get("/snapshot/load/{snapshot_id}", response_model=dict[str, Any]) async def load_snapshot(snapshot_id: str): """ Load a snapshot by ID and return its data. """ try: snapshot = CodebaseContextSnapshot( snapshot_id=snapshot_id, bucket_store=bucket_store ) data = snapshot.load_snapshot() - except Exception as e: + except SnapshotNotFoundError as e: logger.error(f"Failed to load snapshot {snapshot_id}: {e}") raise HTTPException( status_code=404, detail=f"Snapshot {snapshot_id} not found" ) from e + except Exception as e: + logger.error(f"Error loading snapshot {snapshot_id}: {e}") + raise HTTPException( + status_code=500, detail=f"Server error loading snapshot" + ) from e else: return data Suggestion importance[1-10]: 7 __ Why: The suggestion correctly identifies that catching a generic `Exception` and raising a 404 is inappropriate for errors other than 'Not Found'. Separating the handling for `SnapshotNotFoundError` (404) and other `Exception` types (500) improves the API's error reporting accuracy.	Medium
General	Use logger instead of print Replace the print statement with proper logging using the logger that's already defined at the top of the file. This ensures consistent error handling and log formatting throughout the application. codegen-on-oss/codegen_on_oss/snapshot/pr_tasks.py [54-65] # Check each changed file for violations for file_path in changed_files: # Skip non-JS/TS files if not file_path.endswith((".js", ".jsx", ".ts", ".tsx")): continue try: file_content = codebase.get_file_content(file_path) file_violations = _check_for_dev_imports(file_path, file_content) violations.extend([(file_path, *v) for v in file_violations]) except Exception as e: - print(f"Error checking file {file_path}: {e}") + logger.error(f"Error checking file {file_path}: {e}") Suggestion importance[1-10]: 5 __ Why: The suggestion correctly identifies a `print` statement used for error reporting (line 65) and suggests replacing it with the existing `logger`. This improves logging consistency and adheres to best practices within the application.	Low

Suggestions up to commit dd9b5d0

Category	Suggestion	Impact
Possible issue	Add error handling The `get_changed_files` method might fail if the commits don't exist or are inaccessible. Add error handling to gracefully handle potential Git errors when retrieving changed files. codegen-on-oss/codegen_on_oss/snapshot/pr_tasks.py [48-52] # Get the files changed in the PR -changed_files = codebase.get_changed_files( - base_commit=event.pull_request.base.sha, - head_commit=event.pull_request.head.sha, -) +try: + changed_files = codebase.get_changed_files( + base_commit=event.pull_request.base.sha, + head_commit=event.pull_request.head.sha, + ) +except Exception as e: + logger.error(f"Failed to get changed files: {e}") + return { + "status": "error", + "message": f"Failed to get changed files: {str(e)}", + } Suggestion importance[1-10]: 8 __ Why: The suggestion correctly identifies that `codebase.get_changed_files` could fail due to issues accessing commits or Git errors. Adding a `try...except` block enhances robustness by handling these potential failures gracefully.	Medium
Possible issue	Handle API failures The GitHub API call to create a PR comment could fail due to rate limits or permission issues. Add error handling to catch potential API exceptions when creating the PR comment. codegen-on-oss/codegen_on_oss/snapshot/pr_tasks.py [77-81] # Add the comment to the PR -codebase.github_client.create_pr_comment( - pr_number=event.pull_request.number, - body=comment, -) +try: + codebase.github_client.create_pr_comment( + pr_number=event.pull_request.number, + body=comment, + ) +except Exception as e: + logger.error(f"Failed to create PR comment: {e}") + # Still return violations even if comment creation fails + return { + "status": "failure", + "message": "Development imports found in production code (comment creation failed)", + "violations": violations, + } Suggestion importance[1-10]: 8 __ Why: The suggestion correctly identifies that the external API call `codebase.github_client.create_pr_comment` might fail. Adding error handling improves reliability by logging the failure and ensuring the primary function result (violations found) is still returned.	Medium
General	Fix type checking syntax The syntax `isinstance(statement, ForLoopStatement \| WhileStatement)` is only valid in Python 3.10+. For better compatibility, use a tuple of types instead: `isinstance(statement, (ForLoopStatement, WhileStatement))`. codegen-on-oss/codegen_on_oss/analysis/analysis.py [142-143] -elif isinstance(statement, ForLoopStatement \| WhileStatement): +elif isinstance(statement, (ForLoopStatement, WhileStatement)): complexity += 1 Suggestion importance[1-10]: 4 __ Why: The suggestion correctly notes that `isinstance(..., A \| B)` requires Python 3.10+ and suggests using a tuple `(A, B)` for backward compatibility. However, the PR explicitly changed the code from the tuple syntax to the `\|` syntax, indicating an intentional adoption of the newer style. While the suggestion is technically correct regarding compatibility, it contradicts the change made in the PR.	Low

Suggestions up to commit dd9b5d0

Category	Suggestion	Impact
Possible issue	Fix class method implementation The `load_from_remote` classmethod is trying to access `cls.bucket_store` and `cls._load_remote`, but these are instance methods/attributes, not class methods/attributes. This will cause an AttributeError when called. You need to create a new instance first. codegen-on-oss/codegen_on_oss/snapshot/context_snapshot.py [208-226] @classmethod -def load_from_remote(cls, snapshot_id: str) -> "CodebaseContextSnapshot": +def load_from_remote(cls, snapshot_id: str, bucket_store: BucketStore) -> "CodebaseContextSnapshot": """ Load a snapshot from remote storage. Args: snapshot_id: The ID of the snapshot to load. + bucket_store: BucketStore instance for remote storage access. Returns: A CodebaseContextSnapshot instance. Raises: NoBucketStoreError: If no bucket store is configured. ValueError: If the snapshot could not be loaded. """ - if not cls.bucket_store: - raise NoBucketStoreError() + instance = cls(snapshot_id=snapshot_id, bucket_store=bucket_store) + instance.load_snapshot() + return instance - return cls._load_remote(snapshot_id) - Suggestion importance[1-10]: 9 __ Why: The suggestion correctly identifies that the class method `load_from_remote` attempts to access instance attributes (`bucket_store`) and methods (`_load_remote`) via `cls`, which would cause an `AttributeError`. The proposed fix is valid and necessary for the method's functionality.	High
	Add error handling The function assumes all file paths in the diff start with "--- a/" or "+++ b/", but this isn't always true. Some diffs might use different formats or have files without the "a/" or "b/" prefix. Add error handling to avoid IndexError when splitting. codegen-on-oss/codegen_on_oss/analysis/harness_integration.py [138-154] def files_in_patch(self, patch: str) -> list[str]: """ Extract the list of modified files from a unified diff patch string. Args: patch: A unified diff patch string Returns: A list of modified file paths """ files: set[str] = set() for line in patch.split("\n"): if line.startswith("--- a/") or line.startswith("+++ b/"): - fname = line.split("/", 1)[1] - files.add(fname) + try: + fname = line.split("/", 1)[1] + files.add(fname) + except IndexError: + # Skip lines that don't match the expected format + continue return list(files) Suggestion importance[1-10]: 7 __ Why: The suggestion correctly points out that the `files_in_patch` function might raise an `IndexError` if a diff line starting with `--- a/` or `+++ b/` doesn't contain a `/` character after the prefix. Adding a `try-except` block improves the robustness of the function against potentially malformed diff input.	Medium
	Add error handling The code doesn't handle potential exceptions when creating the harness. If the repository doesn't exist or is inaccessible, the function will crash without providing useful error information. codegen-on-oss/scripts/example_usage.py [27-31] # Create the harness -harness = CodebaseAnalysisHarness.from_repo( - repo_full_name=repo_name, - commit=commit, -) +try: + harness = CodebaseAnalysisHarness.from_repo( + repo_full_name=repo_name, + commit=commit, + ) +except Exception as e: + logger.error(f"Failed to create harness for {repo_name}: {e}") + raise Suggestion importance[1-10]: 7 __ Why: The suggestion correctly identifies that creating the `CodebaseAnalysisHarness` might fail and proposes adding a `try...except` block. This improves the script's robustness by handling potential errors during harness initialization and logging them appropriately.	Medium
General	Use logger consistently The error handling uses `print()` instead of the logger that was set up at the top of the file. This inconsistency could lead to lost error messages in production environments where stdout isn't monitored. codegen-on-oss/codegen_on_oss/snapshot/pr_tasks.py [54-65] # Check each changed file for violations for file_path in changed_files: # Skip non-JS/TS files if not file_path.endswith((".js", ".jsx", ".ts", ".tsx")): continue try: file_content = codebase.get_file_content(file_path) file_violations = _check_for_dev_imports(file_path, file_content) violations.extend([(file_path, *v) for v in file_violations]) except Exception as e: - print(f"Error checking file {file_path}: {e}") + logger.error(f"Error checking file {file_path}: {e}") Suggestion importance[1-10]: 5 __ Why: The suggestion correctly points out the inconsistent use of `print()` for error handling instead of the configured `logger`. Using the `logger` consistently improves log management and adherence to standard logging practices.	Low

Suggestions up to commit 66cd331

Category	Suggestion	Impact
Security	Sanitize error messages The exception handling is too generic and exposes potentially sensitive error details to API clients. Replace the generic exception handling with a more secure approach that logs the full error internally but returns a sanitized error message to clients. codegen-on-oss/codegen_on_oss/context_server/server.py [213-240] @app.post("/agent/run", response_model=Dict[str, Any]) async def run_agent( request: AgentRunRequest, harness_key: str = Query(..., description="Key of the active harness"), ): """ Run an agent on the codebase with the given prompt. """ if harness_key not in active_harnesses: raise HTTPException(status_code=404, detail=f"Harness {harness_key} not found") harness = active_harnesses[harness_key] try: # Update metadata if provided if request.metadata: harness.metadata.update(request.metadata) # Run the agent result = harness.run_agent(prompt=request.prompt, model=request.model) return { "harness_key": harness_key, "result": result, } except Exception as e: logger.error(f"Agent run failed: {e}") - raise HTTPException(status_code=500, detail=str(e)) + raise HTTPException(status_code=500, detail="Agent execution failed") Suggestion importance[1-10]: 8 __ Why: The suggestion correctly identifies a potential security risk where internal error details (`str(e)`) might be exposed via the API. Replacing it with a generic error message enhances security by preventing information leakage.	Medium
Possible issue	Fix server startup The server is started with a module path string which assumes the module is importable. This can fail if the package isn't installed in development mode. Use the already imported app object from the create_app function instead. codegen-on-oss/codegen_on_oss/cli.py [170-177] # Start the server +app = create_app() uvicorn.run( - "codegen_on_oss.context_server:app", + app, host=host, port=port, reload=reload, log_level=log_level, ) Suggestion importance[1-10]: 6 __ Why: The suggestion correctly points out that using the imported `app` object directly for `uvicorn.run` is more robust than using the string path, especially in development environments or when the package isn't installed editable. The `improved_code` accurately reflects this change.	Low

codecov-ai · 2025-05-01T18:50:53Z

PR Description

This pull request introduces a new suite of tools for comprehensive codebase analysis, context management, and AI agent execution. It aims to enhance the existing codegen-on-oss package by providing capabilities for in-depth repository analysis, snapshotting codebase states, and running AI agents with saved context, all accessible through a REST API.

Click to see more

Key Technical Changes

Key changes include the addition of CodebaseAnalysisHarness for detailed codebase analysis (file statistics, diff generation), CodebaseContextSnapshot for saving and restoring codebase states to local or S3-compatible storage, and a FastAPI-based CodeContextRetrievalServer providing REST endpoints for accessing these functionalities. The CLI is extended with a serve command to launch the server.

Architecture Decisions

The architecture adopts a modular design, separating analysis, snapshotting, and server functionalities into distinct modules. The use of FastAPI enables easy API definition and interaction. The choice of JSON for snapshot storage facilitates portability and readability. The integration with BucketStore allows for flexible storage options (local or S3).

Dependencies and Interactions

This PR introduces new dependencies: fastapi, pydantic, and uvicorn. It interacts with the existing codegen library for codebase parsing and analysis. The BucketStore component interacts with S3-compatible storage if configured. The CodeAgent from codegen is used for agent execution.

Risk Considerations

Potential risks include increased memory usage due to storing active harnesses in the server, lack of authentication/authorization on the API endpoints, and potential race conditions during snapshot creation/loading. Error handling and input validation should be carefully reviewed. The performance of large repository analysis and snapshotting needs to be evaluated.

Notable Implementation Details

The CodebaseAnalysisHarness integrates functionality from the original harness.py. The CodebaseContextSnapshot uses UUIDs for snapshot IDs and supports both local and remote storage. The FastAPI server exposes endpoints for repository analysis, snapshot management, and agent execution. The example script demonstrates basic usage of the new components.

codecov-ai · 2025-05-01T18:50:56Z

codegen-on-oss/codegen_on_oss/analysis/harness_integration.py

+
+class CodebaseAnalysisHarness:
+    """
+    A comprehensive harness for analyzing codebases, generating diffs, and tracking file changes.
+    Integrates core functionality from the original harness.py with enhanced analysis capabilities.
+    """


The CodebaseAnalysisHarness class should implement proper cleanup mechanisms. Consider adding a __del__ method or context manager support (via __enter__ and __exit__) to ensure proper resource cleanup, especially for the codebase object which might hold file handles. This prevents resource leaks in long-running applications.

Suggested change

class CodebaseAnalysisHarness:

"""

A comprehensive harness for analyzing codebases, generating diffs, and tracking file changes.

Integrates core functionality from the original harness.py with enhanced analysis capabilities.

"""

class CodebaseAnalysisHarness:

def __enter__(self):

return self

def __exit__(self, exc_type, exc_val, exc_tb):

if hasattr(self, 'codebase'):

self.codebase.cleanup() # Implement appropriate cleanup

def __del__(self):

if hasattr(self, 'codebase'):

self.codebase.cleanup()

codecov-ai · 2025-05-01T18:50:58Z

codegen-on-oss/codegen_on_oss/analysis/harness_integration.py

+            "file_count": file_count,
+            "file_extensions": file_extensions,
+        }
+
+    def get_diff(self, base: Optional[str] = None) -> str:
+        """
+        Get the diff between the current state and a base commit.
+
+        Args:
+            base: The base commit to compare against (defaults to self.base_commit)
+
+        Returns:
+            A string containing the diff
+        """
+        base_commit = base or self.base_commit
+        if not base_commit:


The get_diff method should have input validation for the base commit. Also, consider adding a size limit for the diff to prevent memory issues with very large diffs. Add error handling for git command failures.

Suggested change

"file_count": file_count,

"file_extensions": file_extensions,

}

def get_diff(self, base: Optional[str] = None) -> str:

"""

Get the diff between the current state and a base commit.

Args:

base: The base commit to compare against (defaults to self.base_commit)

Returns:

A string containing the diff

"""

base_commit = base or self.base_commit

if not base_commit:

def get_diff(self, base: Optional[str] = None, max_diff_size: int = 1024 * 1024) -> str:

base_commit = base or self.base_commit

if not base_commit:

logger.warning("No base commit specified for diff generation")

return ""

try:

diff = self.codebase.get_diff(base=base_commit)

if len(diff) > max_diff_size:

raise ValueError(f"Diff size exceeds maximum allowed size of {max_diff_size} bytes")

return diff

except Exception as e:

logger.error(f"Failed to generate diff: {e}")

raise

codecov-ai · 2025-05-01T18:51:00Z

codegen-on-oss/codegen_on_oss/context_server/server.py

+    prompt: str = Field(..., description="Prompt to send to the agent")
+    model: Optional[str] = Field(None, description="Model to use for the agent")
+    metadata: Optional[Dict[str, Any]] = Field(None, description="Metadata for the agent run")
+
+
+class SnapshotInfo(BaseModel):
+    snapshot_id: str = Field(..., description="ID of the snapshot")
+    created_at: str = Field(..., description="Creation timestamp")
+    repo_info: Dict[str, Any] = Field(..., description="Repository information")
+
+
+# Create FastAPI app
+app = FastAPI(
+    title="Code Context Retrieval Server",
+    description="API for codebase analysis, context management, and agent execution",


The server lacks proper authentication and rate limiting. In a production environment, this could lead to security vulnerabilities and resource exhaustion. Consider adding authentication middleware and rate limiting to protect the API endpoints.

Suggested change

prompt: str = Field(..., description="Prompt to send to the agent")

model: Optional[str] = Field(None, description="Model to use for the agent")

metadata: Optional[Dict[str, Any]] = Field(None, description="Metadata for the agent run")

class SnapshotInfo(BaseModel):

snapshot_id: str = Field(..., description="ID of the snapshot")

created_at: str = Field(..., description="Creation timestamp")

repo_info: Dict[str, Any] = Field(..., description="Repository information")

# Create FastAPI app

app = FastAPI(

title="Code Context Retrieval Server",

description="API for codebase analysis, context management, and agent execution",

from fastapi import FastAPI, HTTPException, Query, Depends

from fastapi.security import APIKeyHeader

from slowapi import Limiter

from slowapi.util import get_remote_address

api_key_header = APIKeyHeader(name="X-API-Key")

limiter = Limiter(key_func=get_remote_address)

app = FastAPI(

title="Code Context Retrieval Server",

description="API for codebase analysis, context management, and agent execution",

version="0.1.0"

)

app.state.limiter = limiter

async def verify_api_key(api_key: str = Depends(api_key_header)):

if not is_valid_api_key(api_key): # Implement your validation logic

raise HTTPException(status_code=403, detail="Invalid API key")

return api_key

codecov-ai · 2025-05-01T18:51:02Z

codegen-on-oss/codegen_on_oss/context_server/server.py

+)
+


The global storage of active harnesses (active_harnesses dict) could lead to memory leaks in a long-running server. Implement a cache eviction strategy and maximum capacity to prevent unbounded memory growth.

Suggested change

)

from cachetools import TTLCache

from datetime import timedelta

# Cache harnesses with a 1-hour TTL and maximum of 100 entries

active_harnesses: TTLCache = TTLCache(

maxsize=100,

ttl=timedelta(hours=1).total_seconds()

)

def cleanup_inactive_harnesses():

"""Manually cleanup inactive harnesses"""

for key in list(active_harnesses.keys()):

if key not in active_harnesses: # Check if already evicted

continue

harness = active_harnesses[key]

if harness.is_inactive(): # Implement inactivity check

del active_harnesses[key]

codecov-ai · 2025-05-01T18:51:03Z

codegen-on-oss/codegen_on_oss/snapshot/context_snapshot.py

+
+
+class CodebaseContextSnapshot:
+    """
+    Allows saving and restoring codebase state and analysis results.
+    Integrates with S3-compatible storage via BucketStore.
+    """


The snapshot functionality doesn't handle concurrent access safely. When using this in a multi-threaded environment (like the FastAPI server), there could be race conditions when creating/loading snapshots. Consider adding proper locking mechanisms.

Suggested change

class CodebaseContextSnapshot:

"""

Allows saving and restoring codebase state and analysis results.

Integrates with S3-compatible storage via BucketStore.

"""

from threading import Lock

class CodebaseContextSnapshot:

_snapshot_locks = {}

_global_lock = Lock()

def __init__(self, ...):

with self._global_lock:

self._snapshot_lock = self._snapshot_locks.setdefault(self.snapshot_id, Lock())

def create_snapshot(self, local_path: Optional[Union[str, Path]] = None) -> str:

with self._snapshot_lock:

# Existing snapshot creation code

pass

Zeeeepa · 2025-05-01T19:19:50Z

/korbit-review

Zeeeepa · 2025-05-01T19:19:53Z

@codegen Implement and upgrade this PR with above Considerations and suggestions from @CodiumAI-Agent, @sourcery-ai, @codecov-ai-reviewer, @coderabbitai, /gemini, /korbit-review

qodo-code-review · 2025-05-01T19:20:14Z

Qodo Merge was enabled for this repository. To continue using it, please link your Git account with your Qodo account here.

Persistent review updated to latest commit 18a0737

gemini-code-assist

Code Review

This PR introduces significant enhancements to the codegen-on-oss package, including codebase analysis, context management, and a REST API server. The changes are well-structured and documented. However, there are some areas where error handling and code clarity could be improved.

Summary of Findings

Git Executable Not Found: The code introduces a GitExecutableNotFoundError exception and attempts to locate the git executable using shutil.which("git"). However, the exception is not handled in the main execution flow, which could lead to unhandled exceptions if git is not found.
Type Hinting: In codebase_context.py, several type hints are missing or could be more specific, especially for class attributes. Adding more precise type hints would improve code readability and maintainability.
Error Handling in get_directory: In codebase_context.py, the get_directory method raises an AssertionError when allow_external is not enabled and the directory is not a subdirectory. Consider using a more specific exception type and providing more context in the error message.
Inconsistent Use of List Comprehensions vs. Loops: In codebase_context.py, there's inconsistent use of list comprehensions and loops. Standardizing the approach would improve code readability.
Potential UnicodeDecodeError: In codebase_context.py, the _process_diff_files method catches UnicodeDecodeError but only logs a warning. Consider adding more robust error handling or skipping the file.

Merge Readiness

The pull request is not quite ready for merging. There are a few issues that need to be addressed before merging, particularly the error handling for the GitExecutableNotFoundError and the AssertionError in get_directory. Additionally, improving the type hinting in codebase_context.py would enhance the code's maintainability. I am unable to directly approve this pull request, and recommend that another reviewer approve this code after the changes have been made.

gemini-code-assist · 2025-05-01T19:20:14Z

codegen-on-oss/codegen_on_oss/analysis/codebase_context.py

        absolute_path = self.to_absolute(directory_path)
        if not self.is_subdir(absolute_path) and not self.config.allow_external:
-            assert False, f"Directory {absolute_path} is not part of repo path {self.repo_path}"
+            raise AssertionError(
+                f"Directory {absolute_path} is not part of repo path {self.repo_path}"
+            )
            return None


Consider using a more specific exception type (e.g., ValueError) and providing more context in the error message. Also, the return None is unreachable due to the AssertionError being raised.

raise ValueError( f"Directory {absolute_path} is not part of repo path {self.repo_path} and external paths are disallowed." )

gemini-code-assist · 2025-05-01T19:20:14Z

codegen-on-oss/codegen_on_oss/analysis/analysis.py

+            if not git_executable:
+                raise GitExecutableNotFoundError()


Consider handling the GitExecutableNotFoundError exception here to prevent the program from crashing if git is not installed.

except GitExecutableNotFoundError: print("Git executable not found. Please ensure git is installed and in your PATH.") return {}

gemini-code-assist · 2025-05-01T19:20:14Z

codegen-on-oss/codegen_on_oss/analysis/codebase_context.py

    transaction_manager: TransactionManager
-    pending_syncs: list[DiffLite]  # Diffs that have been applied to disk, but not the graph (to be used for sync graph)
-    all_syncs: list[DiffLite]  # All diffs that have been applied to the graph (to be used for graph reset)
+    pending_syncs: list[
+        DiffLite
+    ]  # Diffs that have been applied to disk, but not the graph (to be used for sync graph)
+    all_syncs: list[
+        DiffLite
+    ]  # All diffs that have been applied to the graph (to be used for graph reset)


Consider adding type hints for these class attributes to improve code readability and maintainability.

transaction_manager: TransactionManager pending_syncs: list[DiffLite] = field(default_factory=list) # Diffs that have been applied to disk, but not the graph (to be used for sync graph) all_syncs: list[DiffLite] = field(default_factory=list) # All diffs that have been applied to the graph (to be used for graph reset)

gemini-code-assist · 2025-05-01T19:20:14Z

codegen-on-oss/codegen_on_oss/analysis/codebase_context.py

        # Start the dependency manager. This may or may not run asynchronously, depending on the implementation
        if self.dependency_manager is not None:
            # Check if its inital start or a reparse
-            if not self.dependency_manager.ready() and not self.dependency_manager.error():
+            if (
+                not self.dependency_manager.ready()
+                and not self.dependency_manager.error()
+            ):
                # TODO: We do not reparse dependencies during syncs as it is expensive. We should probably add a flag for this
                logger.info("> Starting dependency manager")
                self.dependency_manager.start(async_start=False)


Consider handling the UnicodeDecodeError more robustly. Simply logging a warning might not be sufficient. Perhaps skip the file or use a different encoding.

except UnicodeDecodeError: logger.warning( f"Can't read file at:{filepath} since it contains non-unicode characters. File will be ignored!" ) continue

codiumai-pr-agent-free · 2025-05-01T19:20:15Z

Persistent review updated to latest commit 18a0737

codegen-sh · 2025-05-01T19:20:19Z

I'll review PR #14 "Integrate harness.py functionality into codegen-on-oss package" right away. Let me examine the changes and provide a thorough review.

💻 View my work

gemini-code-assist · 2025-05-01T19:20:26Z

It looks like you're managing a complex review process with multiple bots, which is great for thoroughness! I'm here to help clarify any of my previous suggestions or address new concerns that have arisen. Please let me know if you have any specific questions for me.

codecov-ai · 2025-05-01T19:20:33Z

PR Description

This pull request introduces significant enhancements to the codegen-on-oss package, transforming it into a more comprehensive toolkit for codebase analysis, context management, and AI-driven code modification. The primary goal is to provide a robust and modular system for understanding and manipulating large codebases, facilitating tasks such as automated refactoring, security auditing, and AI-assisted code generation.

Click to see more

Key Technical Changes

Key changes include:

Codebase Analysis: Integration of harness.py functionality into CodebaseAnalysisHarness for detailed codebase metrics, diff generation, and file statistics.
Context Saving and Retrieval: Implementation of CodebaseContextSnapshot for saving and restoring codebase state, with support for both local and S3-compatible storage via BucketStore.
CodeContextRetrievalServer: Introduction of a FastAPI server (server.py) providing REST APIs for repository analysis, snapshot management, and agent execution.
CLI Enhancements: Addition of new CLI commands (run-one, run, serve) for parsing repositories, managing the context server, and performing analysis.

Architecture Decisions

The architecture emphasizes modularity and separation of concerns. The codebase analysis is decoupled from the context management, allowing for independent use and extension. The introduction of a FastAPI server follows a microservices pattern, enabling remote access to codebase analysis and manipulation capabilities. The use of BucketStore provides a flexible and scalable solution for storing codebase snapshots.

Dependencies and Interactions

This pull request introduces new dependencies, including fastapi, uvicorn, and potentially boto3 (if S3 storage is used). The changes interact heavily with the core codegen library, leveraging its codebase parsing and manipulation capabilities. The CodebaseAnalysisHarness integrates directly with the Codebase object, while the CodeContextRetrievalServer provides a higher-level API for interacting with the analysis results and snapshots.

Risk Considerations

Potential risks include:

Performance: The codebase analysis can be resource-intensive, especially for large repositories. The server should be monitored for performance bottlenecks and optimized accordingly.
Security: The FastAPI server exposes codebase analysis and manipulation capabilities over a network. Proper authentication and authorization mechanisms should be implemented to prevent unauthorized access.
Complexity: The introduction of new features and dependencies increases the overall complexity of the system. Thorough testing and documentation are essential to ensure maintainability.

Notable Implementation Details

Notable implementation details include:

The use of rustworkx for graph-based codebase analysis.
The implementation of a custom MetricsProfiler for performance profiling.
The use of Click for building the command-line interface.
The integration with Modal for cloud-based execution (although this is not a new addition, it's a key part of the overall architecture).

korbit-ai

I've completed my review and didn't find any issues... but I did find this horse.

            .''
  ._.-.___.' (`\
 //(        ( `'
'/ )\ ).__. )
' <' `\ ._/'\
   `   \     \

Files scanned

File Path	Reviewed
codegen-on-oss/codegen_on_oss/context_server/server.py	✅
codegen-on-oss/codegen_on_oss/snapshot/context_snapshot.py	✅

Explore our documentation to understand the languages and file types we support and the files we ignore.

Check out our docs on how you can make Korbit work best for you and your team.

Loving Korbit!? Share us on LinkedIn Reddit and X

sourcery-ai

Hey @codegen-sh[bot] - I've reviewed your changes - here's some feedback:

Consider making server dependencies like FastAPI and Uvicorn optional extras for users who only need the core analysis features.
Ensure the core analysis and snapshotting logic remains easily usable independently of the new API server.

Here's what I looked at during the review

🟡 General issues: 4 issues found
🟢 Security: all looks good
🟢 Testing: all looks good
🟡 Complexity: 1 issue found
🟢 Documentation: all looks good

Sourcery is free for open source - if you like our reviews please consider sharing them ✨

_{Help me be more useful! Please click 👍 or 👎 on each comment and I'll use the feedback to improve your reviews.}

sourcery-ai · 2025-05-01T19:22:20Z

codegen-on-oss/codegen_on_oss/context_server/server.py

+
+# Global storage for active harnesses and snapshots
+active_harnesses: dict[str, CodebaseAnalysisHarness] = {}
+bucket_store: BucketStore | None = None


suggestion: Global mutable bucket_store variable usage.

Mutable globals in async contexts can cause race conditions; encapsulate bucket_store or use dependency injection for safer access.

Suggested implementation:

# Global storage for active harnesses and snapshots active_harnesses: dict[str, CodebaseAnalysisHarness] = {}

app = FastAPI( title="Code Context Retrieval Server", description="API for codebase analysis, context management, and agent execution", version="0.1.0", ) @app.on_event("startup") async def startup_event(): # Initialize the bucket_store and attach it to the app state for safe access. app.state.bucket_store = BucketStore()

Ensure that endpoints or functions that previously referenced the bucket_store global now reference it via dependency injection or by accessing it from the request object, for example:

from fastapi import Request @app.get("/some-endpoint") async def some_endpoint(request: Request): bucket_store = request.app.state.bucket_store # Use bucket_store here

Adjust other parts of the code to remove direct dependencies on the global variable.

sourcery-ai · 2025-05-01T19:22:21Z

codegen-on-oss/codegen_on_oss/analysis/harness_integration.py

+        Returns:
+            The result of the agent run
+        """
+        tags = [str(value) for value in self.metadata.values() if value]


question: Dynamic tag generation from metadata.

Check that metadata keys and values don’t generate unexpected or overly verbose tags when passed to the agent.

sourcery-ai · 2025-05-01T19:22:21Z

codegen-on-oss/README.md

+- **Performance Profiling**: Log metrics for each parsing run
+- **Error Logging**: Log errors to help pinpoint parsing failures
+
+For more details on the original functionality, see the "Package Structure" and "Getting Started" sections below.


suggestion: Reference to sections below might be ambiguous.

With sections now titled (e.g., “Original Package Structure”), please update this reference to point to the exact section containing the original functionality details.

Suggested implementation:

For more details on the original functionality, see the "Original Functionality" section above.

If additional references in the document refer to "Package Structure" and "Getting Started" for original functionality details, you may want to update those as well to reduce ambiguity.

sourcery-ai · 2025-05-01T19:22:21Z

codegen-examples/examples/snapshot_event_handler/README.md

 # Event Handler with codebase snapshotting

-This project is designed to using Modal snapshotting to provide parsed codebase instances with minimal latency, make it more manageable to write event based handlers. 
+This project is designed to using Modal snapshotting to provide parsed codebase instances with minimal latency, make it more manageable to write event based handlers.


issue (typo): Grammatical error in sentence.

Replace “designed to using” with “designed to use” or “designed for using.”

Suggested implementation:

This project is designed to use Modal snapshotting to provide parsed codebase instances with minimal latency, making it more manageable to write event based handlers.

The change updates the wording from "designed to using" to "designed to use" and also adjusts "make" to "making" for grammatical consistency.

sourcery-ai · 2025-05-01T19:22:21Z

codegen-on-oss/codegen_on_oss/analysis/codebase_context.py

+from codegen.sdk.codebase.config_parser import (
+    ConfigParser,
+    get_config_parser_for_language,
+)


issue (complexity): Consider keeping simple one-line expressions and type annotations on one line to reduce vertical noise.

Consider keeping simple one‐line expressions and type annotations on one line when they’re not too long. In several places simple function calls or annotations are broken over multiple lines and cause extra vertical noise. For example, instead of:

self.config_parser = get_config_parser_for_language( context.programming_language, self )

use the concise version:

self.config_parser = get_config_parser_for_language(context.programming_language, self)

Similarly, inline conditionals that are simple can remain on one line. Only use multi‐line formatting when the call or expression exceeds your line-length limit or when grouping parameters enhances clarity. This reduces perceived complexity without losing functionality.

sourcery-ai · 2025-05-01T19:22:22Z

codegen-on-oss/codegen_on_oss/analysis/document_functions.py


    # Filter out test and tutorial functions first
-    functions = [f for f in codebase.functions if not any(pattern in f.name.lower() for pattern in ["test", "tutorial"]) and not any(pattern in f.filepath.lower() for pattern in ["test", "tutorial"])]
+    functions = [


issue (code-quality): We've found these issues:

Invert any/all to simplify comparisons [×2] (invert-any-all)

Use named expression to simplify assignment and conditional (use-named-expression)

sourcery-ai · 2025-05-01T19:22:22Z

codegen-on-oss/codegen_on_oss/snapshot/event_handlers.py

+        modal_cls = modal.Cls.from_name(
+            app_name="Events", name="CustomEventHandlersAPI"
+        )


issue (code-quality): Inline variable that is immediately returned (inline-immediately-returned-variable)

sourcery-ai · 2025-05-01T19:22:22Z

codegen-on-oss/codegen_on_oss/snapshot/pr_tasks.py

-            logger.info(f"Found violation: {violation}")
+    # Check for react-dev-overlay imports
+    for i, line in enumerate(lines):
+        line_num = i + 1


suggestion (code-quality): Move assignments closer to their usage (move-assign)

Suggested change

line_num = i + 1

line_num = i + 1

sourcery-ai · 2025-05-01T19:22:22Z

codegen-on-oss/codegen_on_oss/snapshot/pr_tasks.py

+        comment = "## Development Import Violations Found\n\n"
+        comment += "The following files contain imports of development modules that should not be used in production code:\n\n"
+


suggestion (code-quality): Replace assignment and augmented assignment with single assignment (merge-assign-and-aug-assign)

Suggested change

comment = "## Development Import Violations Found\n\n"

comment += "The following files contain imports of development modules that should not be used in production code:\n\n"

comment = (

"## Development Import Violations Found\n\n"

+ "The following files contain imports of development modules that should not be used in production code:\n\n"

)

sourcery-ai · 2025-05-01T19:22:22Z

src/codegen/agents/client/openapi_client/api_client.py

        except ValueError:
            raise rest.ApiException(status=0, reason=(f"Failed to parse `{string}` as datetime object"))


suggestion (code-quality): Explicitly raise from a previous error (raise-from-previous-error)

Suggested change

except ValueError:

raise rest.ApiException(status=0, reason=(f"Failed to parse `{string}` as datetime object"))

except ValueError as e:

raise rest.ApiException(

status=0, reason=(f"Failed to parse `{string}` as datetime object")

) from e

coderabbitai · 2025-05-01T19:22:52Z

Tip

For best results, initiate chat on the files or code changes.

A summary of the edits CodeRabbit can apply:

Will add JSON serialization methods and necessary imports to codegen-on-oss/codegen_on_oss/bucket_store.py

We will enhance server.py by updating imports, adding cleanup tasks, injecting input validation, tracking harness access times, and sanitizing error handling.

Will update cli.py to import and use create_app from context_server.server for starting the server.

Replace the bare raise in run_agent with a chained RuntimeError to preserve exception context in harness_integration.py.

Will enhance the _load_local method in context_snapshot.py to handle JSON parsing errors gracefully.

A plan of each step that CodeRabbit will take in order to execute the edits for each file (with inline rough draft snippets):

✏️ codegen-on-oss/codegen_on_oss/bucket_store.py

Immediately after the existing import of boto3 (after line 5), insert the following imports:

import json
import os
import tempfile
from typing import Any, Dict, List

After the end of the upload_file method (just below line 27), insert the put_json method:

    def put_json(self, data: Any, remote_path: str) -> str:
        """
        Uploads JSON-serializable data to S3 and returns the key.
        """
        local_file = tempfile.NamedTemporaryFile(delete=False, suffix=".json")
        try:
            with open(local_file.name, "w", encoding="utf-8") as f:
                json.dump(data, f)
            return self.upload_file(local_file.name, remote_path)
        finally:
            try:
                os.remove(local_file.name)
            except Exception:
                pass

Immediately after put_json, insert the get_json method:

    def get_json(self, remote_path: str) -> Any:
        """
        Downloads JSON data from S3 and returns the parsed object.
        """
        local_file = tempfile.NamedTemporaryFile(delete=False, suffix=".json")
        try:
            self.s3_client.download_file(
                self.bucket_name,
                remote_path,
                local_file.name,
            )
            with open(local_file.name, "r", encoding="utf-8") as f:
                return json.load(f)
        finally:
            try:
                os.remove(local_file.name)
            except Exception:
                pass

Immediately after get_json, insert the list_keys method:

    def list_keys(self, prefix: str = "") -> List[str]:
        """
        Lists keys in the S3 bucket under the given prefix.
        """
        paginator = self.s3_client.get_paginator("list_objects_v2")
        keys: List[str] = []
        for page in paginator.paginate(Bucket=self.bucket_name, Prefix=f"{self.key_prefix}/{prefix}"):
            for obj in page.get("Contents", []):
                keys.append(obj["Key"])
        return keys

✏️ codegen-on-oss/codegen_on_oss/context_server/server.py

Replace the existing import block (lines 5–15) with the following to include re, asyncio, datetime, typing hints, and Pydantic validators:

import json
import re
import asyncio
from datetime import datetime
from pathlib import Path
from typing import Any, Dict, List, Optional

from fastapi import FastAPI, HTTPException, Query
from loguru import logger
from pydantic import BaseModel, Field, validator

from codegen_on_oss.analysis.harness_integration import CodebaseAnalysisHarness
from codegen_on_oss.bucket_store import BucketStore
from codegen_on_oss.snapshot.context_snapshot import CodebaseContextSnapshot

Immediately after the global declarations on lines 47–48 for active_harnesses and bucket_store, insert:

# Track when harnesses were last accessed and expire after inactivity
last_accessed: Dict[str, datetime] = {}
MAX_INACTIVE_TIME = 3600  # seconds

After the existing @app.on_event("startup") async def startup_event(): block (ending around line 60), insert:

@app.on_event("startup")
async def setup_cleanup_task():
    """Launch background task to remove inactive harnesses."""
    asyncio.create_task(cleanup_inactive_harnesses())

async def cleanup_inactive_harnesses():
    """Periodically purge harnesses idle longer than MAX_INACTIVE_TIME."""
    while True:
        now = datetime.now()
        stale = [
            key
            for key, t in last_accessed.items()
            if (now - t).total_seconds() > MAX_INACTIVE_TIME
        ]
        for key in stale:
            logger.info(f"Removing inactive harness: {key}")
            active_harnesses.pop(key, None)
            last_accessed.pop(key, None)
        await asyncio.sleep(300)  # run every 5 minutes

Within class RepositoryInfo(BaseModel): (just after line 23), add these validators to enforce correct formats:

    @validator("repo_full_name")
    def validate_repo_full_name(cls, v):
        if not re.match(r"^[A-Za-z0-9_.-]+/[A-Za-z0-9_.-]+$", v):
            raise ValueError("repo_full_name must be in format owner/repo")
        return v

    @validator("commit")
    def validate_commit(cls, v):
        if v and not re.match(r"^[0-9a-f]{7,40}$", v):
            raise ValueError("commit must be a valid git hash")
        return v

    @validator("language")
    def validate_language(cls, v):
        allowed = {"python", "javascript", "typescript", "java", "go"}
        low = v.lower()
        if low not in allowed:
            raise ValueError(f"language must be one of: {', '.join(sorted(allowed))}")
        return low

In async def analyze_repository, immediately after storing the harness (line 98):

Insert:

last_accessed[harness_key] = datetime.now()

In the except Exception as e: block, change the raise to:

raise HTTPException(status_code=500, detail="Repository analysis failed") from e

Replace the body of get_file_stats (lines 119–128) with:

if harness_key not in active_harnesses:
    raise HTTPException(status_code=404, detail=f"Harness {harness_key} not found")
last_accessed[harness_key] = datetime.now()
try:
    harness = active_harnesses[harness_key]
    if not harness.analysis_results:
        harness.analyze_codebase()
    return harness.analysis_results.get("file_stats", {})
except Exception as e:
    logger.error(f"Failed to get file stats: {e}")
    raise HTTPException(status_code=500, detail="Failed to get file statistics")

In create_snapshot, after harness = active_harnesses[harness_key] (line 144):

Insert:

last_accessed[harness_key] = datetime.now()

In its except Exception as e: block, change the raise to:

raise HTTPException(status_code=500, detail="Failed to create snapshot") from e

In run_agent, after harness = active_harnesses[harness_key] (line 237):

Insert:

last_accessed[harness_key] = datetime.now()

In its except Exception as e: block, change the raise to:

raise HTTPException(status_code=500, detail="Agent execution failed") from e

✏️ codegen-on-oss/codegen_on_oss/cli.py

Immediately below line 167 (the comment "# Import here to avoid circular imports"), insert:

from codegen_on_oss.context_server.server import create_app

After the newly added import, insert the application factory invocation so that the FastAPI app instance is created. Add:

app = create_app()

Replace the existing uvicorn.run block (lines 170–176) with a call that uses the app instance instead of a module path string. Change:

uvicorn.run(
    "codegen_on_oss.context_server:app",
    host=host,
    port=port,
    reload=reload,
    log_level=log_level,
)

to:

uvicorn.run(
    app,
    host=host,
    port=port,
    reload=reload,
    log_level=log_level,
)

Ensure there are no leftover references to the string-based uvicorn invocation. Remove or comment out any redundant uvicorn.run calls that use module path strings.

Save the file and test the CLI serve command by running python -m codegen_on_oss.cli serve. Verify that the server starts correctly and logs the startup message, confirming create_app was used.

✏️ codegen-on-oss/codegen_on_oss/analysis/harness_integration.py

Replace lines 182–184 with the following block, preserving the existing logger call and indentation (8 spaces):

        except Exception as agent_error:
            logger.error(f"Agent run failed with error: {agent_error}")
            raise RuntimeError("Agent execution failed") from agent_error

✏️ codegen-on-oss/codegen_on_oss/snapshot/context_snapshot.py

Within the _load_local method, replace the existing block at lines 199–201:

if snapshot_path.exists():
    with open(snapshot_path) as f:
        return json.load(f)

with the following try/except to catch and log JSON decode errors, then continue to the next directory:

if snapshot_path.exists():
    try:
        with open(snapshot_path, "r") as f:
            return json.load(f)
    except json.JSONDecodeError as e:
        logger.warning(f"Invalid JSON in snapshot file {snapshot_path}: {e}")
        continue

A summary of the context that CodeRabbit is considering across the codebase:

Verify that json, os, and tempfile are not already imported in bucket_store.py to avoid duplicate imports.

Locate the import block in server.py so we know exactly which lines to replace when adding new imports.

Find the definition of analyze_repository to determine where to inject access‐tracking and sanitize error handling.

We need to locate the create_app function in the context_server module to import it into cli.py.

✅ Create PR with these edits
📋 Get copyable edits
📌 Create commit in current branch
🔄 Try a different approach
⏪ Go back to ordinary chat

codegen-sh · 2025-05-01T19:23:34Z

PR Review: Integrate harness.py functionality into codegen-on-oss package

I've reviewed the changes in this PR that integrate functionality from src/codegen/extensions/swebench/harness.py into the codegen-on-oss package. Overall, this is a well-structured integration that enhances the package with valuable codebase analysis and context management capabilities.

Strengths

Well-structured modularization: The PR effectively transforms functionality from a standalone script into a modular, reusable package structure.
Comprehensive API design: The implementation provides both programmatic access (via classes) and HTTP access (via FastAPI server), offering flexibility in how the functionality can be used.
Enhanced documentation: The README has been significantly improved with detailed information about the new features, installation instructions, and usage examples.
CLI integration: The addition of the serve command to the CLI makes it easy to start the CodeContextRetrievalServer.
Example script: The inclusion of example_usage.py provides a clear demonstration of how to use the new functionality.

Issues and Suggestions

Incomplete BucketStore implementation: The BucketStore class in bucket_store.py only implements the upload_file method, but CodebaseContextSnapshot expects get_json, put_json, and list_keys methods. This inconsistency needs to be resolved.
Missing tests: There are no tests for the new functionality, which makes it difficult to ensure correctness and robustness. Consider adding unit tests for the key components.
Error handling improvements: While there are custom exception classes defined, some error handling could be improved:
- In CodebaseContextSnapshot.load_snapshot(), if both remote and local loading fail, a more specific error message would be helpful.
- In CodeContextRetrievalServer, some endpoints could benefit from more detailed error responses.
Documentation completeness: While the docstrings are generally good, some methods could use more detailed parameter and return value descriptions.
Performance considerations: For large codebases, some operations might be expensive. Consider adding options for limiting analysis scope or implementing pagination for large result sets.

Recommendations

Complete the BucketStore implementation: Add the missing methods (get_json, put_json, list_keys) to the BucketStore class.
Add tests: Create unit tests for the key components, especially focusing on edge cases and error handling.
Enhance error handling: Provide more specific error messages and ensure all exceptions are properly caught and handled.
Consider configuration options: Add configuration options for customizing the behavior of the components, such as analysis depth, storage backends, etc.
Add pagination: For endpoints that might return large amounts of data, consider implementing pagination.

Overall, this PR represents a significant enhancement to the codegen-on-oss package, providing valuable tools for codebase analysis and context management. With the suggested improvements, it will be an even more robust and useful addition to the package.

coderabbitai

Actionable comments posted: 5

♻️ Duplicate comments (6)

codegen-on-oss/codegen_on_oss/context_server/server.py (6)

239-246: ⚠️ Potential issue

Add null check for harness.metadata

If the harness was created without metadata, harness.metadata.update(...) will raise an AttributeError.

Add a null check for harness.metadata:

        # Update metadata if provided
        if request.metadata:
+            if harness.metadata is None:
+                harness.metadata = {}
            harness.metadata.update(request.metadata)

97-99: 🛠️ Refactor suggestion

Store harness with wrapper object for TTL tracking

To work with the proposed cleanup mechanism, update how harnesses are stored:

        # Store the harness for later use
-        active_harnesses[harness_key] = harness
+        active_harnesses[harness_key] = HarnessEntry(harness)

89-104: 🛠️ Refactor suggestion

Use specific exceptions instead of broad exception handling

Using a broad except Exception as e can mask underlying issues. It's better to catch specific exceptions to provide more informative error messages.

Catch specific exceptions:

    try:
        # Create a new harness for the repository
        harness = CodebaseAnalysisHarness.from_repo(
            repo_full_name=repo_info.repo_full_name,
            commit=repo_info.commit,
            language=repo_info.language,
        )

        # Store the harness for later use
        active_harnesses[harness_key] = harness

        # Perform analysis
        results = harness.analyze_codebase()
-    except Exception as e:
+    except ValueError as e:  # For invalid input parameters
+        logger.error(f"Invalid repository parameters: {e}")
+        raise HTTPException(status_code=400, detail=str(e)) from e
+    except FileNotFoundError as e:  # If repository or files can't be found
+        logger.error(f"Repository not found: {e}")
+        raise HTTPException(status_code=404, detail=str(e)) from e
+    except Exception as e:  # Fallback for other exceptions
        logger.error(f"Repository analysis failed: {e}")
        raise HTTPException(status_code=500, detail=str(e)) from e

Apply similar improvements to other exception handling blocks in the file.

46-48: ⚠️ Potential issue

Implement a cleanup mechanism for harnesses

The global storage of active harnesses could lead to memory leaks in a long-running server. There's no mechanism to remove harnesses that are no longer needed.

Implement a cache eviction strategy with a time-to-live (TTL) mechanism:

-# Global storage for active harnesses and snapshots
-active_harnesses: dict[str, CodebaseAnalysisHarness] = {}
+from datetime import datetime, timedelta
+
+# Global storage for active harnesses and snapshots
+class HarnessEntry:
+    def __init__(self, harness: CodebaseAnalysisHarness):
+        self.harness = harness
+        self.last_accessed = datetime.now()
+        
+    def access(self):
+        self.last_accessed = datetime.now()
+        return self.harness
+
+active_harnesses: dict[str, HarnessEntry] = {}
+
+def cleanup_inactive_harnesses():
+    """Remove harnesses that haven't been accessed in the last hour"""
+    now = datetime.now()
+    expired_keys = [
+        key for key, entry in active_harnesses.items()
+        if (now - entry.last_accessed) > timedelta(hours=1)
+    ]
+    for key in expired_keys:
+        logger.info(f"Removing inactive harness: {key}")
+        del active_harnesses[key]

Then, update each access to harnesses:

-    harness = active_harnesses[harness_key]
+    harness = active_harnesses[harness_key].access()

And add a periodic task to run the cleanup:

@app.on_event("startup")
async def startup_event():
    """Initialize resources on server startup."""
    global bucket_store
+    
+    # Schedule periodic cleanup of inactive harnesses
+    from apscheduler.schedulers.asyncio import AsyncIOScheduler
+    scheduler = AsyncIOScheduler()
+    scheduler.add_job(cleanup_inactive_harnesses, 'interval', minutes=15)
+    scheduler.start()
+    logger.info("Scheduled periodic cleanup of inactive harnesses")

40-44: 🛠️ Refactor suggestion

Add authentication and rate limiting

The server lacks proper authentication and rate limiting. In a production environment, this could lead to security vulnerabilities and resource exhaustion.

Consider adding authentication middleware and rate limiting:

-from fastapi import FastAPI, HTTPException, Query
+from fastapi import FastAPI, HTTPException, Query, Depends
+from fastapi.security import APIKeyHeader
+from slowapi import Limiter
+from slowapi.util import get_remote_address

+api_key_header = APIKeyHeader(name="X-API-Key")
+limiter = Limiter(key_func=get_remote_address)

app = FastAPI(
    title="Code Context Retrieval Server",
    description="API for codebase analysis, context management, and agent execution",
    version="0.1.0",
)
+app.state.limiter = limiter

+def verify_api_key(api_key: str = Depends(api_key_header)):
+    # Implement your API key validation logic
+    # This is a simple example, replace with your actual validation
+    valid_api_keys = ["your-secret-api-key"]  # Store securely, e.g., in environment variables
+    if api_key not in valid_api_keys:
+        raise HTTPException(status_code=403, detail="Invalid API key")
+    return api_key

Then, add the authentication and rate limiting to the endpoints:

-@app.post("/analyze/repository", response_model=dict[str, Any])
+@app.post("/analyze/repository", response_model=dict[str, Any], dependencies=[Depends(verify_api_key)])
+@limiter.limit("10/minute")
async def analyze_repository(repo_info: RepositoryInfo):

50-60: ⚠️ Potential issue

Fix BucketStore initialization

The BucketStore is being initialized without a bucket name parameter, but the constructor requires one. This will cause an error when the server starts up.

Consider reading from environment variables:

@app.on_event("startup")
async def startup_event():
    """Initialize resources on server startup."""
    global bucket_store
    try:
-        bucket_store = BucketStore()
+        import os
+        bucket_name = os.environ.get("BUCKET_NAME")
+        if bucket_name:
+            bucket_store = BucketStore(bucket_name=bucket_name)
+            logger.info(f"Initialized bucket store for remote storage using bucket: {bucket_name}")
+        else:
+            logger.warning("BUCKET_NAME environment variable not set")
+            logger.info("Continuing without remote storage capabilities")
        logger.info("Initialized bucket store for remote storage")
    except Exception as e:
        logger.warning(f"Failed to initialize bucket store: {e}")
        logger.info("Continuing without remote storage capabilities")

🧹 Nitpick comments (3)

codegen-on-oss/codegen_on_oss/context_server/server.py (3)

159-204: Simplify snapshot listing logic by extracting helper functions

The /snapshot/list endpoint contains complex nested try-except logic which makes it hard to read and maintain.

Extract helper functions to simplify the logic:

+def load_remote_snapshot(key: str, bucket_store: BucketStore) -> SnapshotInfo | None:
+    """Load a snapshot from remote storage."""
+    try:
+        data = bucket_store.get_json(key)
+        return SnapshotInfo(
+            snapshot_id=data["snapshot_id"],
+            created_at=data["created_at"],
+            repo_info=data["repo_info"],
+        )
+    except Exception as e:
+        logger.warning(f"Failed to load snapshot {key}: {e}")
+        return None
+
+
+def load_local_snapshot(snapshot_file: Path) -> SnapshotInfo | None:
+    """Load a snapshot from local storage."""
+    try:
+        with open(snapshot_file) as f:
+            data = json.load(f)
+            return SnapshotInfo(
+                snapshot_id=data["snapshot_id"],
+                created_at=data["created_at"],
+                repo_info=data["repo_info"],
+            )
+    except Exception as e:
+        logger.warning(f"Failed to load local snapshot {snapshot_file}: {e}")
+        return None


@app.get("/snapshot/list", response_model=list[SnapshotInfo])
async def list_snapshots():
    """
    List all available snapshots.
    """
    snapshots = []

    # List snapshots from bucket store if available
    if bucket_store:
        try:
            keys = bucket_store.list_keys(prefix="snapshots/")
-            for key in keys:
-                try:
-                    data = bucket_store.get_json(key)
-                    snapshots.append(
-                        SnapshotInfo(
-                            snapshot_id=data["snapshot_id"],
-                            created_at=data["created_at"],
-                            repo_info=data["repo_info"],
-                        )
-                    )
-                except Exception as e:
-                    logger.warning(f"Failed to load snapshot {key}: {e}")
+            remote_snapshots = [load_remote_snapshot(key, bucket_store) for key in keys]
+            snapshots.extend([s for s in remote_snapshots if s])
        except Exception as e:
            logger.warning(f"Failed to list snapshots from bucket store: {e}")

    # List local snapshots
    for directory in [Path("./snapshots"), Path("./data/snapshots")]:
        if directory.exists():
-            for snapshot_file in directory.glob("snapshot_*.json"):
-                try:
-                    with open(snapshot_file) as f:
-                        data = json.load(f)
-                        snapshots.append(
-                            SnapshotInfo(
-                                snapshot_id=data["snapshot_id"],
-                                created_at=data["created_at"],
-                                repo_info=data["repo_info"],
-                            )
-                        )
-                except Exception as e:
-                    logger.warning(
-                        f"Failed to load local snapshot {snapshot_file}: {e}"
-                    )
+            local_snapshots = [load_local_snapshot(f) for f in directory.glob("snapshot_*.json")]
+            snapshots.extend([s for s in local_snapshots if s])

    return snapshots

146-152: Improve error handling in snapshot creation

The current error handling is too broad. Catch specific exceptions for better error reporting.

    try:
        snapshot = CodebaseContextSnapshot(harness=harness, bucket_store=bucket_store)
        snapshot_id = snapshot.create_snapshot(local_path=local_path)
-    except Exception as e:
+    except FileNotFoundError as e:
+        logger.error(f"Snapshot creation failed - directory not found: {e}")
+        raise HTTPException(status_code=404, detail=f"Directory not found: {str(e)}") from e
+    except PermissionError as e:
+        logger.error(f"Snapshot creation failed - permission denied: {e}")
+        raise HTTPException(status_code=403, detail="Permission denied writing snapshot") from e
+    except Exception as e:
        logger.error(f"Snapshot creation failed: {e}")
        raise HTTPException(status_code=500, detail=str(e)) from e

212-223: Provide more specific error details in snapshot loading

The current error handling returns a 404 for all types of errors, but some might be server errors:

    try:
        snapshot = CodebaseContextSnapshot(
            snapshot_id=snapshot_id, bucket_store=bucket_store
        )
        data = snapshot.load_snapshot()
-    except Exception as e:
-        logger.error(f"Failed to load snapshot {snapshot_id}: {e}")
-        raise HTTPException(
-            status_code=404, detail=f"Snapshot {snapshot_id} not found"
-        ) from e
+    except FileNotFoundError as e:
+        logger.error(f"Failed to load snapshot {snapshot_id}: {e}")
+        raise HTTPException(
+            status_code=404, detail=f"Snapshot {snapshot_id} not found"
+        ) from e
+    except Exception as e:
+        logger.error(f"Error loading snapshot {snapshot_id}: {e}")
+        raise HTTPException(
+            status_code=500, detail=f"Error loading snapshot: {str(e)}"
+        ) from e
    else:
        return data

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between dd9b5d0 and 18a0737.

📒 Files selected for processing (2)

codegen-on-oss/codegen_on_oss/context_server/server.py (1 hunks)
codegen-on-oss/codegen_on_oss/snapshot/context_snapshot.py (1 hunks)

🚧 Files skipped from review as they are similar to previous changes (1)

codegen-on-oss/codegen_on_oss/snapshot/context_snapshot.py

🧰 Additional context used

🧬 Code Graph Analysis (1)

codegen-on-oss/codegen_on_oss/context_server/server.py (3)

codegen-on-oss/codegen_on_oss/analysis/harness_integration.py (4)

CodebaseAnalysisHarness (15-207)

from_repo (41-69)

analyze_codebase (71-97)

run_agent (156-184)

codegen-on-oss/codegen_on_oss/bucket_store.py (1)

BucketStore (11-26)

codegen-on-oss/codegen_on_oss/snapshot/context_snapshot.py (3)

CodebaseContextSnapshot (47-262)

create_snapshot (75-113)

load_snapshot (115-147)

⏰ Context from checks skipped due to timeout of 90000ms (2)

GitHub Check: Build 3.13 ubuntu-22.04-arm
GitHub Check: Build 3.12 ubuntu-22.04-arm

coderabbitai · 2025-05-01T19:23:35Z

codegen-on-oss/codegen_on_oss/context_server/server.py

+async def analyze_repository(repo_info: RepositoryInfo):
+    """
+    Analyze a repository and return the results.
+
+    Creates a new CodebaseAnalysisHarness for the repository and performs analysis.
+    """
+    harness_key = f"{repo_info.repo_full_name}:{repo_info.commit or 'latest'}"
+


🛠️ Refactor suggestion

Add input validation for repository name and commit hash

There's no validation for repository name format or commit hash, which could lead to security issues like path traversal or command injection.

Add regex-based validation to the RepositoryInfo model:

from pydantic import BaseModel, Field +from pydantic import validator +import re class RepositoryInfo(BaseModel): repo_full_name: str = Field(..., description="Full repository name (owner/repo)") commit: str | None = Field(None, description="Commit hash to analyze") language: str = Field("python", description="Primary language of the repository") + @validator('repo_full_name') + def validate_repo_name(cls, v): + if not re.match(r'^[a-zA-Z0-9_.-]+/[a-zA-Z0-9_.-]+$', v): + raise ValueError('Invalid repository name format. Must be in the format "owner/repo"') + return v + + @validator('commit') + def validate_commit(cls, v): + if v is not None and not re.match(r'^[0-9a-f]{7,40}$', v): + raise ValueError('Invalid commit hash format. Must be a valid Git commit hash') + return v

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

async def analyze_repository(repo_info: RepositoryInfo):

"""

Analyze a repository and return the results.

Creates a new CodebaseAnalysisHarness for the repository and performs analysis.

"""

harness_key = f"{repo_info.repo_full_name}:{repo_info.commit or 'latest'}"

from pydantic import BaseModel, Field

from pydantic import validator

import re

class RepositoryInfo(BaseModel):

repo_full_name: str = Field(..., description="Full repository name (owner/repo)")

commit: str | None = Field(None, description="Commit hash to analyze")

language: str = Field("python", description="Primary language of the repository")

@validator('repo_full_name')

def validate_repo_name(cls, v):

if not re.match(r'^[a-zA-Z0-9_.-]+/[a-zA-Z0-9_.-]+$', v):

raise ValueError('Invalid repository name format. Must be in the format "owner/repo"')

return v

@validator('commit')

def validate_commit(cls, v):

if v is not None and not re.match(r'^[0-9a-f]{7,40}$', v):

raise ValueError('Invalid commit hash format. Must be a valid Git commit hash')

return v

🤖 Prompt for AI Agents (early access)

In codegen-on-oss/codegen_on_oss/context_server/server.py around lines 81 to 88, add regex-based validation to the RepositoryInfo model to ensure the repository name and commit hash conform to expected safe formats. This prevents security risks such as path traversal or command injection by validating inputs before processing. Update the RepositoryInfo class to include regex validators for repo_full_name and commit fields.

coderabbitai · 2025-05-01T19:23:35Z

codegen-on-oss/codegen_on_oss/context_server/server.py

+    harness = active_harnesses[harness_key]
+


🛠️ Refactor suggestion

Update harness access for TTL tracking

Update harness retrieval to work with the proposed TTL tracking:

- harness = active_harnesses[harness_key] + harness = active_harnesses[harness_key].access()

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

harness = active_harnesses[harness_key]

- harness = active_harnesses[harness_key]

+ harness = active_harnesses[harness_key].access()

🤖 Prompt for AI Agents (early access)

In codegen-on-oss/codegen_on_oss/context_server/server.py around lines 122 to 123, update the way the harness is accessed from active_harnesses to integrate TTL tracking. Modify the retrieval logic to ensure that accessing the harness also updates or respects its TTL, such as by refreshing its timestamp or using a method that handles TTL internally, instead of directly indexing the dictionary.

coderabbitai · 2025-05-01T19:23:35Z

codegen-on-oss/codegen_on_oss/context_server/server.py

+    harness = active_harnesses[harness_key]
+


🛠️ Refactor suggestion

Update harness access for TTL tracking

Update harness retrieval to work with the proposed TTL tracking:

- harness = active_harnesses[harness_key] + harness = active_harnesses[harness_key].access()

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

harness = active_harnesses[harness_key]

harness = active_harnesses[harness_key].access()

🤖 Prompt for AI Agents (early access)

In codegen-on-oss/codegen_on_oss/context_server/server.py around lines 237 to 238, the harness is accessed directly from active_harnesses without considering TTL tracking. Update the harness retrieval logic to integrate with the TTL tracking mechanism, ensuring that accessing the harness also refreshes or updates its TTL as per the new design. Modify the code to retrieve the harness in a way that supports TTL updates, such as using a method or function that handles TTL refresh on access.

coderabbitai · 2025-05-01T19:23:35Z

codegen-on-oss/codegen_on_oss/context_server/server.py

+    harness = active_harnesses[harness_key]
+


🛠️ Refactor suggestion

Update harness access for TTL tracking

Update harness retrieval to work with the proposed TTL tracking:

- harness = active_harnesses[harness_key] + harness = active_harnesses[harness_key].access()

Committable suggestion skipped: line range outside the PR's diff.

🤖 Prompt for AI Agents (early access)

In codegen-on-oss/codegen_on_oss/context_server/server.py at lines 144-145, update the way the harness is accessed from active_harnesses to integrate TTL tracking. Modify the retrieval logic to ensure that accessing the harness also refreshes or updates its TTL, so the harness does not expire prematurely. Implement this by replacing the direct dictionary access with a method or mechanism that updates the TTL upon retrieval.

coderabbitai · 2025-05-01T19:23:35Z

codegen-on-oss/codegen_on_oss/context_server/server.py

+async def run_agent(
+    request: AgentRunRequest,
+    harness_key: str = Query(..., description="Key of the active harness"),
+):
+    """
+    Run an agent on the codebase with the given prompt.
+    """
+    if harness_key not in active_harnesses:
+        raise HTTPException(status_code=404, detail=f"Harness {harness_key} not found")
+
+    harness = active_harnesses[harness_key]
+
+    try:
+        # Update metadata if provided
+        if request.metadata:
+            harness.metadata.update(request.metadata)
+
+        # Run the agent
+        result = harness.run_agent(prompt=request.prompt, model=request.model)
+    except Exception as e:
+        logger.error(f"Agent run failed: {e}")
+        raise HTTPException(status_code=500, detail=str(e)) from e
+    else:
+        return {
+            "harness_key": harness_key,
+            "result": result,
+        }


🛠️ Refactor suggestion

Sanitize error messages in API responses

The current implementation exposes raw exception messages in API responses, which might contain sensitive information.

Sanitize the error messages:

+def sanitize_error_message(message: str) -> str: + """Remove potentially sensitive information from error messages.""" + # Remove file paths + sanitized = re.sub(r'(/[^\s:;]*)+', '[PATH]', str(message)) + # Remove URLs + sanitized = re.sub(r'https?://[^\s]+', '[URL]', sanitized) + return sanitized @app.post("/agent/run", response_model=dict[str, Any]) async def run_agent( request: AgentRunRequest, harness_key: str = Query(..., description="Key of the active harness"), ): """ Run an agent on the codebase with the given prompt. """ if harness_key not in active_harnesses: raise HTTPException(status_code=404, detail=f"Harness {harness_key} not found") harness = active_harnesses[harness_key].access() try: # Update metadata if provided if request.metadata: if harness.metadata is None: harness.metadata = {} harness.metadata.update(request.metadata) # Run the agent result = harness.run_agent(prompt=request.prompt, model=request.model) except Exception as e: logger.error(f"Agent run failed: {e}") - raise HTTPException(status_code=500, detail=str(e)) from e + raise HTTPException(status_code=500, detail=sanitize_error_message(str(e))) from e else: return { "harness_key": harness_key, "result": result, }

Apply similar sanitization to other error messages in the API responses throughout the file.

Committable suggestion skipped: line range outside the PR's diff.

🤖 Prompt for AI Agents (early access)

In codegen-on-oss/codegen_on_oss/context_server/server.py around lines 227 to 253, the current error handling exposes raw exception messages in API responses, risking sensitive information leakage. Modify the except block to replace the raw exception message with a sanitized, generic error message before raising the HTTPException. Additionally, review other error handling in this file and apply similar sanitization to all API response error messages to ensure no sensitive details are exposed.

codegen-sh · 2025-05-01T19:24:24Z

BucketStore Implementation Issue

The BucketStore class in codegen-on-oss/codegen_on_oss/bucket_store.py is incomplete. It only implements the upload_file method, but the CodebaseContextSnapshot class expects methods like get_json, put_json, and list_keys.

Here's a suggested implementation to add the missing methods:

import json
from typing import Any

def get_json(self, key: str) -> dict[str, Any]:
    """Get a JSON object from the bucket."""
    response = self.s3_client.get_object(Bucket=self.bucket_name, Key=key)
    return json.loads(response['Body'].read().decode('utf-8'))

def put_json(self, key: str, data: dict[str, Any]) -> None:
    """Put a JSON object in the bucket."""
    self.s3_client.put_object(
        Bucket=self.bucket_name,
        Key=key,
        Body=json.dumps(data).encode('utf-8'),
        ContentType='application/json'
    )

def list_keys(self, prefix: str = "") -> list[str]:
    """List all keys in the bucket with the given prefix."""
    response = self.s3_client.list_objects_v2(
        Bucket=self.bucket_name,
        Prefix=prefix
    )
    return [obj['Key'] for obj in response.get('Contents', [])]

This will ensure compatibility with the CodebaseContextSnapshot class and enable the remote storage functionality to work correctly.

codegen-sh · 2025-05-01T19:24:37Z

Testing Recommendations

I noticed there are no tests for the new functionality added in this PR. Adding tests would help ensure the correctness and robustness of the implementation.

Here are some suggestions for tests that could be added:

Unit tests for CodebaseAnalysisHarness:
- Test initialization with different parameters
- Test the analyze_codebase method
- Test the get_diff and files_in_patch methods
- Test the run_agent method with mocked agent responses
Unit tests for CodebaseContextSnapshot:
- Test snapshot creation and loading
- Test local and remote storage operations (with mocked BucketStore)
- Test error handling for various failure scenarios
Integration tests for CodeContextRetrievalServer:
- Test API endpoints with test client
- Test error handling and response formats
- Test server startup and initialization
End-to-end tests:
- Test the full workflow from analysis to snapshot creation to retrieval
- Test the CLI commands

These tests would help catch any issues early and ensure the functionality works as expected across different scenarios.

codegen-sh · 2025-05-01T19:24:55Z

Error Handling Improvements

While the PR includes custom exception classes and basic error handling, there are a few areas where error handling could be improved:

In CodebaseContextSnapshot.load_snapshot():

# Current implementation
try:
    self.snapshot_data = self._load_local(snapshot_id)
except Exception as e:
    logger.error(f"Failed to load snapshot {snapshot_id}: {e}")
    raise SnapshotLoadError() from e

Consider providing more specific error information:

try:
    self.snapshot_data = self._load_local(snapshot_id)
except Exception as e:
    logger.error(f"Failed to load snapshot {snapshot_id}: {e}")
    raise SnapshotLoadError(f"Failed to load snapshot {snapshot_id} from local storage: {str(e)}") from e

In CodeContextRetrievalServer endpoints:
The error responses could include more detailed information about what went wrong. For example:

# Current implementation
except Exception as e:
    logger.error(f"Repository analysis failed: {e}")
    raise HTTPException(status_code=500, detail=str(e)) from e

Consider categorizing errors and providing more context:

except FileNotFoundError as e:
    logger.error(f"Repository file not found: {e}")
    raise HTTPException(status_code=404, detail=f"Repository file not found: {str(e)}") from e
except PermissionError as e:
    logger.error(f"Permission denied: {e}")
    raise HTTPException(status_code=403, detail=f"Permission denied: {str(e)}") from e
except Exception as e:
    logger.error(f"Repository analysis failed: {e}")
    raise HTTPException(status_code=500, detail=f"Repository analysis failed: {str(e)}") from e

Add validation for user inputs:
Consider adding more validation for user inputs, especially in the API endpoints, to catch errors early and provide clear feedback.

These improvements would make the code more robust and provide better feedback to users when things go wrong.

codegen-sh · 2025-05-01T19:25:08Z

I've completed a thorough review of PR #14 "Integrate harness.py functionality into codegen-on-oss package".

The PR is well-structured and effectively integrates the functionality from harness.py into the codegen-on-oss package, enhancing it with valuable codebase analysis and context management capabilities.

I've left several detailed comments on the PR with specific feedback and suggestions for improvement:

A general review highlighting strengths and areas for improvement
A specific issue with the incomplete BucketStore implementation, including suggested code to fix it
Recommendations for adding tests to ensure correctness and robustness
Suggestions for improving error handling throughout the codebase

Overall, this is a valuable addition to the package that will enhance its capabilities for codebase analysis and context management. With the suggested improvements, it will be even more robust and user-friendly.

Let me know if you'd like me to elaborate on any specific aspect of the review or if you have any questions!

💻 View my work • React 👍 or 👎

Integrate harness.py functionality into codegen-on-oss package

66cd331

gemini-code-assist bot suggested changes May 1, 2025

View reviewed changes

qodo-code-review bot added the Review effort 4/5 label May 1, 2025

codecov-ai bot reviewed May 1, 2025

View reviewed changes

gemini-code-assist bot suggested changes May 1, 2025

View reviewed changes

korbit-ai bot reviewed May 1, 2025

View reviewed changes

sourcery-ai bot approved these changes May 1, 2025

View reviewed changes

coderabbitai bot reviewed May 1, 2025

View reviewed changes

Zeeeepa force-pushed the develop branch 11 times, most recently from 27f0eca to f4656a2 Compare May 8, 2025 04:25

codegen-sh bot force-pushed the develop branch from f4656a2 to c2a4884 Compare May 8, 2025 08:27

codegen-sh bot closed this May 8, 2025

-class CodebaseAnalysisHarness:
-    """
-    A comprehensive harness for analyzing codebases, generating diffs, and tracking file changes.
-    Integrates core functionality from the original harness.py with enhanced analysis capabilities.
-    """
+class CodebaseAnalysisHarness:
+    def __enter__(self):
+        return self
+    def __exit__(self, exc_type, exc_val, exc_tb):
+        if hasattr(self, 'codebase'):
+            self.codebase.cleanup()  # Implement appropriate cleanup
+    def __del__(self):
+        if hasattr(self, 'codebase'):
+            self.codebase.cleanup()

-)
+from cachetools import TTLCache
+from datetime import timedelta
+# Cache harnesses with a 1-hour TTL and maximum of 100 entries
+active_harnesses: TTLCache = TTLCache(
+    maxsize=100,
+    ttl=timedelta(hours=1).total_seconds()
+)
+def cleanup_inactive_harnesses():
+    """Manually cleanup inactive harnesses"""
+    for key in list(active_harnesses.keys()):
+        if key not in active_harnesses:  # Check if already evicted
+            continue
+        harness = active_harnesses[key]
+        if harness.is_inactive():  # Implement inactivity check
+            del active_harnesses[key]

-class CodebaseContextSnapshot:
-    """
-    Allows saving and restoring codebase state and analysis results.
-    Integrates with S3-compatible storage via BucketStore.
-    """
+from threading import Lock
+class CodebaseContextSnapshot:
+    _snapshot_locks = {}
+    _global_lock = Lock()
+    def __init__(self, ...):
+        with self._global_lock:
+            self._snapshot_lock = self._snapshot_locks.setdefault(self.snapshot_id, Lock())
+    def create_snapshot(self, local_path: Optional[Union[str, Path]] = None) -> str:
+        with self._snapshot_lock:
+            # Existing snapshot creation code
+            pass

		comment = "## Development Import Violations Found\n\n"
		comment += "The following files contain imports of development modules that should not be used in production code:\n\n"

		except ValueError:
		raise rest.ApiException(status=0, reason=(f"Failed to parse `{string}` as datetime object"))

-async def analyze_repository(repo_info: RepositoryInfo):
-    """
-    Analyze a repository and return the results.
-    Creates a new CodebaseAnalysisHarness for the repository and performs analysis.
-    """
-    harness_key = f"{repo_info.repo_full_name}:{repo_info.commit or 'latest'}"
+from pydantic import BaseModel, Field
+from pydantic import validator
+import re
+class RepositoryInfo(BaseModel):
+    repo_full_name: str = Field(..., description="Full repository name (owner/repo)")
+    commit: str | None = Field(None, description="Commit hash to analyze")
+    language: str = Field("python", description="Primary language of the repository")
+    @validator('repo_full_name')
+    def validate_repo_name(cls, v):
+        if not re.match(r'^[a-zA-Z0-9_.-]+/[a-zA-Z0-9_.-]+$', v):
+            raise ValueError('Invalid repository name format. Must be in the format "owner/repo"')
+        return v
+    @validator('commit')
+    def validate_commit(cls, v):
+        if v is not None and not re.match(r'^[0-9a-f]{7,40}$', v):
+            raise ValueError('Invalid commit hash format. Must be a valid Git commit hash')
+        return v

	harness = active_harnesses[harness_key]
	- harness = active_harnesses[harness_key]
	+ harness = active_harnesses[harness_key].access()

	harness = active_harnesses[harness_key]
	harness = active_harnesses[harness_key].access()

Integrate harness.py functionality into codegen-on-oss package #14

Integrate harness.py functionality into codegen-on-oss package #14

Uh oh!

Conversation

codegen-sh bot commented May 1, 2025 • edited by sourcery-ai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes Made

1. Created New Modules

2. Updated Package Structure

3. Documentation

Key Features

How to Use

Summary by Sourcery

Summary by CodeRabbit

Uh oh!

Zeeeepa commented May 1, 2025

Uh oh!

korbit-ai bot commented May 1, 2025

Uh oh!

sourcery-ai bot commented May 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reviewer's Guide

File-Level Changes

Interacting with Sourcery

Customizing Your Experience

Getting Help

Uh oh!

Zeeeepa commented May 1, 2025

Uh oh!

coderabbitai bot commented May 1, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Poem

Chat

Support

CodeRabbit Commands (Invoked using PR comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Documentation and Community

Uh oh!

Zeeeepa commented May 1, 2025

Uh oh!

Zeeeepa commented May 1, 2025

Uh oh!

codecov-ai bot commented May 1, 2025

Uh oh!

Zeeeepa commented May 1, 2025 • edited by codegen-sh bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codegen-sh bot commented May 1, 2025

Uh oh!

coderabbitai bot commented May 1, 2025

Uh oh!

Zeeeepa commented May 1, 2025

Uh oh!

Zeeeepa commented May 1, 2025

Uh oh!

Zeeeepa commented May 1, 2025

Uh oh!

Zeeeepa commented May 1, 2025

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Summary of Findings

Merge Readiness

Uh oh!

gemini-code-assist bot May 1, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot May 1, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot May 1, 2025

Choose a reason for hiding this comment

Uh oh!

gemini-code-assist bot May 1, 2025

Choose a reason for hiding this comment

codegen-sh bot commented May 1, 2025 •

edited by sourcery-ai bot

Loading

sourcery-ai bot commented May 1, 2025 •

edited

Loading

coderabbitai bot commented May 1, 2025 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)

Zeeeepa commented May 1, 2025 •

edited by codegen-sh bot

Loading

codiumai-pr-agent-free bot commented May 1, 2025 •

edited

Loading

(Review updated until commit `18a0737`)

qodo-code-review bot commented May 1, 2025 •

edited by codiumai-pr-agent-free bot

Loading