feat(backend): add SQLAlchemy infrastructure for database operations #11419

Swiftyos · 2025-11-20T09:36:54Z

Summary

Adds SQLAlchemy infrastructure to the backend as foundation for incrementally replacing Prisma for runtime database operations, while maintaining Prisma for migration generation.

Changes

Core Infrastructure

backend/data/sqlalchemy.py: Async SQLAlchemy engine with connection pooling
- Engine creation with QueuePool (10 persistent + 5 overflow connections)
- Session factory for dependency injection
- get_session() FastAPI dependency
- Lifecycle management (initialize(), dispose())
backend/data/sqlalchemy_test.py: Comprehensive test suite
- URL conversion, schema extraction, engine creation
- Session factory and dependency injection tests
- All tests passing ✅

Configuration

backend/util/settings.py: SQLAlchemy settings
- Pool size, overflow, timeouts
- Echo mode for debugging
backend/.env.default: Default environment variables

Service Integration

backend/executor/database.py: DatabaseManager lifespan
backend/server/rest_api.py: AgentServer lifespan

Both services now initialize SQLAlchemy on startup and dispose on shutdown.

Dependencies

pyproject.toml: Added sqlalchemy[asyncio] and asyncpg

Technical Details

Connection Pool:

10 persistent connections per service
5 overflow connections
30s pool timeout
Pre-ping enabled

Schema Handling:

Extracts from existing DATABASE_URL
Sets via search_path parameter
Compatible with Prisma configuration

Session Lifecycle:

Automatic transaction management
Commit on success, rollback on error
Connection returned to pool after use

Migration Approach

This PR establishes infrastructure only. Both Prisma and SQLAlchemy will coexist during incremental migration:

✅ Infrastructure (this PR)
Next: Proof of concept with new features
Then: Systematic table migration
Finally: Remove Prisma runtime usage

Testing

poetry run pytest backend/backend/data/sqlalchemy_test.py -xvs

All tests passing with coverage of:

URL conversion and schema extraction
Engine and session factory creation
Dependency injection lifecycle
Error handling and rollback

Breaking Changes

None - purely additive. Prisma continues to work unchanged.

Checklist 📋

For code changes:

I have clearly listed my changes in the PR description
I have made a test plan
I have tested my changes according to the test plan:
- I have added tests for the new functionality

netlify · 2025-11-20T09:37:00Z

✅ Deploy Preview for auto-gpt-docs-dev canceled.

Name	Link
🔨 Latest commit	`39839a5`
🔍 Latest deploy log	https://app.netlify.com/projects/auto-gpt-docs-dev/deploys/6920c304a7314d00087de5df

coderabbitai · 2025-11-20T09:37:01Z

Important

Review skipped

Auto reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

✨ Finishing touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch swiftyos/sqlalchemy-plumbing

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

netlify · 2025-11-20T09:37:08Z

✅ Deploy Preview for auto-gpt-docs canceled.

Name	Link
🔨 Latest commit	`39839a5`
🔍 Latest deploy log	https://app.netlify.com/projects/auto-gpt-docs/deploys/6920c30410b6290008fdeabf

qodo-merge-pro · 2025-11-20T09:37:29Z

PR Reviewer Guide 🔍

Here are some key observations to aid the review process:

⏱️ Estimated effort to review: 3 🔵🔵🔵⚪⚪
🧪 PR contains tests
🔒 No security concerns identified
⚡ Recommended focus areas for review Possible Misuse Using QueuePool with create_async_engine may be unnecessary or problematic since async engines manage pooling internally via the underlying driver; verify that specifying QueuePool is supported and won’t lead to unexpected behavior with asyncpg. # Connection pool configuration poolclass=QueuePool, # Standard connection pool pool_size=config.sqlalchemy_pool_size, # Persistent connections max_overflow=config.sqlalchemy_max_overflow, # Burst capacity pool_timeout=config.sqlalchemy_pool_timeout, # Wait time for connection pool_pre_ping=True, # Validate connections before use # Async configuration Transaction Semantics The FastAPI dependency commits after yielding regardless of whether any writes occurred; this can surprise callers that expect explicit commit control. Consider scoping transactions explicitly or documenting that each request auto-commits and ensure read-only routes don’t incur unnecessary commits. # Create session (borrows connection from pool) async with _session_factory() as session: try: yield session # Inject into route handler or context manager # If we get here, route succeeded - commit any pending changes await session.commit() except Exception: # Error occurred - rollback transaction URL Sanitization Regex-based stripping of schema query params may miss edge cases (ordering, URL encoding, additional params). Consider parsing via urllib.parse to robustly remove only schema while preserving other parameters. async_url = prisma_url.replace("postgresql://", "postgresql+asyncpg://") # Remove schema parameter (we'll handle via MetaData) async_url = re.sub(r"\?schema=\w+", "", async_url) # Remove any remaining query parameters that might conflict async_url = re.sub(r"&schema=\w+", "", async_url) return async_url

deepsource-io · 2025-11-20T09:38:15Z

Here's the code health analysis summary for commits 0edc669..39839a5. View details on DeepSource ↗.

Analysis Summary

Analyzer	Status	Summary	Link
JavaScript	✅ Success		View Check ↗
Python	✅ Success	❗ 20 occurences introduced 🎯 2 occurences resolved	View Check ↗

💡 If you’re a repository administrator, you can configure the quality gates from the settings.

AutoGPT-Agent · 2025-11-20T10:11:15Z

Thank you for this well-structured PR that adds SQLAlchemy infrastructure to the backend. The code looks well-designed with comprehensive test coverage and clear documentation.

A few items to address before merging:

Missing checklist: Your PR is missing the required checklist. Even though this is primarily infrastructure code, we still need the checklist filled out. You can mark the testing sections as completed with your test plan since you've clearly tested the SQLAlchemy integration.
Configuration design: The SQLAlchemy configuration in settings.py looks good, but should we add some comments about reasonable values for these settings in different environments (dev/test/prod)?
Documentation: While you mentioned SQLAlchemy_INTEGRATION.md, I don't see it in the diff. Make sure this documentation is included to help other developers understand the migration plan.
Error handling: The error handling in the lifespan hooks looks good, but consider adding more specific error types in your exception handlers where possible for better debugging.

Overall, this is a well-structured foundation for the gradual migration from Prisma to SQLAlchemy. Once you address the checklist issue, this should be ready for merging.

qodo-merge-pro · 2025-11-20T10:59:08Z