Add support for postgres schema selection #373

sirudog · 2026-01-09T09:07:29Z

Currently the schema is hardcoded to 'public'.
This PR adds support for postgres schema selection by using the options parameter in the connection string, which is the proper Postgres-native way.

The connection string would look like (before URL encoding):
postgresql://user:pass@host:port/database?options=-c search_path=custom_schema

How it works:

Native Postgres handling - psycopg2/libpq processes the options parameter directly, so search_path is set automatically on connection
Parse for metadata queries - Extract search_path value from the options parameter to use in information_schema queries
Default to public - If no options or no search_path specified, fall back to 'public'

Why this approach:

Minimal changes - Only postgres_loader.py needs modification
Backward compatible - Defaults to 'public' if not specified
User-friendly - Follows familiar URL parameter pattern
Postgres-specific - Doesn't affect MySQL or other loaders

Summary by CodeRabbit

New Features
- Added support for specifying PostgreSQL schemas via connection URLs and a new schema input field in the database connection modal.
- Users can now target specific schemas instead of defaulting to "public."
Documentation
- Updated PostgreSQL loader documentation with schema configuration guide, examples, and troubleshooting steps.
Tests
- Added comprehensive test suite for schema parsing functionality covering edge cases and various input formats.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

coderabbitai · 2026-01-09T09:07:36Z

Important

Review skipped

Auto reviews are disabled on base/target branches other than the default branch.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

📝 Walkthrough

Walkthrough

The changes add PostgreSQL schema support throughout the application stack. A new schema parsing method extracts the default schema from connection URLs, updated extraction methods accept and respect a schema parameter, the UI provides schema input for PostgreSQL connections, and comprehensive tests and documentation cover the new functionality.

Changes

Cohort / File(s)	Summary
PostgreSQL Loader - Schema Parsing & Extraction `api/loaders/postgres_loader.py`	Introduces `_parse_schema_from_url()` static method to extract schema from connection URLs with support for `options=-csearch_path` parameter. All extraction methods (`extract_tables_info`, `extract_columns_info`, `extract_foreign_keys`, `extract_relationships`) now accept `schema` parameter (default `'public'`) and constrain queries to specified schema. Load flow now derives schema from URL and threads it through extraction calls.
User Interface - Schema Configuration `app/src/components/modals/DatabaseModal.tsx`	Adds schema state and PostgreSQL-specific UI input field. When manual connection mode is used with PostgreSQL, appends schema as URL option parameter (`options=-csearch_path%3D<schema>`) to connection string. Schema state resets after successful connection.
Tests & Documentation `tests/test_postgres_loader.py`, `docs/postgres_loader.md`	New comprehensive test suite (`TestParseSchemaFromUrl`) with 15 test cases covering URL parsing edge cases (empty options, multiple schemas, quoted names, case-insensitivity). Documentation adds Custom Schema Support section with configuration format, examples, permission requirements, URL encoding notes, and troubleshooting guidance for schema configuration verification.
Documentation Format `README.md`	Adds closing code fence (```) after streaming example code block.

Sequence Diagram

sequenceDiagram
    participant User as User
    participant Modal as DatabaseModal
    participant URL as URL Constructor
    participant Loader as PostgresLoader
    participant Parser as _parse_schema_from_url
    participant DB as PostgreSQL

    User->>Modal: Select PostgreSQL + Input Schema
    Modal->>URL: Append schema to connection URL
    URL-->>Modal: Connection URL with options=-csearch_path=<schema>
    Modal->>Loader: load(prefix, connection_url)
    Loader->>Parser: _parse_schema_from_url(connection_url)
    Parser->>Parser: Extract schema from options parameter
    Parser-->>Loader: schema (or 'public' default)
    Loader->>DB: Connect & extract_tables_info(cursor, schema)
    Loader->>DB: extract_columns_info(cursor, table, schema)
    Loader->>DB: extract_foreign_keys(cursor, table, schema)
    Loader->>DB: extract_relationships(cursor, schema)
    DB-->>Loader: Filtered schema-specific metadata
    Loader-->>Modal: Schema-constrained database structure

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~22 minutes

Poem

🐰 A rabbit hops through schemas with care,
Parsing URLs with URLs to spare,
From frontend form to database query,
Schema support—no need to worry,
Tests verify each tunneled path,
PostgreSQL bends to our rabbit's wrath!

🚥 Pre-merge checks | ✅ 3

✅ Passed checks (3 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title 'Add support for postgres schema selection' directly and clearly describes the main objective of the pull request—enabling schema selection in PostgreSQL, which is the core functionality across all changes.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (4)

app/src/components/modals/DatabaseModal.tsx (1)

98-103: URL encoding approach is correct.

The schema option construction follows PostgreSQL's options parameter format. The use of hardcoded %3D with encodeURIComponent for the schema value ensures proper encoding.

One minor consideration: schema names with spaces or special characters might need additional validation or quoting. PostgreSQL schema names containing spaces require double quotes, but this is an edge case rarely encountered in practice.

docs/postgres_loader.md (1)

277-289: Reconsider documenting private method usage.

The verification example calls PostgresLoader._parse_schema_from_url(), which is a private method (indicated by the leading underscore). Documenting private methods in user-facing documentation can create maintenance issues if the internal implementation changes.

Consider one of these alternatives:

Make the method public if it's intended for external use

Document verification through connection testing instead

Remove this verification section if it's primarily for internal debugging

api/loaders/postgres_loader.py (1)

100-147: Schema parsing logic is generally sound with minor edge case.

The implementation correctly handles most common scenarios including URL-encoded formats, case-insensitive matching, comma-separated schemas, and the $user special variable.

Minor edge case: The regex pattern ([^\s]+) at line 128 will not correctly capture quoted schema names containing spaces (e.g., search_path="my schema"). This is an uncommon edge case since PostgreSQL schema names with spaces are rare, but it's worth noting.

Regarding the static analysis hints: The broad exception handler at line 145 is reasonable for a parser that should gracefully degrade, though it could optionally log parse failures for debugging. The return statement structure (TRY300 hint) is a style preference and functionally equivalent to using an else block.

tests/test_postgres_loader.py (1)

153-239: LGTM! Comprehensive test coverage for schema parsing.

The test suite thoroughly covers the main scenarios including defaults, format variations, special cases ($user), quoted values, case insensitivity, and error handling. The tests validate the expected behavior of _parse_schema_from_url across a wide range of inputs.

Optional enhancement: Consider adding a test case for quoted schemas containing spaces (e.g., search_path="my schema"), which would verify handling of the edge case where PostgreSQL schema names include spaces. This is uncommon in practice but would complete the test coverage.

📜 Review details

Configuration used: defaults

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 1e82c19 and ef85f13.

📒 Files selected for processing (5)

README.md
api/loaders/postgres_loader.py
app/src/components/modals/DatabaseModal.tsx
docs/postgres_loader.md
tests/test_postgres_loader.py

🧰 Additional context used

📓 Path-based instructions (4)

app/**/*.{ts,tsx}

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

Frontend source changes in app/**/*.{ts,tsx} must be compiled before production runs

Files:

app/src/components/modals/DatabaseModal.tsx

README.md

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

When changing MCP wiring or adding integrations, update README.md with usage and configuration details

Files:

README.md

**/*.py

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

All Python code should pass pylint (use make lint)

Files:

api/loaders/postgres_loader.py
tests/test_postgres_loader.py

tests/test_*.py

📄 CodeRabbit inference engine (.github/copilot-instructions.md)

Unit test files should be named tests/test_*.py and runnable with pytest

Files:

tests/test_postgres_loader.py

🧬 Code graph analysis (2)

api/loaders/postgres_loader.py (2)

api/loaders/base_loader.py (1)

load (12-22)

api/loaders/mysql_loader.py (5)

load (155-202)

extract_tables_info (205-252)

extract_relationships (372-411)

extract_columns_info (255-332)

extract_foreign_keys (335-369)

tests/test_postgres_loader.py (1)

api/loaders/postgres_loader.py (1)

_parse_schema_from_url (101-147)

🪛 LanguageTool

docs/postgres_loader.md

[style] ~269-~269: This adverb was used twice in the sentence. Consider removing one of them or replacing them with a synonym.
Context: ...ted - Ensure the schema name is spelled correctly (schema names are case-sensitive) 4. **...

(ADVERB_REPETITION_PREMIUM)

🪛 Ruff (0.14.10)

api/loaders/postgres_loader.py

143-143: Consider moving this statement to an else block

(TRY300)

145-145: Do not catch blind exception: Exception

(BLE001)

🔇 Additional comments (13)

README.md (1)

219-219: LGTM! Formatting fix.

The closing code fence properly completes the Python streaming example code block.

app/src/components/modals/DatabaseModal.tsx (3)

31-31: LGTM! Schema state follows existing patterns.

The state initialization is consistent with other form fields in the component.

185-185: LGTM! Schema state reset follows existing pattern.

Consistent with other form field resets after successful connection.

394-412: LGTM! Schema UI field is well-implemented.

The conditional rendering, optional labeling, placeholder, helper text, and test ID all follow best practices and provide clear UX guidance.

docs/postgres_loader.md (3)

94-123: LGTM! Comprehensive schema configuration documentation.

The Custom Schema Configuration section accurately describes the feature with clear examples, URL encoding notes, default behavior, permission requirements, and UI guidance. The PostgreSQL documentation reference is helpful.

256-256: LGTM! Limitation accurately described.

The updated limitation correctly states that extraction is limited to one schema at a time, with clear indication of the default and configuration method.

266-276: LGTM! Enhanced troubleshooting guidance.

The expanded troubleshooting section provides actionable guidance for schema-related issues, including verification steps and permission checks with correct PostgreSQL GRANT syntax.

api/loaders/postgres_loader.py (6)

8-8: LGTM! Appropriate imports for URL parsing.

The urllib.parse imports are standard library components suitable for extracting schema from connection URLs.

150-201: LGTM! Schema integration in load() is well-implemented.

The schema is correctly parsed before connection and propagated to extraction methods. The connection URL is passed unchanged to psycopg2, allowing native handling of the options parameter, while the parsed schema constrains metadata queries. The updated docstring provides helpful usage guidance.

204-258: LGTM! extract_tables_info correctly filters by schema.

The method signature with default parameter maintains backward compatibility. The SQL query properly filters both the main query and the comment subquery by schema. Schema propagation to column and foreign key extraction is correct.

260-350: LGTM! extract_columns_info properly filters by schema.

The method signature maintains backward compatibility. All SQL subqueries and the main query correctly filter by schema, including the pg_namespace join and the PRIMARY KEY/FOREIGN KEY lookups.

352-392: LGTM! extract_foreign_keys correctly scoped to schema.

The method signature with default parameter and the SQL query properly constrain foreign key extraction to the specified schema while maintaining backward compatibility.

394-442: LGTM! extract_relationships properly scoped to schema.

The method filters all foreign key relationships by the specified schema. The default parameter ensures backward compatibility. The SQL query correctly constrains the results to the target schema.

galshubeli · 2026-01-12T08:16:08Z

@sirudog Thanks for the contribution

galshubeli · 2026-01-12T09:49:29Z

app/src/components/modals/DatabaseModal.tsx

+        // Append schema option for PostgreSQL if provided
+        if (selectedDatabase === 'postgresql' && schema.trim()) {
+          const schemaOption = `options=-csearch_path%3D${encodeURIComponent(schema.trim())}`;
+          dbUrl += (dbUrl.includes('?') ? '&' : '?') + schemaOption;


better to use:
const url = new URL(dbUrl);
url.searchParams.set('options', -csearch_path=${schema.trim()});
dbUrl = url.toString();

Indeed, fixed

galshubeli · 2026-01-12T09:51:14Z

@sirudog I left single comment. Please fix the spellcheck

…ring

sirudog · 2026-01-13T14:15:03Z

@galshubeli I'll revisit the spell checks... There are still a few :-(

sirudog · 2026-01-14T08:33:33Z

@galshubeli I'll revisit the spell checks... There are still a few :-(

@galshubeli, it is done, I hope I did not miss any this time :-)

Add support for postgres schema selection

85e4697

Merge branch 'main' into select_postgres_schema

ef85f13

sirudog marked this pull request as ready for review January 9, 2026 09:14

coderabbitai bot reviewed Jan 9, 2026

View reviewed changes

gkorland requested a review from galshubeli January 11, 2026 06:45

galshubeli changed the base branch from main to staging January 12, 2026 09:02

galshubeli reviewed Jan 12, 2026

View reviewed changes

sirudog and others added 3 commits January 13, 2026 13:57

Use URL.searchParams.set() for adding the schema to the connection st…

142bedc

…ring

Fix postgres_loader.md spelling issues

ad6397d

Merge branch 'staging' into select_postgres_schema

fe5e04c

sirudog requested a review from galshubeli January 13, 2026 14:06

Fix remaining postgres_loader.md spelling issues

9620535

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for postgres schema selection #373

Add support for postgres schema selection #373

Uh oh!

sirudog commented Jan 9, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Jan 9, 2026 •

edited

Loading

Review skipped

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Poem

Uh oh!

coderabbitai bot left a comment

Uh oh!

galshubeli commented Jan 12, 2026

Uh oh!

galshubeli Jan 12, 2026

Uh oh!

sirudog Jan 13, 2026

Uh oh!

galshubeli commented Jan 12, 2026

Uh oh!

sirudog commented Jan 13, 2026 •

edited

Loading

Uh oh!

sirudog commented Jan 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add support for postgres schema selection #373

Are you sure you want to change the base?

Add support for postgres schema selection #373

Uh oh!

Conversation

sirudog commented Jan 9, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Jan 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Sequence Diagram

Estimated code review effort

Poem

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

galshubeli commented Jan 12, 2026

Uh oh!

galshubeli Jan 12, 2026

Choose a reason for hiding this comment

Uh oh!

sirudog Jan 13, 2026

Choose a reason for hiding this comment

Uh oh!

galshubeli commented Jan 12, 2026

Uh oh!

sirudog commented Jan 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sirudog commented Jan 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sirudog commented Jan 9, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Jan 9, 2026 •

edited

Loading

sirudog commented Jan 13, 2026 •

edited

Loading