ADR: Per-User API Keys for Multi-Tenant SaaS Support by StuMason · Pull Request #7 · StuMason/polar-flow-server

StuMason · 2026-01-12T15:25:35Z

Summary

This PR proposes an architecture decision for implementing per-user API keys to support multi-tenant SaaS deployments.

Problem

Currently, a single API key grants access to ALL users' data. If compromised in a SaaS context, it's a total data breach.

Solution

Each user gets their own API key, scoped to only their data:

Key generated on OAuth completion
Returned to calling app (Laravel) for storage
All API calls validated against user_id in path
Compromised key = single user affected

Key Changes Proposed

Add api_key_hash, api_key_prefix columns to User model
Generate API key during OAuth callback
New auth guard that validates key ownership
Backward compatible with existing single API key mode

Test plan

Review ADR for architectural soundness
Identify any security concerns
Validate backward compatibility approach
Confirm implementation plan is complete

🤖 Generated with Claude Code

Proposes architecture for multi-tenant SaaS support where each user gets their own scoped API key instead of sharing a single key. Key points: - Per-user API keys scoped to only that user's data - API key generated on OAuth completion - Backward compatible with existing single API key mode - Enables secure Laravel/SaaS integration Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Major changes based on review bot analysis: Security Fixes: - Replace URL-based API key delivery with two-step code exchange - Remove user ID from key prefix to prevent enumeration attacks - Add rate limiting as Phase 1 (was future enhancement) Architecture Clarifications: - Extend existing APIKey model instead of adding User fields - Document relationship between service-level and user-scoped keys - Clarify self-hosted vs SaaS deployment modes Implementation Improvements: - Reorder phases: Database → OAuth → Auth → Key Management → Admin - Add comprehensive testing phase with security tests - Add key revocation endpoint (separate from regeneration) - Document webhook authentication strategy Documentation: - Add detailed OAuth flow sequence diagram - Answer all open questions from review - Add security considerations section - Include rate limit response headers Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Phase 1-4 of ADR-001 implementation: Database & Models: - Extended APIKey model with user_id FK, rate limiting fields - Added TempAuthCode model for secure OAuth code exchange - Created Alembic migration for new columns and tables - Added relationship between User and APIKey models API Key Generation: - New api_keys.py utility with secure key generation - SHA-256 hashing with constant-time comparison - Rate limiting logic with sliding window Secure OAuth Flow (Two-Step Code Exchange): - New /api/v1/oauth/start endpoint for SaaS clients - OAuth callback generates temp code, not API key - /api/v1/oauth/exchange for server-to-server key retrieval - Temp codes expire in 5 minutes, single-use Per-User Authentication Guard: - New per_user_api_key_guard with rate limit enforcement - User-scoped keys can only access their own data - Service-level keys (user_id=NULL) have full access - Rate limit headers in responses (X-RateLimit-*) Key Management Endpoints: - POST /users/{id}/api-key/regenerate - POST /users/{id}/api-key/revoke - GET /users/{id}/api-key/info - GET /users/{id}/status Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

The CSRF config was excluding paths that don't exist (/api/v1/...). Updated to exclude the actual paths: - /oauth/ - OAuth endpoints for SaaS (callback, exchange, start) - /users/ - API routes that use API key auth, not sessions Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

StuMason · 2026-01-12T16:01:30Z

@claude please rereview

claude · 2026-01-12T16:01:47Z

Claude finished @StuMason's task in 2m 0s —— View job

Re-review Complete: Per-User API Keys

I've completed a comprehensive re-review of this PR. Overall, the implementation is well-architected and production-ready with strong security foundations. Here's my detailed analysis:

✅ Strengths

1. Excellent ADR Documentation

The ADR is comprehensive, well-structured, and clearly articulates the problem, decision, and consequences
The two-step OAuth flow (temp code → API key exchange) is a critical security improvement that prevents API keys from appearing in URLs
Good use of diagrams and code examples

2. Solid Security Design

✅ SHA-256 hashing for API keys
✅ Constant-time comparison to prevent timing attacks (secrets.compare_digest in auth.py:254)
✅ Rate limiting implemented at the key level
✅ Proper foreign key cascades (CASCADE on user deletion)
✅ Single-use temporary codes with 5-minute expiration
✅ Key prefix for identification without exposing full key

3. Clean Database Schema

Migration in alembic/versions/c4d5e6f7g8h9_add_per_user_api_keys.py is well-structured
Proper indexes on key_hash, user_id, key_prefix, is_active
Server defaults for rate limiting columns ensure safe deployment
Good separation: APIKey model extended, TempAuthCode model added

4. Backward Compatibility

Existing API_KEY env var mode preserved (auth.py:194-197, 254-256)
Service-level keys (user_id=NULL) maintain full access pattern
Self-hosted deployments unaffected

⚠️ Issues Requiring Attention

CRITICAL: Missing Per-User Guard on Data Endpoints

The existing data endpoints (data_router, sleep_router) still use the old api_key_guard (data.py:573, sleep.py:144) instead of per_user_api_key_guard.

Security Impact:

User-scoped keys won't have their scope validated
Rate limiting won't be enforced
The core security benefit of this PR is not applied to the main data endpoints

Evidence:

# src/polar_flow_server/api/data.py:571-573
data_router = Router(
    path="/api/v1",
    guards=[api_key_guard],  # ❌ Should be per_user_api_key_guard

# src/polar_flow_server/api/sleep.py:142-144
sleep_router = Router(
    path="/api/v1",
    guards=[api_key_guard],  # ❌ Should be per_user_api_key_guard

Recommendation:
Replace api_key_guard with per_user_api_key_guard on all user data routers. The new guard already handles backward compatibility (it checks config-based keys first at auth.py:254-256).

Fix data router guard →

Fix sleep router guard →

HIGH: Duplicate Key Generation Logic

There are two implementations of API key generation:

core/api_keys.py:41-64 - New, preferred implementation
core/auth.py:46-57 - Legacy implementation with different prefix (pfs_ vs pfk_)

Issues:

Keys generated via auth.py:generate_api_key() won't match the pfk_ prefix convention
Inconsistent key length (64 hex chars vs 40 hex chars)
Potential confusion for maintainers

Current Usage:

✅ New OAuth flow uses core/api_keys.py (correct)
❓ Legacy auth.py function appears unused but still present

Recommendation:
Remove or deprecate the legacy generate_api_key() function in auth.py:46-57. If needed for backward compatibility, make it call the new implementation.

Remove duplicate function →

MEDIUM: Rate Limit Headers Missing from Response

The per_user_api_key_guard stores rate limit info in connection.state[RATE_LIMIT_STATE_KEY] (auth.py:275) and provides get_rate_limit_headers() helper (auth.py:306-323), but no response middleware adds these headers.

Impact:

Clients can't see their rate limit status
Unclear when they'll be reset
Reduces API usability

Recommendation:
Add a response middleware or use Litestar's after_response hook to inject rate limit headers on all authenticated endpoints.

Example:

@litestar.after_response
async def add_rate_limit_headers(response: Response, connection: ASGIConnection) -> Response:
    headers = get_rate_limit_headers(connection)
    response.headers.update(headers)
    return response

Add rate limit headers middleware →

MEDIUM: In-Memory OAuth State Storage

The SaaS OAuth flow stores state in a module-level dict: _saas_oauth_states (keys.py:40).

Issues:

Not horizontally scalable - won't work with multiple server instances
Lost on restart - in-flight OAuth flows will fail
Memory leak potential - expired states cleaned up only during new OAuth requests (keys.py:140-144)

Recommendation for Production:

Use Redis or database for state storage in SaaS mode
Add periodic cleanup job for expired states
Document this limitation in the ADR

Acceptable for MVP: This is fine for single-instance deployments and early SaaS testing, but should be flagged as a known limitation.

LOW: Missing Test Coverage

Based on file changes, there are no new tests for:

Per-user API key validation
Rate limiting logic
Temp code exchange flow
User scope enforcement
CSRF attacks on OAuth state

Recommendation:
Add integration tests covering at minimum:

User-scoped key can only access own data (✅ pass)
User-scoped key cannot access other user's data (❌ 403)
Rate limiting works correctly (✅ succeed → ❌ 429 → ✅ succeed after reset)
Temp code is single-use (❌ second exchange fails)
Expired temp code cannot be exchanged

🔍 Code Quality Observations

Good Patterns:

Clean separation of concerns (core/api_keys.py vs core/auth.py)
Proper use of NamedTuple for return values (api_keys.py:26-39)
Good use of TYPE_CHECKING to avoid circular imports (api_key.py:6, user.py:11)
Relationships properly configured with back_populates and cascade

Minor Suggestions:

api_keys.py:366 - Type annotation CursorResult[tuple[()]] is a workaround for mypy. Consider adding a # type: ignore comment explaining why.
keys.py:40 - Consider moving _saas_oauth_states to a class or singleton for better testability
auth.py:85 - The noqa: E712 comment is needed because SQLAlchemy requires == True instead of is True. Good to document this.

📊 Implementation Completeness

Checking against ADR Phase checklist:

Phase	Status	Notes
Phase 1: Database & Rate Limiting	✅ Complete	Migration, models, rate limiting implemented
Phase 2: OAuth Integration	✅ Complete	Temp codes, exchange endpoint working
Phase 3: Authentication Guard	⚠️ Partial	Guard implemented but not applied to data endpoints
Phase 4: Key Management	✅ Complete	Regenerate, revoke, status endpoints present
Phase 5: Admin Dashboard	❓ Unknown	Not reviewed in this PR
Phase 6: Testing	❌ Missing	No test files in changeset
Phase 7: Documentation	✅ Complete	Excellent ADR

🎯 Summary & Recommendation

Overall Assessment: This is high-quality work with strong architectural foundations. The ADR is exemplary, the security design is sound, and the code is clean.

Critical Path to Merge:

✅ MUST FIX: Apply per_user_api_key_guard to data/sleep routers
⚠️ SHOULD FIX: Remove duplicate key generation in auth.py
⚠️ SHOULD FIX: Add rate limit headers to responses
📝 NICE TO HAVE: Add integration tests
📝 NICE TO HAVE: Document OAuth state storage limitation

Verdict: Approve after addressing item #1 (critical guard replacement). Items #2-3 can be follow-up PRs if needed, but #1 is essential for the security model to work correctly.

CRITICAL: Replace api_key_guard with per_user_api_key_guard on all user data routers (data, sleep, sync) to enforce: - Per-user access restrictions (user keys can only access own data) - Rate limiting on all endpoints - Proper key validation Also removes legacy generate_api_key() function from auth.py that was superseded by core/api_keys.py implementation. Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

This comment was marked as outdated.

Sign in to view

StuMason and others added 4 commits January 12, 2026 15:34

style: Fix ruff formatting issues

820763e

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

StuMason merged commit 67dd8f9 into main Jan 12, 2026
5 checks passed

StuMason deleted the feature/per-user-api-keys branch January 12, 2026 16:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ADR: Per-User API Keys for Multi-Tenant SaaS Support#7

ADR: Per-User API Keys for Multi-Tenant SaaS Support#7
StuMason merged 6 commits intomainfrom
feature/per-user-api-keys

StuMason commented Jan 12, 2026

Uh oh!

This comment was marked as outdated.

StuMason commented Jan 12, 2026

Uh oh!

claude bot commented Jan 12, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

StuMason commented Jan 12, 2026

Summary

Problem

Solution

Key Changes Proposed

Test plan

Uh oh!

This comment was marked as outdated.

StuMason commented Jan 12, 2026

Uh oh!

claude bot commented Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Re-review Complete: Per-User API Keys

✅ Strengths

1. Excellent ADR Documentation

2. Solid Security Design

3. Clean Database Schema

4. Backward Compatibility

⚠️ Issues Requiring Attention

CRITICAL: Missing Per-User Guard on Data Endpoints

HIGH: Duplicate Key Generation Logic

MEDIUM: Rate Limit Headers Missing from Response

MEDIUM: In-Memory OAuth State Storage

LOW: Missing Test Coverage

🔍 Code Quality Observations

Good Patterns:

Minor Suggestions:

📊 Implementation Completeness

🎯 Summary & Recommendation

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

claude bot commented Jan 12, 2026 •

edited

Loading